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Preface to the Sixth English Edition 


This sixth English edition is based on the fifth English edition (2007) and corresponds to the improved 
seventh (2008), eighth (2012) and ninth (2013) German edition. It contains all the chapters of the 
mentioned editions, but in a renewed, revised and extended form (see also the preface to the fifth English 
edition). 


Special new parts to be mentioned here are such supplementary sections as Geometric and Coordinate 
Transformations and Plain Projections in Chapter on Geometry, as Quaternions and Applications in 
Chapter on Linear Algebra, as Lie Groups and Lie Algebras in Chapter on Algebra and Discrete Math- 
ematics and as Matlab in Chapter on Numerical Analysis. The Chapter on Computer Algebra Systems 
is restricted to Mathematica only as a representative example for such systems. 


Extended and revised paragraphs are given in Chapter on Geometry about Cardan and Euler angles 
and about Coordinate transformations, in Chapter on Integral Calculus about Applications of Definite 
Integrals, in Chapter on Optimization about Evaluation Strategies, in Chapter on Tables about Natural 
Constants and Physical Units (System SI). The Index has been completed to an extent as in the previous 
German editions. 


We would like to cordially thank all readers and professional colleagues who helped us with their valu- 
able statements, remarks and suggestions on the German editions of the book during the revision pro- 
cess. Special thanks go to Mrs. Professor Dr. Gabriella Szép (Budapest), who made this English 
edition possible by valuable contributions and the basic translation into the English. Furthermore our 
thanks go to all co-authors for the critical treatment of their chapters. 


Dresden, December 2014 


Prof. Dr. GERHARD MUSIOL Prof. Dr. HEINER MUHLIG 


Preface to the Fifth English Edition 


This fifth edition is based on the fourth English edition (2003) and corresponds to the improved sixth 
German edition (2005). It contains all the chapters of the both mentioned editions, but in a renewed 
revised and extended form. 


So in the work at hand, the classical areas of Engineering Mathematics required for current practice are 
presented, such as “Arithmetic”, “Functions”, “Geometry”, “Linear Algebra”, “Algebra and Discrete 
Mathematics”, (including “Logic”, “Set Theory”, “Classical Algebraic Structures”, “Finite Fields”, 
“Elementary Number Theory”, ” Cryptology”, “Universal Algebra” , “Boolean Algebra and Switch Al- 
gebra”, “Algorithms of Graph Theory”, “Fuzzy Logic”), “Differentiation” , “Integral Calculus”, “Dif- 
ferential Equations”, “Calculus of Variations”, “Linear Integral Equations”, “Functional Analysis” , 
“Vector Analysis and Vector Fields”, “Function Theory”, “Integral Transformations”, “Probability 
Theory and Mathematical Statistics” . 


Fields of mathematics that have gained importance with regards to the increasing mathematical mod- 
eling and penetration of technical and scientific processes also receive special attention. Included 
amongst these chapters are “Stochastic Processes and Stochastic Chains” as well as “Calculus of Er- 
rors”, “Dynamical Systems and Chaos”, “Optimization”, “Numerical Analysis”, “Using the Com- 
puter” and “Computer Algebra Systems". 


The Chapter 21 containing a large number of useful tables for practical work has been completed by 


VI 





| adding tables with the physical units of the International System of Units (SI). 
Dresden, February 2007 


Prof. Dr. GERHARD MUSIOL Prof. Dr. HEINER MUHLIG 


From the Preface to the Fourth English Edition 


The “Handbook of Mathematics” by the mathematician, I. N. BRONSHTEIN and the engineer, K. A. 
SEMENDYAYEV was designed for engineers and students of technical universities. It appeared for the 
first time in Russian and was widely distributed both as a reference book and as a text book for colleges 
and universities. It was later translated into German and the many editions have made it a permanent 
fixture in German-speaking countries, where generations of engineers, natural scientists and others in 
technical training or already working with applications of mathematics have used it. 

On behalf of the publishing house Harri Deutsch, a revision and a substantially enlarged edition was 
prepared in 1992 by Gerhard Musiol and Heiner Mühlig, with the goal of giving " Bronshtein" the mod- 
ern practical coverage requested by numerous students, university teachers and practitioners. The 
original style successfully used by the authors has been maintained. It can be characterized as “short, 
easily understandable, comfortable to use, but featuring mathematical accuracy (at a level of detail 
consistent with the needs of engineers)”*. Since 2000, the revised and extended fifth German edition of 
the revision has been on the market. Acknowledging the success that “BRONSTEIN” has experienced 
in the German-speaking countries, Springer Verlag Heidelberg/Germany is publishing a fourth English 
edition, which corresponds to the improved and extended fifth German edition. 

The book is enhanced with over a thousand complementary illustrations and many tables. Special 
functions, series expansions, indefinite, definite and elliptic integrals as well as integral transformations 
and statistical distributions are supplied in an extensive appendix of tables. 


In order to make the reference book more effective, clarity and fast access through a clear structure 
were the goals, especially through visual clues as well as by a detailed technical index and colored tabs. 


An extended bibliography also directs users to further resources. 


Special thanks go to Mrs. Professor Dr. Gabriella Szép (Budapest), who made this English debut ver- 
sion possible. 
Dresden, June 2003 


Prof. Dr. GERHARD MUSIOL Prof. Dr. HEINER MUHLIG 
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1 Arithmetics 


1.1 Elementary Rules for Calculations 


1.1.1 Numbers 
1.1.1.1 Natural, Integer, and Rational Numbers 


1. Definitions and Notation 
The positive and negative integers, fractions, and zero together are called the rational numbers. In 
relation to these the following notations are used (see 5.2.1, 1., p. 327): 


e Set of natural numbers: N={0, 1, 2, 3,...}, 

e Set of integers: Z= {...,—2,-1,0,1,2,...}, 

e Set of rational numbers: Q= {z|x = P with pEZ, qEZ and q #0}. 
q 


The notion of natural numbers arose from enumeration and ordering. The natural numbers are also 
called the non-negative integers. 

2. Properties of the Set of Rational Numbers 

e The set of rational numbers is infinite. 

e Theset is ordered, i.e., for any two different given numbers a and b one can tell which is the smaller 
one. 

e  Theset is dense everywhere, i.e., between any two different rational numbers a and b (a « b) there 
is at least one rational number c (a < c « b). Consequently, there is an infinite number of other rational 
numbers between any two different rational numbers. 

3. Arithmetical Operations 

The arithmetical operations (addition, subtraction, multiplication and division) can be performed with 
any two rational numbers, and the result is a rational number. The only exception is division by zero, 
which is not possible: The operation written in the form a : 0 is meaningless because it does not have 
any result: Ifa A 0, then there is no rational number b such that b-0 — a could be fulfilled, and if a — 0 
then b can be any of the rational numbers. The frequently occurring formula a : 0 — oo (infinity) does 
not mean that the division is possible; it is only the notation for the statement: If the denominator 
approaches zero and, e.g., the numerator does not, then the absolute value (magnitude) of the quotient 
exceeds any finite limit. 

4. Decimal Fractions, Continued Fractions 

Every rational number a can be represented as a terminating or periodically infinite decimal fraction 
or as a finite continued fraction (see 1.1.1.4, p. 3). 

5. Geometric Representation 

Fixing an origin the zero point 0, a positive direction the orientation, and the unit of length l the 
measuring rule, (see also 2.17.1, p. 115 and (Fig. 1.1)), then every rational number a corresponds to 
a certain point on this line. This point has the coordinate a, and it is a so-called rational point. The 
line is called the numerical axis. Because the set of rational numbers is dense everywhere, between two 
rational points there are infinitely many further rational points. 





11 3 8 \ 
3-1 -2 -1 0 15 2 3 3 A 1K 
φο d 9 * 9 9 Lnd » 0 1 2 3 
1-1 τ 
Figure 1.1 Figure 1.2 
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1.1.1.2 Irrational and Transcendental Numbers 
The set of rational numbers is not satisfactory for calculus. Even though it is dense everywhere, it does 
not cover the whole numerical axis. If for example the diagonal AB of the unit square rotates around 
A so that B goes into the point K, then K does not have any rational coordinate (Fig. 1.2). 
The introduction of irrational numbers allows to assign a number to every point of the numerical axis. 
In textbooks there are given exact definitions for irrational numbers, e.g., by nests of intervals. For this 
survey it is enough to note that the irrational numbers take all the non-rational points of the numerical 
axis and every irrational number corresponds to a point of the axis, and that every irrational number 
can be represented as a non-periodic infinite decimal fraction. 
First of all, the non-integer real roots of the algebraic equation 

x" pa, ax" ess aux dao — 0 (n 1, integer; integer coefficients), (1.1a) 
belong to the irrational numbers. These roots are called algebraic irrationals. 
W A: The simplest examples of algebraic irrationals are the real roots of z^ — a = 0 (a > 0), as 
numbers of the form 4/a , if they are not rational. 
B B: 72=1.414..., W10 = 2.154... are algebraic irrationals. 
The irrational numbers which are not algebraic irrationals are called transcendental. 
W A: 7 —3.141592... e — 2.718281... are transcendental numbers. 
E B: The decimal logarithm of the integers, except the numbers of the form 10”, are transcendental. 
The non-integer roots of the quadratic equation 

xz +ayx +a9 =0 (ay, a9 integers) (1.1b) 
are called quadratic irrationals. They have the form (a + bVD)/c (a,b,c integers, c # 0; D > 0, 
square-free number). 
Wi The division of a line segment a in the ratio of the golden section x/a = (a — x)/a (see 3.5.2.3, 3., 
p. 194) leads to the quadratic equation z? -- z — 1 — 0, ifa — 1. The solution z — (V5 — 1)/2isa 


quadratic irrational. It contains the irrational number v5. 


1.1.1.3 Real Numbers 


Rational and irrational numbers together form the set of real numbers, which is denoted by R. 


1. Most Important Properties 

The set of real numbers has the following important properties (see also 1.1.1.1, 2., p. 1). It is: 

e Infinite. 

ο Ordered. 

e Dense everywhere. 

e Closed, i.e., every point of the numerical axis corresponds to a real number. This statement does 
not hold for the rational numbers. 


2. Arithmetical Operations 

Arithmetical operations can be performed with any two real numbers and the result is a real number, 
too. The only exception is division by zero (see 1.1.1.1, 3., p. 1). Raising to a power and also its 
inverse operation can be performed among real numbers; so it is possible to take an arbitrary root of 
any positive number; every positive real number has a logarithm for an arbitrary positive basis, except 
that 1 cannot be a basis. 

A further generalization of the notion of numbers leads us to the concept of complex numbers (see 1.5, 
p. 34). 

3. Interval of Numbers 

A connected set of real numbers with endpoints a and b is called an interval of numbers with endpoints 
a and b, where a < b and a is allowed to be —oo and b is allowed to be 4-oo. If the endpoint itself does 
not belong to the interval, then this end of the interval is open, in the opposite case it is closed. 
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An interval is given by its endpoints a and b, putting them in braces: A bracket for a closed end of the 
interval and a parenthesis for an open one. It is to be distinguished between open intervals (a, b), half- 
open (half-closed) intervals [a, b) or (a, b] and closed intervals [a,b], according to whether none of the 
endpoints, one of the endpoints or both endpoints belong to it, respectively. Frequently the notation 
Ja, b[ instead of (a, b) for open intervals, and analogously [a, b| instead of [a, 0) is used. In the case of 
graphical representations, in this book the open end of the interval is denoted by a round arrow head, 
the closed one by a filled point. 


1.1.1.4 Continued Fractions 


Continued fractions are nested fractions, by which rational and irrational numbers can be represented 
and approximated even better than by decimal representation (see 19.8.1.1, p. 1002 and Il A and MI 
B on p. 4). 


























1. Rational Numbers p 1 
The continued fraction of a rational number is fi- q — 803r 1 . 03) 
nite. Positiverational numbers which are greater a+ T 
than 1 have the form (1.2). For abbreviation ag + —— ———1 - 
the symbol ᾽ = [a9; 41, 02,..., a4] is used with eee 
ay, > 1 (kK =1,2,...,n). τ 
The numbers a,x are calculated with the help of the Euclidean algorithm: 
Pad ΠΠ (1.35) 
ᾳ ᾳ q 
B ue (0« 3 «1), (1.3b) 
TY T Ti 
P eas (0< <1), (1.3c) 
T2 T2 Το 
AR πι a (0 ees 1) , (1.34) 
Tn—1 Tn—1 Tn—1 
. = an (Tr+1 = 0). (1.3e) 
1 1 1 
n x ez -24 ΕΞ 21 Τ (2:3,1,6]. 
1c 
T 6 


2. Irrational Numbers 


Continued fractions of irrational numbers do not break off. They are called infinite continued fractions 
with [ao; a1, a2, . . .]. 

If some numbers a, are repeated in an infinite continued fraction, then this fraction is called a periodic 
continued fraction or recurring chain fraction. Every periodic continued fraction represents a quadratic 
irrationality, and conversely, every quadratic irrationality has a representation in the form of a periodic 
continued fraction. 

Bl The number 2 — 1.4142135...is a quadratic irrationality and it has the periodic continued fraction 
representation V2 — [1;2,2,2,...]. 
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3. Aproximation of Real Numbers 
If a = [ao; à1, a2, . ..] is an arbitrary real number, then every finite continued fraction 
p 
ap = [a0; Q1, 2, ..., ax] = q (1.4) 
represents an approximation of a. The continued fraction a, is called the k-th approximant of a. It 
can be calculated by the recursive formula 
Pk ακρκ--ι T Pk—2 
Og m T mo — (k21; pa = Lpo-— aoi q-1 — 0, qo — 1). (1.5) 
dk QkQk—1 t Qk—2 
According to the Liouville approximation theorem, the following estimat holds: 
1 
la — ox| — le- «-ς (1.6) 
dio 





Furthermore, it can be shown that the approximants approach the real number a with increasing ac- 
curacy alternatively from above and from below. The approximants converge to o especially fast if the 
numbers a; (i = 1,2,..., k) in (1.4) have large values. Consequently, the convergence is worst for the 
numbers [1; 1, 1,. . .]. 

W A: From the decimal presentation of x the continued fraction representation m = [3; 7, 15, 1, 292, .. .] 
follows with the help of (1.3a)—(1.3e). The corresponding approximants (1.5) with the τ accord- 


22 1 
ing to (1.6) are: o4 — * with "i — a| < a” 2.102,09 — . with |r — a2| < . e 9 - 10 5, 


with |r — a3| < e 8-10 7. The actual errors are much smaller. They are less than 


355 

113 1 ux 
1.3: 10? for o4, 8.4- 107? for a; and 2.7- 1077 for a3. The approximants o4, o2 and o represent better 
approximations for 7 than the decimal representation with the corresponding number of digits. 

BB: The formula of the golden section z/a — (a — z)/z (see 1.1.1.2, p. 2, 3.5.2.3, 3., p. 194 and 
17.3. ᾽ 4, 4., p. 908) í can be represented by the following two continued fractions: x = a[1; 1, 1, ...] and 


Q3 — 


50 + v5) = 50 + [2;4,4,4,...]). The approximant a, delivers in the first case an accuracy of 
0.018. a, in the ο. case of 0.000 001 a. 


1.1.1.5 Commensurability 


Two numbers a and b are called commensurable, i.e., measurable by the same number, if both are an 
integer multiple of a third number c. From a = mc, b = nc (m,n € Z) it follows that 


; = (x rational). (1.7) 


Otherwise a and b are incommensurable. 

W A: The length of a side and a diagonal of a square are incommensurable because their ratio is the 
irrational number V2. 

E B: The lengths of the golden section are incommensurable, because their ratio contains the irra- 
tional number v5 (see 1.1.1.2, p. 2 and 3.5.2.3, 3.,p. 194). Therefore the sides and diagonals in a 
regular pentagon are incommensurable (see W in 3.1.5.3, p. 139). Today Hippasos from Metapontum 
(450 BC) is considered to have discovered the irrational numbers via this example. 


1.1.2 Methods for Proof 
Mostly three types of proofs are used: 
e direct proof, 

e indirect proof, 
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e proof by (mathematical or arithmetical) induction. 
Furthermore there are constructive proofs. 


1.1.2.1 Direct Proof 


The starting point is a theorem which has already been proven (premise p) and the truth of the state- 
ment of the new theorem is derived from it (conclusion q). The logical steps mostly used for the con- 
clusions are implication and equivalence (see 5.1, p. 323). 

1. Direct Proof by Implication 

The implication p > q means that the truth of the conclusion follows from the truth of the premise 
(see “ Implication ” in the truth table, 5.1.1, p. 323). 


, +b 
E Prove the inequality DE > Vab fora > 0, b > 0. The premise is the well-known binomial formula 


(a+b)? = a?+2ab+0?. By subtracting 4ab follows (a+b)? — 4ab = (a—b)? > 0. From this inequality 
the statement is obtained certainly if the investigations are restricted only to the positive square roots 
because of a > 0 and b > 0. 

2. Direct Proof by Equivalence 

The proof will be delivered by verifying an equivalent statement. In practice it means that all the 
arithmetical operations which have to be used for changing p into q must be uniquely invertible. 





1 
E Prove the inequality 1+a+a?+---+a"< 1 forÜü «acl. 


—a 
Multiplying by 1 — a yields 1 — a + a — a? + a? — à? x --- p a^ — ant! — 1 — ant! < 1. 

This last inequality is true because of the assumption 0 « a"*! — 1. The starting inequality also holds 
because all the arithmetical operations to be used are uniquely invertible. 


1.1.2.2 Indirect Proof or Proof by Contradiction 


To prove the statement q: Starting from its negation q, and from q arriving at a false statement r, i.e., 
q — r (see also 5.1.1, 7., p. 325). In this case q must be false, because using the implication a false 
assumption can result only in a false conclusion (see truth table 5.1.1, p. 323). If q is false q must be 
true. 








a 
E Prove that the number V2 is irrational. Suppose, V2 is rational. So the equality v2 = 5 holds for 


some integers a, b and b Z 0. Assuming that the numbers a,b are coprime numbers, i.e., they do not 
2 
-- a A 
have any common divisor, then follows (v2)? = 2 = po a? = 2b*, therefore, a? is an even number, 


and this is possible only if a — 2n is an even number. Deducing a? — 4n? — 20? holds, hence b must be 
an even number, too. It is obviously a contradiction to the assumption that a and b are coprimes. 


1.1.2.3 Mathematical Induction 


Theorems and dependent on natural numbers n are proven with this method. The principle of math- 
ematical induction is the following: If the statement is valid for a natural number no, and if from the 
validity of the statement for a natural number n > no the validity of the statement follows for n + 1, 
then the statement is valid for every natural number n > no. According to these, the steps of the proof 
are: 

1. Basis of the Induction: The truth of the statement is to be shown for n = no. Mostly no = 1 can 
be choosen. 

2. Induction Hypothesis: The statement is valid for an integer n (premise p). 

3. Induction Conclusion: Formulation the proposition for n + 1 (conclusion q). 

4. Proof of the Implication: p > q. 

Steps 3. and 4. together are called the induction step or logical deduction from n ton + 1. 
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1 1 1 n 
W Prove the f la Sn = | t Heet = : 
ipd DM TC AN CO ο n(n+1) n+l 
The steps of the proof by induction are: 
1 1 : =i bviously t 
.n21:582—z- s obviously true. 
n $171.57, 38 obviously tru 
1 1 T 1 n 
2. 9 je S | | την = holds for an n > 1. 
uppose s TE sx mat πορτα 4, boldsforann 2 
F ο n+1 
3. Supposing 2. it is to show: 54,41 — 5° 
n+ 
1 1 1 1 1 1 
4. The proof: 54,1 — i την | — Sud = 
Ne PIO ferc ntu $54 tamt) arnt MaDe 
n 1 n? -2n 1 (n 4-1? n+1 








n+1 n (n--1)(n-2) (n-1(n-2) (πι1-1}(υ 153) n-2' 
1.1.2.4 Constructive Proof 


In approximation theory, for instance, the proof of an existence theorem usually follows a constructive 
process, i.e., the steps of the proof give a method of calculation for a result which satisfies the proposi- 
tions of the existence theorem. 

W The existence of a third-degree interpolation-spline function (see 19.7.1.1, 1., p. 996) can be proved 
in the following way: It is to be shown that the calculation of the coefficients of a spline satisfying 
the requirements of the existence theorem results in a tridiagonal linear equation system, which has a 
unique solution (see 19.7.1.1, 2., p. 997). 


1.1.3 Sums and Products 
1.1.3.1 Sums 
1. Definition 


To briefly denote a sum the summation sign Y is used: 


n 
a +ag+...+4n = >> ag. (1.8) 
k=1 
With this notation the sum of n summands ap (k = 1,2,...,n) is denoted, k is called the running 


index or summation variable. 
2. Rules of Calculation 
1. Sum of Summands Equal to Each Other , i.e., a, = a for k = 1,2,...,m: 


SO a, = na. (1.9a) 
k=1 

2. Multiplication by a Constant Factor 
Σ Cap = Cc Σ αμ. (1.98) 
k=1 k=1 

3. Separating aSum 


Soar = at X αἱ (L<m<n). (1.96) 
k=1 k=1 


k=m+1 
4. Addition of Sums with the Same Length 


n 


So (an + be + ce +...) = Soar + Vode + Voce t+.... (1.9d) 
k=1 k=1 


k=1 k=1 


1.1 Elementary Rules for Calculations 7 





5. Renumbering 


n m+n—1 n n—m-H 
Σ ak = » Qk-m+1; »» ak = Σ Ak+m-—l- (1.9e) 
k=1 k=m k=m k=l 


6. Exchange the Order of Summation in Double Sums 


Y (= D =. (Σ D (1.91) 


i=1 k=1 \i=1 


1.1.3.2 Products 


1. Definition 
The abbreviated notation for a product is the product sign |]: 


n 
αιαο... ἂπ = " αγ. (1.10) 
k=1 
With this notation a product of n factors ay (k = 1,2,...,m) is denoted, where k: is called the running 
index. 


2. Rules of Calculation 
1. Product of Coincident Factors , i.e., a, = a for k = 1,2,...,n: 


I[ «^. (1.11a) 
k=1 


2. Factoring out a Constant Factor 


n 


II(ceu) 2 & II Ök: (1.11b) 
k=1 


k=1 


3. Separating into Partial Products 


II dy = ( ο) II ak (1 «€ m « n). (1.11c) 
k=1 k=1 k=m-+1 


4. Product of Products 
II eos... = (i «) (Tre) (i α) ο (1.11d) 
k=1 k=1 k=1 k=1 


5. Renumbering 


n m+n—1 n n—m-Hl 
Ila = II arme IIDe- II eese (1.11e) 
k=1 k=m k=m k=l 


6. Exchange the Order of Multiplication in Double Products 


it (ft) = fi (ito). iis 


i= k=1 \i=1 


1.1.4 Powers, Roots, and Logarithms 
1.1.4.1 Powers 


The notation a® is used for the algebraic operation of raising to a power. The number a is called the 
base, x is called the exponent or power, and a? is called the power. Powers are defined as in Table 1.1. 
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For the allowed values of bases and exponents there are the following 


Rules of Calculation: 


a? 
ata” = a™, a:a? = — at”, 
αν 
T T 
. > d a 
a" b" = (0b; a Fns 
bt b 


(a*)! — (a*)* = a", 


a = e ™ (a4 > 0). 


Here lna is the natural logarithm of a where e = 2.718281828459 . . . is the base. 


c -(1 


+1, if n even, 
—1, ifn odd, 7 


(1.16a) 


a? —1 forany a £0. 


Table 1.1 Definition of powers 


(1.12) 


(1.13) 


(1.14) 
(1.15) 


Special powers are 


1.16b) 

















positive real 


(p, q integer, q > 0) 


base a exponent x power a* 
0 1 
arbitrary real, Z0 | n— 1,2,3,... a" =a-a-a-...:q (a tothe power n) 
~ ———— 
n factors 
1 
n = —1, —2, —3,... | a = — 
a” 
rational: ? at = Yar 


(q-th root of a to the power p) 























irrational: 
Pk 
lim Pk lim a% 
k>œ qk k—oo 
0 positive 0 
1.1.4.2 Roots 


According to Table 1.1 the n-th root of a positive number a is the positive number denoted by 
Va (a » 0, real; n 7 0, integer). 


This operation is called taking of the root or extraction of the root, a is the radicand, n is the radical or 


index. 
The solution of the equation 


x” =a (areal or complex; n > 0, integer) 


(1.17) 


(1.17b) 


is often denoted by « = i/a. But there is no reason to be confused: In this relation the notation de- 


notes all the solutions of the equation, i.e., it represents n different values x, (k = 1,2,... 


,n) to be 


calculated. In the cace of negative or complex values they are to be determined by (1.140b) (see 1.5.3.6, 


p. 38). 


Bl A: The equation x? = 4 has two real solutions, namely 21,2 = +2. 


E B: The equation zë 


—8 has three roots among the complex numbers: z1 
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—2 and #3 = 1—i¥3, but only one among the reals. 


3, 12 
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1.1.4.3 Logarithms 


1. Definition 
The logarithm u of a positive number x > 0 to the base b > 0, b Z 1, is the exponent of the power 
which has the value x with b in the base. It is denoted by u = log, x. Consequently the equation 


bř =x (1.184) yields log, x = u (1.18b) 
and conversely the second one yields the first one. In particular holds 


—oo for b » 1, 


+oo for b < 1. (1.180) 


Ίορι1--0, log,b=1, log,0 = { 
The logarithm of negative numbers can be defined only among the complex numbers. The logarithmic 
functions see 2.6.2, p. 73. 
To take the logarithm of a given number means to find its logarithm. To take the logarithm of an 
expression means it is transformed like (1.19a, 1.19b). The determination of a number or an expression 
from its logarithm is called raising to a power. 
2. Some Properties of the Logarithm 
a) Every positive number has a logarithm to any positive base, except the base b — 1. 
b) For z 7» 0 and y > 0 the following Rules of Calculation are valid for any b (which is allowed to be 
a base): 


log (xy) = loga +logy, log B = log z — log y. (1.19a) 
y 
; : 1 
logz" — nlogz, in particular log Wz = — log z . (1.19b) 
n 


With (1.19a, 1.19b) the logarithm of products and fractions can be calculated as sums or differences of 
logarithms . 


32? yy s deg cv 3a? V 
2zu3 2zu 


1 
= log3 + 2log x 4 3 losy log 2 — log z — 3log u. 


E Take the logarithm of the expression = log (30? Vy )- log (22119) 





Often the reverse transformation is required, i.e., an expression containing logarithms of different amou- 
nts is to be rewritten into one, which is the logarithm of one expression. 
25 


3r 
W 1og3 -- 21ogx 4 εί log2 — logz — 3logu = log Vy 
3 2zu3 





c) Logarithms to different bases are proportional, i.e., the logarithm to a base a can be change into a 
logarithm to the base b by multiplication: 


log, z — Mlog,r where M — log; b = E (1.20) 
log, a 

M is called the modulus of the transformation. 
1.1.4.4 Special Logarithms 
1. The logarithm to the base 10 is called the decimal or Briggsian logarithm, in formulas: 

logjyr —lgx and  log(x10*) — a + log zv. (1.21) 
2. The logarithm to the base e is called the natural or Neperian logarithm, in formulas: 

log, £ = In x. (1.22) 
The modulus of transformation to change from the natural logarithm into the decimal one is 

M = loge = = — 0.4342944819 , (1.23) 


In 10 
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and to change from the decimal into the natural one it is 


1 
Μι -- M” In 10 = 2.3025850930 . (1.24) 
3. The logarithm to base 2 is called the binary logarithm, in formulas: 
logor —ldz or logoz — bz. (1.25) 


4. The values of the decimal and natural logarithm can be found in logarithm tables. Some time ago 
the logarithm was used for numerical calculation of powers, and it often made numerical multiplication 
and division easier. Mostly the decimal logarithm was used. Today pocket calculators and personal 
computers make these calculations. 
Every number given in decimal form (so every real number), which is called in this relation the antilog, 
can be written in the form 

z— $105 with 1 « $ «10 (1.364) 
by factoring out an appropriate power of ten: 10* with integer k. This form is called the half-logarithmic 
representation. Here @ is given by the sequence of figures of x, and 10* is the order of magnitude of x. 
Then for the logarithm holds 

logx=k+loge with O0O<logt<1, ie, logt=0,.... (1.26b) 
Here k is the so-called characteristic and the sequence of figures behind the decimal point of log $ is 
called the mantissa. The mantissa can be found in logarithm tables. 
W 15324 — 2.5105, the characteristic is 2, the mantissa is 5105. Multiplying or dividing this number 
by 10”, for example 324000; 3240; 3.24; 0.0324, their logarithms have the same mantissa, here 5105, 
but different characteristics. That is why the mantissas are given in logarithm tables. In order to get 
the mantissa of a number zx first the decimal point has to be moved to the right or to the left to get a 
number between 1 and 10, and the characteristic of the antilog x is determined by how many digits k 
the decimal point was moved. 
5. Slide rule Beside the logarithm, the slide rule was of important practical help in numerical calcu- 
lations. The slide rule works by the principle of the form (1.19a), so multiplying and dividing is done 
by adding and subtracting numbers. On the slide rule the scale-segments are denoted according to the 


logarithm values, so multiplication and division can be performed as addition or subtraction (see Scale 
and Graph Papers 2.17.1, p. 115). 


1.1.5 Algebraic Expressions 
1.1.5.1 Definitions 
1. Algebraic Expression 


One or more algebraic quantities, such as numbers or symbols, are called an algebraic expression or 
term if they are connected by the symbols, +, —,-,:, οι etc., as well as by different types of braces 
for fixing the order of operations. 
2. Identity 
is an equality relation between two algebraic expressions if for arbitrary values of the symbols in them 
the equality holds. 
3. Equation 
is an equality relation between two algebraic expressions if the equality holds only for a few values of 
the symbols. For instance an equality relation 

F(x) = f(x) (1.27) 
between two functions with the same independent variable is considered as an equation with one variable 
if it holds only for certain values of the variable. If the equality is valid for every value of x, it is called 
an identity, or one says the equality holds identically, written as formula F(x) = f(x). 
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4. Identical Transformations 

are performed in order to change an algebraic expression into another one if the two expression are 
identically equal. The goal is to have another form, e.g., to get a shorter form or a more convenient 
form for further calculations. Often it is of interest to have the expression in a form which is especially 
good for solving an equation, or taking the logarithm, or calculating the derivative or integral of it, etc. 


1.1.5.2 Algebraic Expressions in Detail 


1. Principal Quantities 

Principal quantities are those general numbers (literal symbols) occurring in algebraic expressions, ac- 
cording to which the expressions are classified. They must be fixed in any single case. In the case of 
functions, the independent variables are the principal quantities. The other quantities not given by 
numbers are the parameters of the expression. In some expressions the parameters are called coeffi- 
cients. 

BH So-called coefficients occur e.g. in the cases of polynomials, Fourier series, and linear differential 
equations, etc. 

An expression belongs to a certain class depending on which kind of operations are performed on the 
principal quantities. Usually, the last letters of the alphabet z, y, z, u, v,... are used to denote the 
principal quantities and the first letters a, 0, c,... are used for parameters. The letters m, n, p, ... are 
usually used for positive integer parameter values, e.g. for indices in summations or in iterations. 

2. Integral Rational Expressions 

are expressions which contain only addition, subtraction, and multiplication of the principal quantities, 
including powers of them with non-negative integer exponents. 

3. Rational Expressions 

contain also division by principal quantities, i.e., division by integral rational expressions, so principal 
quantities can have negative integers in the exponent. 

4. Irrational Expressions 

contain roots, i.e., non-integer rational powers of integral rational or rational expressions with respect 
to their principal quantities, of course. 

5. Transcendental Expressions 

contain exponential, logarithmic or trigonometric expressions of the principal quantities, i.e., there 
can be irrational numbers in the exponent of an expression of principal quantities, or an expression 
of principal quantities can be in the exponent, or in the argument of a trigonometric or logarithmic 
expression. 


1.1.6 Integral Rational Expressions 


1.1.6.1 Representation in Polynomial Form 
Every integral rational expression can be changed into polynomial form by elementary transformations, 
as in addition, subtraction, and multiplication of monomials and polynomials. 
Wi (—a? + 2a?x — x?) (4a? + 8ax) + (a2x? + 2a?a? — Aaxt) — (až + 4a?x? — 4azt) 
4a? 4- 8a*x — 4a?z? — Sa*x 4- 16a?2? — Baxt + aèr? 4- 2a?x? — Aax^ — a? — Aa?3? -- Aaa 
— —b5a? -- 13a3z? — 2a?z3 — Bart. 








1.1.6.2 Factoring Polynomials 

Polynomials often can be decomposed into a product of monomials and polynomials. To do so, factoring 
out, grouping, special formulas and special properties of equations can be used. 

Bl A: Factoring out: 8ax?y — 6bx?y? |- Aca? — 2a?(4ay — 3bry? 4- 2c2?). 

HB: Grouping: 62? + xy — y? — 10xz — 5yz = 62? + 3xy — 2zy — y? — 10xz — 5byz — 3x(2x + y) — 
y(2x 4- y) — 5z(2x + y) = (2% + y) (8a — y — 52). 
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Bl C: Using the properties of equations (see also 1.6.3.1, p. 43): P(x) — x9 — 225 4- Az* -- 2a? — 5a?. 
a) Factoring out z?. b) Realizing that o4 — 1 and a» — —1 are the roots of the equation P(x) = 0 and 
dividing P(z) by z2(—1)(z--1) ^ z1— 2? gives the quotient 2? — 2x 4-5. This expression can no longer 
be decomposed into real factors because p = —2, q — 5, p?/4 — q « 0, so finally the decomposition is 
xê — 25 + 4r4 + 2r? — 5x? = x? (x — 1) (x + 1)(x? — 2x + 5). 








1.1.6.3 Special Formulas 


























(z x y)? — a? +2ry +4, (1.28 
(z +y +z}? = r? +y? +2? +2ry + 2rz + 2yz, (1.29 
ο ο o +t+u =r Hy HH Ha 
+2gy + 2rz +- - +2gu + 2yz +--+ 2yu +--+ 2tu, (1.30 
(x +y)? = 2? + 32?y + 3ry? t y. (1.31 

















The calculation of the expression (x + y)” is done by the help of the binomial formula (see (1.36a)— 
(1.37a)). 




















(α Ελα -- |) -- αἲ -- γ’, (1.32 
a” — y” n—1 n-2 n-2 n-i ‘ 
Ξα fay +--+ ay”? +y"™, (for integer n, and n > 1), (1.33 
T— y 
x” +y” 1 2 2 1 
Z = g" a y+- ry Hy (for odd n, andn > 1), (1.34 
ο y 
q^ y^ 
Lomatdogat?ygp ay" ?. "7! (for even n, andn > 1). (1.35 
vy 





1.1.6.4 Binomial Theorem 


1. Power of an Algebraic Sum of Two Summands (First Binomial Formula) 
The formula 


n(n — 1) n(n — 1)(n — 2) 











(a+b)? = a" +na"b 4 a^ 4 αἲ 509 
2ἱ 3! 
n(n— 1)...(n— 1 
bere Hsec qm ο η H- nab 4 b” (1.36a) 
m! 
is called the binomial theorem, where a and b are real or complex values and n — 1,2,.... Using the 


binomial coefficients delivers a shorter and more convenient notation: 


(a+b)" = 7 Jan 4 (7) an-15 4 (" ] an] £ (" ]an-39 eee (abet 4 | b^ (1.36b) 
0 1 2 3 τ-- 1 n 


(a +b)” = x (;) atk yk, (1.36c) 


k=0 \* 


or 





2. Power of an Algebraic Difference (Second Binomial Formula) 


n(n — 1)(n — 2) 











w= 1 $ 
(a — b)^ — a^ — na"-1b 4 m ) n2? E απ 3 
=a 1 
eas a ja nin ) τ MEI) nmg... ly (1.372) 
m: 


or 
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(a — b^ 5 5 (5 (-1Ygn-*)*. (1.37b) 


k=0 


3. Binomial Coefficients 
The definition is for non-negative and integer n and k: 





n n! 
-—————— (0xktzn 1.38a 
H m-ak CSES) pone 
where n! is the product of the positive integers from 1 to n, and it is called n factorial: 
n! =1-2-3-...+n, and by definition 0! = 1. (1.38b) 


The binomial coefficients can easily be seen from the Pascal triangle in Table 1.2. The first and the 
last number is equal to one in every row; every other coefficient is the sum of the numbers standing on 
left and on right in the row above it. 


Simple calculations verify the following formulas: 


n n n! n n n 
ο e (e Ge Gae cam 
n+1 n n—1 n—2 k 
(9-0 *( k )+( k ΠΡΙ (1.390) 
n+l nal n n n—kímn 
( k ) F cra) (1.39d) s i TE 0. (1.996) 
( 1 








Table 1.2 Pascal’s triangle 








n Coefficients 

0 li 

1 1 1 

2 1 2 1 

3 1 3 3 1 

4 1 4 6 4 1 

5 1 5 10 10 5 1 

6 1 6 15 20 15 6 1 

T t aT T T T dr 

6 6 6 6 6 6 6 6 

0 0 0 & @ 9 @ 











For an arbitrary real value a (œ € R) and a non-negative integer k one can define the binomial coeffi- 


ient [2 
cien i 
k 


ὦ) = ee Dac mec een for integer k and k > 1, (6) =1. (1.40) 
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a (3) .-ici-9ci-23. 5 

3 3! 16 
4. Properties of the Binomial Coefficients 
e The binomial coefficients increase until the middle of the binomial formula (1.36b), then decrease. 
e The binomial coefficients are equal for the terms standing in symmetric positions with respect to 
the start and the end of the expression. 
e The sum of the binomial coefficients in the binomial formula of degree n is equal to 2” . 
e The sum of the coefficients at the odd positions is equal to the sum of the coefficients at the even 
positions. 
5. Binomial Series 
The formula (1.36a) of the binomial theorem can also be extended for negative and fraction exponents. 
If |b] < a, then (a + b)” has a convergent infinite series (see also 21.5, p. 1057): 





(n—1 -1)n-2 
(a -- b)" — a" -- na" ἡ m ) n? | ae ve ΠΣ νεος (1.41) 


1.1.6.5 Determination of the Greatest Common Divisor of Two 
Polynomials 


It is possible that two polynomials P(x) of degree n and Q(x) of degree m with n 7 m have a common 

polynomial factor, which contains æ. The least common multiple of these factors is the greatest common 
divisor of the polynomials. 

E P(x) = (x — 1}(x — 2)(x — 4), Q(x) = (x — 1)(x — 2)(x — 3); the greates common devisor is 
(z — 1)(x — 2). 

If P(x) and Q(x) do not have any common polynomial factor, they are called relatively prime or coprime. 
In this case, their greatest common divisor is a constant. 


The greatest common divisor of two polynomials P(x) and Q(x) can be determined by the Euclidean 
algorithm without decomposing them into factors: 
1. Division of P(x) by Q(x) = Ro(x) results in the quotient T)(«) and the remainder Rj (x): 


P(x) 2 Q(x)Ti(x) + Ri (a) - (1.42a) 
2. Division of Q(r) by Ri (x) results in the quotient T;(v) and the remainder Fox): 
Q(x) = Ry(x)To(x) + Ro(2). (1.42b) 


3. Division of R(x) by Ro(r) results in T3(r) and Ra(r), etc. The greatest common divisor of the 
two polynomials is the last non-zero remainder R,(#). This method is known from the arithmetic of 
natural numbers (see 1.1.1.4, p. 3). 

The determination of the greatest common divisor can be used, e. g., when equations musz be solved 
to separate the roots with higher multiplicity or to apply the Sturm method (see 1.6.3.2, 2., p. 44). 


1.1.7 Rational Expressions 


1.1.7.1 Reducing to the Simplest Form 

Every rational expression can be written in the form of a quotient of two coprime polynomials. To do 
this, only elementary transformations are necessary such as addition, subtraction, multiplication and 
division of polynomials and fractions and simplification of fractions. 


2x +y 





3x + 
W Find the most simple form of Ἔα. 1 E y? + 


2 
ela +> 
( 2) 


THz. 


z S 
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(852 1-21 1-ψ)27 =y z +r +z οσα 1- 2023 }- ψαῦ + (2222 +2)(-y?z +242) 
(α3α2 + a)z z 0323 + xz 
3x2? + 282? + y2? — 23 y223 — yz + ote? +27 4 2323 + xz 
1323 + xz 





1.1.7.2 Determination of the Integral Rational Part 
A quotient of two polynomials with the same variable x is a proper fractionif the degree of the numerator 
is less than the degree of the denominator. In the opposite case, it is called an improper fraction. Every 
improper fraction can be decomposed into a sum of a proper fraction and a polynomial by dividing the 
numerator by the denominator, i.e., separating the integral rational part. 
3a* — 10aa? + 22a?a? — 24a3x + 10a4 . 

x? — 2ax + 3a? ` 








E Determine the integral rational part of R(x) 


ἃ --9 Δρ. 5 4 
(32*—10a2?--22a?z?—24a?x --10a*) : (a? — 2aa + 3a?) = 3x? — 4az + 5a? + — E : à 2 
3a4— 6aa34- 9a?3? v'— 402 — 30 
— 4aa?-4-13a?z?—24a?x 
— 4ax?-- 8a?3?—12a?a 
5a?1?—12a?x --10a* 
—2a?x — 5a 


T T4 24 
R(x) daa + 5a* 4 2 Jaa $ 3a? 


The integral rational part of a rational function R(x) is considered to be as an asymptotic approximation 
for R(x) because for large values of |x|, the value of the proper fraction part tends to zero, and R(x) 
behaves as its polynomial part. 


32? 





— 2a3x— 5a‘. 








1.1.7.3 Partial Fraction Decomposition 
Every proper rational fraction 

P(x) απ -- a, 127 1 4- -- -- a1z + ao 
Q(x) bx" b, 1x74... rd bo 
with coprime polynomials in the numerator and denominator can be uniquely decomposed into a sum of 


partial fractions. The coefficients ao, a1, . . ., bn are real or complex numbers. The partial 
fractions have the form 


A Dr+ E 


— 1.44 and — ——— —— 
(x — a)* ( a) an (a? + px + q)™ 





R(x) = (1.43) 





(n <m) 


<, än, bo, b1,- 


2 
where (5) -q«0. (1.44b) 
In the followings real coefficients are assumed in R(x) in (1.43). 

First the leading coefficient b,,, of the denominator (Q(x) is transformed into 1 by dividing the numerator 
and the denominator of (1.43) by the original value of bm. In the case of real coefficients the following 
three cases are to be distinguished. 

In the case of complex coefficients in R(x) only the first two cases can occur, since complex polynomials 
can be factorized into a product of first degree polynomials. Every proper rational fraction R(x) can 
be expanded into a sum of fractions of the form (1.44a), where A and α are complex numbers. 


1. Partial Fraction Decomposition, Case 1 





The denominator Q(x) has m different simple roots a1 ,... ,@m. Then the expansion has the form 
P(x Gaz" d: αἱ A A A 
(z) nt + 0 E 1 2 m (1.452) 
Q(z)  (x—oi(r—os»...(r—o&4) x—o T-a X — Om 


16 1. Arithmetics 





with coefficients 
P(o4) ο P(ag) ο P(Qm) 
Q(m)' "* Q(o)' ""' Om)’ 


where in the numerators of (1.45b) the values of the derivative 


Ay = Am 











(1.45b) 





dQ 
are taken for 7 = a1, z — a»,.... 
1 



































632—z41 A B C 
r-r x πι ανα 
P(0) P(1) P(-1) 
P(x) = 6x? — x +1, Q'(x) = 3x2? — 1, A = =-1,B= = 3 and C = = 4, 
“i τ TO a) QC) 
P(x) 1 3 4 
Ξ-Ξ1 + 
Q(x) α r—1l1 «41 
An other possibility to determine the coefficients A1, A5,... , Aj is the method of comparing coeffi- 
cients (see 4., p. 17). 
2. Partial Fraction Decomposition, Case 2 
The denominator Q(x) has | multiple real roots 04, 0,..., 04 with multiplicities ky, ko, ..., kj respec- 
tively. Then the decomposition has the form 
P(x) Ant” + Apa") + +++ + αρ EU i Λο Ak, 
Q(x) (r—oj)&(r—oa3)9...(r—oj)h z-a, (z-a) | i (a — ay) 
Bı DB» By, Ly, 
| vs T" . 1.46 
r-o, (r— a3)? (a — ay) T (£ = a)" ΠΡ 


21-51 Αι i Bı i Bə + Bs 
r(z—1) v r—1 (x—1)? (x—1) 


by the method of comparing coefficients. 





a The coefficients A1, B1, B5, B3 can be determined 


3. Partial Fraction Decomposition, Case 3 
If the denominator Q(z) has also complex roots, then its factorization is 


Q(x) = (# — ay)" (a — a) +++ (ax — a)" 


(a? + pra + qn) (a? }- 2ροα + qo)? +++ (x? + ppt + qr)” (1.47) 
according to (1.168), p. 44. Here 04,05, ..., 0; are the [| real roots of polynomial Q(x). Beside these 
roots Q(x) has r complex conjugate pairs of roots, which are the roots of the quadratic factors z? — 


p; 2 
piz + qi (i=1,2,...,r). The numbers p;,q are real, and (5) — qi < 0 holds. In this case the partial 


fraction decomposition has the form 









































P(x) u Gn” αμ 42377 4 --- E ax 4- ao 
Q(x)  (x—o)"(zr — a3)? --- (x? + pra + qd)" (a? + pow + qa)™ --- 
A A A B B. B 
m + (ap god m = Ἐπ ο το —€— ans i 
ασ Dı | Cox + Də Crt + Diy , 
‘Ptpeta ο τρια κα Fp | 
Eix + Fi Eox + Fo au But PF, o (1.48) 
S»ste (psy GEF par F p)" 
5x? — 4r + 16 A Cız + Dı Cox + Do 








= . The coefficients A, C3, D1, C2, D2 are 
(r—3)(z?2— z-1? «-3 2-241 (x? -2+1) κ ο ο αἱ 


to be determined by the method of comparing coefficients. 
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4. Method of Comparing Coefficients 
In order to determine the coefficients A1, A5, ..., E, F1... in (1.48) the expression (1.48) has to be mul- 
tiplied by Q(x), then the result Z (x) is compared with P(«) ), since Z(x) = P(a). After ordering Z(x) 
by the powers of x, one gets a system of equations by comparing the coefficients of the corresponding 
x—powers in Z(x) and P(x). This method is called the method of comparing coefficients or method of 
undetermined coefficients. 
a OU etl P B | C A(x? —1) + Bee +1) + Ce(s - 1) 

-r T T—1 rz4l x(x? — 1) 
Comparing the coefficients of the same powers of x, one gets the system of equations 6 = A+B+C, 
—1 = B — C , 1 = —A, and its solutions are A 2 —1, B 2 3, C — 4. 








1.1.7.4 Transformations of Proportions 
The equality 
a ο a b d c b d 


Pr (1.49a) yields ad=bce, ---, ---, --- (1.495) 
d c I 


and furthermore 


a+b- ctd atb ctd atc b+d a+b c+d 














= = ; = ] = s 1.49c 
b d ' a ο c d a—b  c—d (1.490) 
From the equalities of the proportions 
αι Q2 απ ; αι αρ." απ 0 
SS SSeS 1.50a s tha eS Se 
5L i: ( a) it follows that τπτ h (1.50b) 








1.1.8 Irrational Expressions 

Every irrational expression can be written in a simpler form by 1. simplifying the exponent, 2. taking 
out terms from the radical sign and 3. moving the irrationality into the numerator. 

1. Simplifying the Exponent The exponent can be simplified if the radicand can be factorized 
and the index of the radical and the exponents in the radicand have a common factor; the index of the 
radical and the exponents must be divided by their greatest common divisor. 

E πο, — 23H 4 710) = ye -x52(x — 1)? = ᾗψααδ(α -1). 

2. Moving the Irrationality There are different ways to move the irrationality into the numerator. 


ga: /@ = 2ry E ναι BB: à| =? 2xy?z S ὑλερρα. 
2y 4y? 2y 4yz? 


8y323 2yz 
— Vu 


1 
C: = =a À 
ey eye v) ra 
..Ἵἀἄ. ου 
(UV (s qu) (P yv VY) +y 
3. Simplest Forms of Powers and Radicals Also powers and radicals can be transformed into 
the simplest form. 


“ 8146 94? vr 3r/z(vV2-- St) ϑαγα 4- 32? 
να ui vx  y2—- Vx 2-2 B 2-2 ` 


(να Mai Yr + Wa") (να Vr 4 du Va) (a 1/24 42/34 43/44 a2 (g1/2 13. 


p 2r — ,,.. 5/6. gp — 18/12/12 | 3/4 f ,:11/12 (4, (45/6. 11/13. 13/12 



































18 1. Arithmetics 





7/6 gy = 78/4 78/2 411/12 4.,3/4 = Y5 — WZB_ Wel p 3 — H(I — g6 41/3 +712) — 
VT (1— YT — Yet V2). 

1.2 Finite Series 

1.2.1 Definition ofa Finite Series 


The sum 


n 
Sn = ao + a1 +a2 +-+ + an = Yai, (1.51) 
i=0 
is called a finite series. The summands a; (i = 0,1,2,...,n) are given by certain formulas, they are 


numbers, and they are the terms of the series. 


1.2.2 Arithmetic Series 
1. Arithmetic Series of First Order 


is a finite series where the terms form an arithmetic sequence, i.e., the difference of two terms standing 
after each other is a constant: 








Aa; aj,1— a; —d- const holds, so a; - ao + id. (1.52a) 
Thus holds: 

Sn = ao + (ao + d) + (ao + 2d) +--+ + (ag + nd) (1.52b) 

a 1 

pl 3 7^ (n 4 1) — 1 σου (1.52c) 
2. Arithmetic Series of k-th Order 
is a finite series, where the k-th differences A*a; of the sequence do, a1, a5, ...,a,, are constants. The 
differences of higher order are calculated by the formula 

A'ay— A lau, — AN lai (p2,8,..., k). (1.53a) 


It is convenient to calculate them from the difference schema (also difference table or triangle schema): 


ao 
Δαρ 
αι A?ag 
Δαι A?ag 
az Aa, s. AFag 
Aag A?a, 
ag Δίω "Aa 07 (1.53b) 
: A" ag 
Ata, a 
Ae an_3 a 
Ala, 5 
hai 
απ 


The following formulas hold for the terms and the sum: 


ai = ao + (;) 4% + B Aag + +++ + (Δία (6:12, m) (1.53c) 


n+1 n+1 n+1 n+l 
s-( 1 al 2 Jao + ( 3 ΠΡ (1.534) 
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1.2.3 Geometric Series 
The sum (1.51) is called a geometric series, if the terms form a geometric sequence, i.e., the ratio of two 
successive terms is a constant: 
αι > 
— 2 const holds, so a;- aod. (1.544) 
αι 


Thus holds: 





n+l 


-1 
Sn = ao + aoq + aog? + -- - + aog” = o for ql, (1.54b) 
g= 
Sn = (n+ 1)aọo for q=1. (1.549) 
For n — oo (see 7.2.1.1, 2., p. 459), there is an infinite geometric series, which has a limit if |g| « 1, 
and this limit is called sum s: 



























































s= τ (1.544) 
1.2.4 Special Finite Series 
1 
42434 (n-1) en c TD ) (1.55 
η 1 2 η 
οσο. (1.56 
143454+---+(2n—3)+(Q2n-1) =n’, (1.57 
24+4+4+6+4+---+(2n-—2)+2n =n(n+1), (1.58 
1 1)(2 1 
1? 4-22. 3? (n pages l Ber ) (1.59 
2(n + 1)? 
13423433 4---4+(n Ip tnd = tet ) ; (1.60 
4n? —1 
1432454 (in - ip - X 5 ) (1.61 
1? 33 4 5? 4 (2n — 1)? = n?(2n? — 1), (1.62 
1)(2n 4 1) (31? 4- 3n — 1 
(PU Pe pnt 2 ME Net Die a= 1) CEN Hed. (1.63 
1-16 Du qnt 
1420 +30? 4+---+ne™t = (n ü M s (ασ. (1.64 
1.2.5 Mean Values 
(See also 16.3.4.1, 1., p. 839 and 16.4, p. 848) 
1.2.5.1 Arithmetic Mean or Arithmetic Average 
The arithmetic mean of the n quantities a1, a2,..., dn is the expression 
atag+-:-+a, 12 
ra= — zia. (1.652) 
k-i 
For two values a and b holds: 
b 
= = (1.65b) 


The values a, x, and b form an arithmetic sequence. 
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1.2.5.2 Geometric Mean or Geometric Average 
The geometric mean of n positive quantities a1, 02, ..., a, is the expression 


1 
n n 
za = Yaz... än = (Ù «) . 
k=1 
For two positive values a and b holds 


Ta = Vab. 


(1.66a) 


(1.66b) 


The values a , xc; and b form a geometric sequence. 


Fig. 1.3b. 








tion (see 3.5.2.3, 3., p. 194). 


Figure 1.3 
1.2.5.3 Harmonic Mean 


The harmonic mean of n quantities a4, 02,..., a4 (a; Z 0;i — 1,2,...,n) is the expression 


m [IE Les ipei. 





nay ag απ n i 


For two values a and b holds 


2 =((2+5)] ο 9αῦ 
“ΗΓ Μα} i TH One 


1.2.5.4 Quadratic Mean 


The quadratic mean of n quantities a1, a2,..., dy is the expression 





1 1 
xg — 4 —(ai? o a3? a2) — -Y aż. 
n T y 


For two values a and b holds 


a? +b? 
rQ = zs 





The quadratic mean is important in the theory of observational error (see 16.4, p. 848). 


1.2.5.5 Relations Between the Means of Two Positive Values 


b 2ab la? +b? 
For z4 = C, ge = Vab, ty =, 29 = - > we have 
1. ifa < b, then 


a < Ty < Ttg < Ta < To <b, 
2. ifa = b, then 


Q=ta=%tg=ty=tqg=b. 








If a and b are given line segments, then a segment 


with length xa = vab can be given by the help of 
one of the constructions shown in Fig. 1.3a or in 


A special case of the geometric mean is given by di- 
viding a line segment according to the golden sec- 


(1.67a) 


(1.67b) 


(1.682) 


(1.68b) 


(1.692) 


(1.69b) 
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1.3 Business Mathematics 

Business calculations are based on the use of arithmetic and geometric series, on formulas (1.52a)— 
(1.52c) and (1.54a)-(1.54d). However these applications in banking are so varied and special that a 
special discipline has developed using specific terminology. So business arithmetic is not confined only 
to the calculation of the principal by compound interest or the calculation of annuities. It also includes 
the calculation of interest, repayments, amortization, calculation of instalment payments, annuities, 
depreciation, effective interest yield and the yield on investment. Basic concepts and formulas for cal- 
culations are discussed below. For studying financial mathematics in detail, you will have to consult 
the relevant literature on the subject (see [1.2], [1.8]). 


Insurance mathematics and risk theory use the methods of probability theory and mathematical statis- 
tics, and they represent a separate discipline, so they don't be discussed here (see [1.4], [1.5]). 


1.3.1 Calculation of Interest or Percentage 
1.3.1.1 Percentage or Interest 


The expression p percent of K means τον. where K denotes the principal in business mathematics. 
'The symbol for percent is 96, i.e., the following equalities hold: 


p% = ud or 1% = 0.01. (1.70) 


1.3.1.2 Increment 


If K is raised by p%, the increased value is 


ege ct 
Ř=K (1+ Πο) (1.71) 


Relating the increment Ke to the new value K , the proportion is Ks :K= p : 100, so K contains 
p-100 
100 + p 


percent of increment. 


W [fan article has a value of € 200 and a 15% extra charge is added, the final value is € 230. This price 
15-100 


115 
1.3.1.3 Discount or Reduction 
Reducing the value K by p% rebate yields the reduced value 





p= (1.72) 


contains p — — 13.04 percent increment for the user. 





rs p 
k-k(1- 2). 1. 
2 ( (ο) 5 


Comparing the reduction Ka to the new value K gives 


p- 100 
100 — p 
percent of rebate. 


W if an article has a value € 300, and they give a 10% discount, it will be sold for € 270. This price 
10- 100 


90 





p= (1.74) 


contains p — = 11.11 percent rebate for the buyer. 








22 1. Arithmetics 





1.3.2 Calculation of Compound Interest 
1.3.2.1 Interest 


Interestis either payment for the use of a loan or it is a revenue realized from a receivable. For a principal 
K, placed for a whole period of interest (usually one year), 
, P 
Kc 1.75 
100 (1.75) 
interest is paid at the end of the period of interest. Here p is the rate of interest for the period of interest, 
and one says that p% interest is paid for the principal K. 


1.3.2.2 Compound Interest 


Compound interestis computed on the principal and on any interest earned that has not been paid or 
withdrawn. It is the return on the principal for two or more time periods. The interest of the principal 
increased by interest is called compound interest. 

In the following different cases are discussed which depend on how the principal is changing. 

1. Single Deposit 

Compounded annually the principal K increases after n years up to the final value K,. At the end of 
the n-th year this value is: 


e p 
K,=K(1+—). 1.76 
g ( 7 τ) (1:76) 
For a briefer notation the substitution 1 + ΙΙ — q is used and q is called the accumulation factor or 


growth factor. 
Interest may be compounded for any period of time: annually, half-annually, monthly, daily, and so on. 
Dividing the year into m equal interest periods the interest will be added to the principal K at the end 


of every period. Then the interest is Kv for one interest period, and the principal increases after 


PX 
100m 
n years with m interest periods up to the value 





; p wy 
Kana KI . 1. 
᾽ ( * το) (1-77) 


The quantity (1 + (τ) is known as the nominal rate, and (1 + 100 i) as the effective rate. 
W A principal of € 5000, with a nominal interest 7.2% annually, increases within 6 years a) com- 
pounded annually to Ke — 5000(14-0.072)6 — € 7588.20, b) compounded monthly to K7_ = 5000(1+ 
0.072/12)" = € 7691.74. 


2. Regular Deposits 


Suppose depositing the same amount E in equal intervals. Such an interval must be equal to an interest 
period. The depositions can be made at the beginning of the interval, or at the end of the interval. At 
the end of the n-th interest period the balance Kpn is 








a) Depositing at the Beginning: b) Depositing at the End: 
-1 q*—1 
Kn = Eq 1.78a Kn =E . 1.78Ł 
e Τ᾽ (1.784) qa (1.78b) 


3. Depositing in the Course of the Year 

A year or an interest period is divided into m equal parts. At the beginning or at the end of each of 
these time periods the same amount £ is deposited and bears interest until the end of the year. In this 
way, after one year the balance Ky is 
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a) Depositing at the Beginning: b) Depositing at the End: 
(m Up (m 1)p 
Ky=E hi — 1.792 Kı= E — ———|. (79b 
Kı In + 200 (1.792) 1 m+ 200 ( ») 


In the second year the total Ay bears interest, and further deposits and interests are added like in the 
first year, so after n years the balance Kpn for midterm deposits and yearly interest payment is: 








a) Depositing at the Beginning: b) Depositing at the End: 
: (m+1)p| 7—1 7 (m — Dp| q^ —1 
K, = E |i ------ . (1.80a K,=E ------ : 1.801 
KÇ [r+ 200 πι (1.80a) K m+ 200 = (1.80b) 


E ^t a yearly rate of interest p = 5.2% a depositor deposits € 1000 at the end of every month. After 
how many years will it reach the balance € 500 000? 


From (1.80b), for instance, from 500000 = 1000 [12 + 
22.42 years. 





11: 2) 1.052" — 1 
200 0.052 


, follows the answer, n = 


1.3.3 Amortization Calculus 


1.3.3.1 Amortization 


Amortization is the repayment of credits. The assumptions: 

1. For a debt S the debtor is charged at p% interest at the end of an interest period. 

2. After N interest period the debt is completely repaid. 

The charge of the debtor consists of interest and principal repayment for every interest period. If the 
interest period is one year, the amount to be paid during the whole year is called an annuity. 

There are different possibilities for a debtor. For instance, the repayments can be made at the interest 
date, or meanwhile; the amount of repayment can be different time by time, or it can be constant during 
the whole term. 


1.3.3.2 Equal Principal Repayments 


The amortization instalments are paid during the year, but no midterm compound interest is calcu- 
lated. The following notation should be used: 
e S debt (interest payment at the end of a period with p%), 


5 ΠΕ 
e T — —— principal repayment (T' — const), 
mN 


e m number of repayments during one interest period, 
e N number of interest periods until the debt is fully repaid. 
Besides the principal repayments the debtor also has to pay the interest charges: 


a) Interest Z,, for the n-th Interest Period: b) Total Interest Z to be Paid for a Debt 
S, mN Times, During N Interest Periods 
with an Interest Rate p% : 








ps 1 m+1 
a = T00 | Ν (r 2m } FERE) z=- x go Be [— -1 κ (181b) 
ai 100 2 2m 
Wl A debt of € 60000 has a yearly interest rate of 896. The prin- l. year: Zı = «4900 
cipal repayment of € 1000 for 60 months should be paid at the 2. year: Zə= € 3400 
end of the months. How much is the actual interest at the end of 3. year: Z3 = € 2440 
each year? The interest for every year is calculated by (1.81a) with 4. year: Zy= € 1480 
S = 60000, p = 8, N = 5 and m = 12. They are enumerated in 5.year: Z5 — € 520 


the annexed table. Z= €12200 
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8.60000[5—1 . 13 
M [za = € 12200. 


The total interest can be calculated also by (1.81b) as Z = a 


1.3.3.3 Equal Annuities 


a S ; r 

For equal principal repayments T = —— the interest payable decreases over the course of time (see 
m. 

the previous example). In contrast to this, in the case of equal annuities the same amount is repaid 

for every interest period. A constant annuity A containing the principal repayment and the interest is 

repaid, i.e., the charge of the debtor is constant during the whole period of repayment. 

With the notation 

e S debt (interest payment of p% at the end of a period), 

ο A annuity for every interest period (A const), 

e a one instalment paid m times per interest period (a const), 

ος - 1 -- in the accumulation factor, 

after n interest periods the remaining outstanding debt S, is: 





(πετ Dp| e? —1 
=S- Am p 1.82 
S, — Sq ο 200 m (1.82) 


Here the term Sq" denotes the value of the debt S after n interest periods with compound interest 


(see (1.76)). The second term in (1.82) gives the value of the midterm repayments a with compound 
interest (see (1.80b) with E — a). For the annuity holds 


B (m — 1)p 
d ; (1.83) 


Here paying A once means the same as paying a m times. From (1.83) it follows that A > ma. Because 
after N interest periods the debt must be completely repaid, from (1.82) for Sy = 0 considering (1.83) 
for the annuity holds: 


-1 q—1 
Α- αφ τς ασ 7. 1.84 
PIE pou (1.84) 





To solve a problem of business mathematics, from (1.84), any of the quantities A, $, q or N can be 
expressed, if the others are known. 

W A: Aloanof € 60000 bears 896 interest per year, and is to be repaid over 5 years in equal instalments. 
How much is the yearly annuity A and the monthly instalment a? From (1.84) and (1.83) we get: 


; 15027. 
A — 60 000—547. = € 15027.39, a = — = € 1207.99. 
1-iog 12 + -300 


HB: A loan of S = € 100 000 is to be repaid during N = 8 years in equal annuities with an interest rate 
of 7.5%. At the end of every year € 5000 extra repayment must be made. How much will the monthly 


instalment be? For the annuity A per year according to (1.84) follows A = 100 soo a . 


- L.0758 
€ 17072.70. Because A consists of 12 monthly instalments a, and because of the € 5000 extra payment 


| + 5000 = 17072.70 holds, so the monthly 





at the end of the year, from (1.83) A = a [a + 
charge is a = € 972.62. 
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1.3.4 Annuity Calculations 
1.3.4.1 Annuities 


If a series of payments is made regularly at the same time intervals, in equal or varying amounts, at the 
beginning or at the end of the interval, it is called annuity payments. To distinguish are: 

a) Payments on an Account The periodic payments, called rents, are paid on an account and bear 
compound interest. Therefore the formulas of 1.3.2 are to be used. 

b) Receipt of Payments The payments of rent are made from capital bearing compound interest. 
Here the formulas of the annuity calculations in 1.3.3 are to be used, where the annuities are called 
rents. If no more than the actual interest is paid as a rent, it is called a perpetual annuity. 

Rent payments (deposits and payoffs) can be made at the interest terms, or at shorter intervals during 
the period of interest, i.e. in the course of the year. 


1.3.4.2 Future Amount of an Ordinary Annuity 
The date of the interest calculations and the payments should coincide. The interest is calculated at 
p% compound interest, and the payments (rents) on the account are always the same, R. The future 
value of the ordinary annuity Rn, i.e., the amount to which the regular deposits increase after n periods 
amounts to: 

R =R! with 142 (1.85) 

= wit. = ----. $ 
πι í 100 

The present value of an ordinary annuity Ro is the amount which should be paid at the beginning of the 
first interest period (one time) to reach the final value R, with compound interest during n periods: 





: D 
Ro = — with gq=1+—. 1.86 
UR 1 100 41-86) 

W A man claims € 5000 at the end of every year for 10 years from a firm. Before the first payment 
the firm declares bankruptcy. Only the present value of the ordinary annuity Rp can be asked from the 
administration of the bankrupt’s estate. With an interest of 4% per year the man gets: 

1-1 1-47" 1 — 1.047" 
Ro = —RL— = rn—* = 5000 = € 40 554.48. 

q" q—1 q-1 0.04 











1.3.4.3 Balance after n Annuity Payments 
For ordinary annuity payments capital K is at our disposal bearing p% interest. After every interest 
period an amount r is paid. The balance K, after n interest periods, i.e., after n rent payments, is: 








nm] j 
Kn = Kq" - Ra = Kq" -ri with πι (1.872) 
Conclusions from (1.872): 
p= K (1.87b) Consequently K;, = K holds, so the capital does not change. This 
is the case of perpetual annuity. 
r> Ka (1.87c) The capital will be completely used up after N rent payments. 
From (1.872) it follows for Kx = 0: 
N 
: rq'—1 
K--— f 1.87d 
< dc (1.87d) 


If midterm interest is calculated and midterm rents are paid, and the original interest period is divided 
into m equal intervals, then in the formulas (1.85)-(1.87a) n is replaced by mn and accordingly q = 


p p 
1+ — byq=1+——. 
t 100 πε * 100m 
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W What amount must be deposited monthly at the end of the month for 20 years, from which a rent of 
€ 2000 should be paid monthly for 20 years, and the interest period is one month with an interest rate 
of 0.596. 

From (1.87d) follows for n — 20- 12 — 240 the sum which is necessary for the required payments: 


2000 1.00574 — 1 


K= Io 0005 . a 279161.54. The necessary monthly deposits R are given by (1.85): 
1.005" — 1 
Rəs = 219161.54 — R— — —— — , ie. R — € 604.19. 
0.005 


1.3.5 Depreciation 
1.3.5.1 Methods of Depreciation 


Depreciation is the term most often used to indicate that assets have declined in service potential in 
a given year either due to obsolescence or physical factors. Depreciation is a method whereby the 
original (cost) value at the beginning of the reporting year is reduced to the residual value at year-end. 
The following concepts are used: 

e A depreciation base, 

e N useful life (given in years), 

e R, residual value after n years (n < N), 

ea, (n — 1,2,..., N) depreciation rate in the n-th year. 

The methods of depreciation differ from each other depending on the amortization rate: 

e straight-line method, i.e., equal yearly rates, 

e decreasing-charge method, i.e., decreasing yearly rates. 


1.3.5.2 Straight-Line Method 


The yearly depreciations are constant, i.e., for amortization rates a, and the remaining value R,, after 
n years follows: 
A — Ry A — Ry 
= —— =a, 1.88 R, = A— n——— (n=1,2,...,N). (1.89 
y (1.88) j md ). (1.89) 
Substituting Ry = 0, then the value of the given thing is reduced to zero after N years, i.e., it is totally 
depreciated. 


απ 


W The purchase price of a machine is A — € 50000. In 5 years it should be depreciated to a value Rs = 
€ 10000. 








Year] Depreciation] Depreciation]Residual|Cumulated depr. in % Linear depreciation according to 
base expense value of the depr. base (Là 8) on ( 1.89) vields yis 
F . 8 : yields the an- 

2 m Pam a am ion nexed amortization schedule: 
3 34 000 8000 26 000 23.5 It shows that the percentage of ac- 
4 26 000 8000 18 000 30.8 cumulated depreciation with re- 
5 18 000 8000 10 000 ALA spect to the actual initial value is 








increasing. 


1.3.5.3 Arithmetically Declining Balance Depreciation 


In this case the depreciation is not constant. It is decreasing yearly by the same amount d, by the 
so-called multiple. For depreciation in the n-th year follows: 


à, —aj—(n—1)d (n—2,3,..., N - 1; a, and d are given). (1.90) 








N 
Considering the equality A — Ry — »7 a, from the previous equation it follows that: 


2[Na, m= (A = ΕΝ) g 


pe N(N - 1) 


(1.91) 
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For d = 0 follows the special case of straight-line depreciation. If d > 0, it follows from (1.91) that 


A — Ry 
a, > =a, 1.92 
---ῬἘ (192) 

where a is the depreciation rate for straight-line depreciation. The first depreciation rate a4 of the 
arithmetically-declining balance depreciation must satisfy the following inequality: 

A — Ry A — Ry 
<a, <2 L 1.93 
N 1 N (1.93) 
W A machine of € 50000 purchase price is to be depreciated to the value of € 10 000 within 5 years by 


arithmetically declining deprecia- 





























Year |Depretiation|Depreciation|Residual|Depreciation in % tion. In the first year € 15000 
base expense value of depr. base should be depreciated. 
1 50 000 15 000 35 000 30.0 The annexed depreciation sched- 
2 35 000 11500 23 500 32.9 ule is calculated by the given for- 
3 23 500 8 000 15 500 34.0 mulas, and it shows that with the 
4 15 500 4 500 11 000 29.0 exception of the last rate the per- 
5 11 000 1000 10 000 9.1 centage of depreciation is fairly 
equal. 


1.3.5.4 Digital Declining Balance Depreciation 
Digital depreciation is a special case of arithmetically declining depreciation. Here it is required that 
the last depreciation rate ay should be equal to the multiple d. From ay — d it follows that 








d= ΠΠ (1.944) ài — Nd, a3 — (N — 1)d, ...,ay —d. (1.94b) 
W The purchase price of a machine is € A = 50000. This machine is to be depreciated in 5 years to 
Year | Depreciation | Depreciation Residual] Depreciation in % the value R5 — € 10000 by 
base expense value | ofthe depr. base digital depreciation. 

1 50 000 a, = 5d = 13335] 36665 26.7 The annexed depreciation 

2 36 665 dg = 4d = 10668] 25997 29.1 schedule, calculated by the 

3 25 997 az = 3d = 8001| 17996 30.8 given formulas, shows that 

4 17 996 a4 = 2d = 5334| 12662 29.6 the percentage of the de- 

5 12662 à5— d- 2667, 9995 21.1 preciation is fairly equal. 














1.3.5.5 Geometrically Declining Balance Depreciation 
Consider geometrically declining depreciation where p% of the actual value is depreciated every year. 
For the residual value R,, after n years holds: 
p n 

a= A(1- 2) (ne 1,9... 5. (1.95) 
Usually A (the acquisition cost) is given. The useful life of the asset is N years long. If from the quan- 
tities Ry,p and N, two is given, the third one can be calculated by the formula (1.95). 
W A: A machine with a purchase value € 50000 is to be geometrically depreciated yearly by 10%. 
After how many years will its value drop below € 10000 for the first time? Based on (1.95), yields 
In(10 000/50 000) 

In(1 — 0.1) 

E B: Fora purchase price of A = € 1000 the residual value R,, should be represented for n = 1,2,..., 10 
years by a) straight-line, b) arithmetically declining, c) geometrically declining depreciation. The re- 
sults are shown in Fig. 1.4. 


N= = 15.27 years. 
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1.3.5.6 Depreciation with Different 
Types of Depreciation Account 
Since in the case of geometrically declining deprecia- 
tion the residual value cannot become equal to zero 
for a finite n, it is reasonable after a certain time, e.g., 
after m years, to switch over to straight-line depreci- 
ation. m is to be determined to an amount that from 
this time on the geometrically declining depreciation 
rate is smaller than the straight-line depreciation rate. 
From this requirement it follows that: 

m»N- ns (1.96) 

p 

Here m is the last year of geometrically declining de- 
preciation and N is the last year of linear depreciation 
Figure 1.4 when the residual value becomes zero. 





W A machine with a purchase value of € 50000 is to be depreciated to zero within 15 years, for m 
years by geometrically declining depreciation with 14% of the residual value, then with the straight- 


100 
line method. From (1.96) follows m > 15— d4 = 7.76, i.e., after m = 8 years it is reasonable to switch 


over to straight-line depreciation. 


1.4 Inequalities 


1.4.1 Pure Inequalities 
1.4.1.1 Definitions 


1. Inequalities 
Inequalities are comparisons of two real algebraic expressions represented by one of the following signs: 


TypeI > (“greater”) Type II < — ("smaller") 

Type HI 4 (“not equal”) Type Hla <> (“greater or smaller") 
TypeIV > (“greater or equal”) TypeIVa # (“not smaller”) 

Type V < (“smaller or equal” ) TypeVa f£ (“not greater”) 


The notation II and Ia, IV and IVa, and V and Va have the same meaning, so they can be replaced 

by each other. The notation III can also be used for those types of quantities for which the notions of 

"greater" or "smaller" cannot be defined, for instance for complex numbers or vectors, but in this case 

it cannot be replaced by IIIa. 

2. Identical Inequalities, Inequalities of the Same and of the Opposite Sense, 
Equivalent Inequalities 

1. Identical Inequalities are valid for arbitrary values of the letters contained in them. 

2. Inequalities of the Same Sense belong to the same type from the first two, i.e., both belong to 

type I or both belong to type II. 

3. Inequalities of the Opposite Sense belong to different types of the first two, i.e., one to type 

I, the other to type II. 

4. Equivalent Inequalities are inequalities if they are valid exactly for the same values of the un- 

knowns contained in them. 

3. Solution of Inequalities 

Similarly to equalities, inequalities can contain unknown quantities which are usually denoted by the 

last letters of the alphabet. The solution of an inequality or a system of inequalities means the determi- 

nation of the limits for the unknowns between which they can change, keeping the inequality or system 
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of inequalities true. 
Solutions can be looked for any kind of inequality; mostly pure inequalities of type I and II are to be 
solved. 


1.4.1.2 Properties of Inequalities of Type I and II 
1. Change the Sense of the Inequality 











If a>b holds, then b<a_ is valid, (1.97a) 

if a<bholds, then b>a _ is valid. (1.97b) 
2. Transitivity 

If a>b and b>chold, then a>c _ is valid; (1.98a) 

if a<b and b<chold, then a<c _ is valid. (1.98b) 
3. Addition and Subtraction of a Quantity 

If a>bholds, then atc>b+c _ is valid; (1.99a) 

if a<bholds, then atc<b+c _ is valid. (1.99b) 











By adding or subtracting the same amount to the both sides of inequality, the sense of the inequality 
does not change. 


4. Addition of Inequalities 
If a>b and c>dhold, then at+ce>b+d _ is valid; (1.100a) 





if a<b and c<dhold, then a+ce<b+d  isvalid. (1.100b) 
Two inequalities of the same sense can be added. 


5. Subtraction of Inequalities 
If a>b and c<dhold, then a—c>b-—d is valid; (1.101a) 


if a<b and c>dhold, then a—c<b—d_ is valid. (1.101b) 


Inequalities of the opposite sense can be subtracted; the result keeps the sense of the first inequality. 
Subtracting inequalities of the same sense is not allowed. 


6. Multiplication and Division of an Inequality by a Quantity 


b 
If a>b and c>Ohold, then ac>be and “>~ are valid, (1.102a) 
ü g 
: a b ' 
if a<b and c>0O hold, then ac<be and A < - are valid, (1.102b) 
ἘΣ 
᾿ a b ] 
if a>b and c<0O hold, then ac<be and z< a valid, (1.102c) 
E 
- a b 3 
if a«b and c«0 hold, then ac>be and : > -are valid. (1.102d) 
ο 


Multiplication or division of both sides of an inequality by a positive value does not change the sense 
of the inequality. Multiplication or division by a negative value changes the sense of the inequality. 
7. Inequalities and Reciprocal Values 


If 0<a<b or a<b<0O hold, then τ 55 : is valid. (1.103) 
a 
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1.4.2 Special Inequalities 
1.4.2.1 Triangle Inequality for Real Numbers 


For arbitrary real numbers a, 5, a4, a3, . .. , an, there are the inequalities 
[a 4- b| € |a| + b] ; Jay + ag +--+ + as| < |a| + la| +--+- + lanl- (1.104) 


The absolute value of the sum of two or more real numbers is less than or equal to the sum of their 
absolute values. The equality holds only if the summands have the same sign. 


1.4.2.2 Triangle Inequality for Complex Numbers 


For n complex numbers z1,25,...,2, € € 


n 
3n 
k=1 








= |a +22 ++ Zal < lal + lza] +--+ lal = X |3ε|- (1.105) 
k=1 


1.4.2.3 Inequalities for Absolute Values of Differences of Real and Complex 
Numbers 


For arbitrary real numbers a,b € R, there are the inequalities 
Ila] — |b|] < |a — | € la] -- [b]. (1.106) 
The absolute value of the difference of two real numbers is less than or equal to the sum of their absolute 


values, but greater than or equal to the absolute value of the difference of their absolute values. For 
two arbitrary complex numbers 21,29 € € 


l| — [zl] < [z1 — 22] < [al + lz2l. (1.107) 
1.4.2.4 Inequality for Arithmetic and Geometric Means 


utat tan 2 Vajd9:::a4, for a; 90. (1.108) 


TL 


The arithmetic mean of n positive numbers is greater than or equal to their geometric mean. Equality 
holds only if all the n numbers are equal. 


1.4.2.5 Inequality for Arithmetic and Quadratic Means 


2 Δ ας... 2 
Qaa bo ast (1.109) 
TL 


'The absolute value of the arithmetic mean of numbers is less than or equal to their quadratic mean. 


αι a2: dn 
n 








1.4.2.6 Inequalities for Different Means of Real Numbers 


For the harmonic, geometric, arithmetic, and quadratic means of two positive real numbers a and b 
with a « b the following inequalities hold (see also 1.2.5.5, p. 20): 








a «tg « 1g « rA-«mrg «b. (1.1102) 
Here 
a+b 2ab a? +0? 
Cay zg = Vab, tH = Pea tQ= yo (1.110b) 


1.4.2.7 Bernoulli's Inequality 
For every real number a > —1 and integer n > 1 holds 
(+a) >1+na. (1.111) 
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The equality holds only for n — 1, or a — 0. 
1.4.2.8 Binomial Inequality 
For arbitrary real numbers a,b € R holds 


jab] < Ha uh). (1.112) 


1.4.2.9 Cauchy-Schwarz Inequality 


1. Cauchy-Schwarz Inequality for Real Numbers 
The Cauchy-Schwarz inequality holds for arbitrary real numbers a;, bj € R : 





|aibı + a2b2 + -- - + anbn| < γαι» Ha +-+ an? ybi? ba oo he? (1.113a) 





or 














(ayb + agby + +++ + Gnbp)? < (ay? + ay? + +++ + an?) (b1? + by? + +++ + bn”). (1.113b) 
For two finite sequences of n real numbers, the sum of the pairwise products is less than or equal to 
the product of the square roots of the sums of the squares of these numbers. Equality holds only if 
a, : by = dg: bg = +++ = Gyn: bn. 
If n — 3 and (a1,a2, a3) and {b1, b2, b3} are considered as vectors in a Cartesian coordinate system, 
then the Cauchy-Schwarz inequality means that the absolute value of the scalar product of two vectors 
is less than or equal to the product of absolute values of these vectors. If n > 3, then this statement 
can be extended for vectors in n-dimensional Euclidean space. 


2. Cauchy-Schwarz Inequality for Complex Numbers 

Considering that for complex numbers |z|? — z*z (z* is the complex conjugate of z), the inequality 
(1.113b) is valid also for arbitrary complex numbers z;, w; € C: 

(z1wi + zowe + +++ + ZnWn)*(21wi + 22W2 +--+ + ZrWn) 

< (ayaa + 29% 22 + +++ + 2n* Zn) (Wi Wy + We* We + +++ + Wn*Wn). 

3. Cauchy-Schwarz Inequality for Convergent Infinite Series and Integrals 

An analogous statement to (1.113b) is the Cauchy-Schwarz inequality for convergent infinite series and 
for certain integrals: 


(= vty) = p më) (5 i) (1114) 


[ro eas] s (fires) (fiers). (1.115) 


1.4.2.10 Chebyshev Inequality 








If ay, 2, ... , Q5, 01,02, . .. , b, are real positive numbers, then the following inequalities hold: 
(Ot (tth) < aıbı + az2b2 +- +- + anbn (1.116a) 
n n n 
for ay<ag<...<a@, and bi < bo <...< by, 
or ay >a)>...>a, and b >bg>...> bn, 
and 
(= Faze ἝἜ %) (^ ο... ”) > aby + agby + +++ + Anbn (1.116b) 
n n n 


for ay<ag<...<a, and bh >bg >... > dy. 
For two finite sequences with n positive numbers, the product of the arithmetic means of these sequences 
is less than or equal to the aritmetic mean of the pairwise products if both sequences are increasing or 
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both are decreasing; but the inequality is valid in the opposite sense if one of the sequences is increasing 
and the other one is decreasing. 


1.4.2.11 Generalized Chebyshev Inequality 














If a1, 02, ..., Q4, 01,02, . .. , b, are real positive numbers, then the following inequalities hold: 
2 +a + H anë yo +b + +++ + dn* Z qu + (aghe) + +- + + (anbn)* (1.1172) 
TL TL TL 
for ay < az <... <an and bı < b<... < bn 
or αι αρ... σαν and bı > b>... > bn 
and 
(eee g ο a= (anbn)* (1.117b) 
TL TL TL 
for ay <a <... <an and bı > b>... > bn. 
1.4.2.12 Hölder Inequality 
1. Hölder Inequality for Series 
1 1 
If p and q are two real numbers such that — + — = 1 is fulfilled, and if x1, £2, . . . , &n and y1, Y2,- - - , Yn 
P q 


are arbitrary 2n complex numbers, then the following inequality holds: 


1 
q 


itm 
μι (1.1180) 
k=1 k=1 k=1 


This inequality is also valid for countable infinite pairs of numbers: 


Y |xkyk| < » l| i p P f 2 (1.118b) 
k=l 


k=1 k=1 

where from the convergence of the series on the right-hand side the convergence of the left-hand side 
follows. 

2. H6lder Inequality for Integrals 


If f(a) and g(x) are two measurable functions on the measure space (X, A, 14) (see 12.9.2, p. 695), then 
the following inequality holds: 


[todus [ror | ᾽ n | a (1.180) 


1.4.2.13 Minkowski Inequality 


1. Minkowski Inequality for Series 
If p 7 1 holds, and (x, £zf* and (yx 4 with xj, yy € C are two sequences of numbers, then holds: 


1 1 


7 < » l| 


p p 


1 
x [zy -- | * lx P I (1.1192) 
k=1 k=1 
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2. Minkowski Inequality for Integrals 
If f(a) and g(x) are two measurable functions on the measure space (X, A, ji) (see 12.9.2, p. 695), then 


holds: 
μπω κω» z [iron E owe (1.1195) 


1.4.3 Solution of Linear and Quadratic Inequalities 
1.4.3.1 General Remarks 


During the solution of an inequality it is transformed into equivalent inequalities step by step. Similarly 
to the solution of an equation the same expression can be added to both sides; formally, it may seem 
that a summand is brought from one side to the other, changing its sign. Furthermore one can multiply 
or divide both sides of an inequality by a non-zero expression, where the inequality keeps its sense if 
this expression has a positive value, and changes its sense if this expression has a negative value. An 
inequality of first degree can always be transformed into the form 


ax > b. (1.120) 


The simplest form of an inequality of second degree is 





>m (1.121a) or g? <m (1.121b) 
and in the general case it has the form 


aai? brc c0 (1.122a) or aa?- bxc «0. (1.122b) 
1.4.3.2 Linear Inequalities 
The linear inequality of first degree (1.120) has the solution 


b b 
z>- fora>0 (1.123a) and zc « — for a « 0. (1.123b) 
a a 


2 
E 5r+3<8r+1, 5zr—8r<1-3, —-3r7«-2, 2» 3 
1.4.3.3 Quadratic Inequalities 
Inequalities of second degree in the form 
>m (1.124a) and αὖ «πὶ (1.124b 


have solutions 


a) a?» m: Form 70 thesolutionis z » Vm and x<—Vm (|v|» vm), (1.125a 
for m < 0 the inequality obviously holds for any 2. (1.125b 
b) z?« m: Form 0 thesolutionis — /m « x « vm (|z| < vm), (1.126a 
for m <0 there is no solution. (1.126b 


1.4.3.4 General Case for Inequalities of Second Degree 


ax? +br +ce>0 (1.127a) or aa?- bxc «Q0. (1.127b 


First dividing the inequality by a. If a < 0 then the sense of the inequality changes, but in any case i 
will have the form 


x? +pr+q<0 (1.127c) or a? +pa+q>0. (1.127d 
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By completing the square it follows that 


(s + y < (5) -q (11270) i (s $ E > iy -q (1.1278) 


ΜΗ 
Denoting x 4- : by z and (5) — q by m, the inequalities 


2 <m (1.128a) or z»m (1.128b) 


can be obtained. Solving these inequalities yields the values for x. 


2 
W A: —22? - 14x — 20 7 0, z? — 7x -- 10 « 0, («-i) " d dele) 


2 4) 
3, 7, 3,7 
ΠΡΙ 
The solution is 2 < x < 5. 


E B: x? +6xr+15>0, (x +3)? >—6. The inequality holds identically. 


2 
E C: —2r? + 14x — 20 < 0, (s- i) Bes α-φ»} and z-1«-5. 


The solution intervals are x > 5 and x < 2. 
1.5 Complex Numbers 


1.5.1 Imaginary and Complex Numbers 
1.5.1.1 Imaginary Unit 


The imaginary unit is denoted by i, which represents a number different from any real number, and 
whose square is equal to —1. In electronics, instead of i the letter j is usually used to avoid accidently 
confusing it with the intensity of current, also denoted by i. The introduction of the imaginary unit leads 
to the generalization of the notion of numbers to the complex numbers, which play a very important role 
in algebra and analysis. The complex numbers have several interpretations in geometry and physics. 
1.5.1.2 Complex Numbers 
The algebraic form of a complex number is 

z—a-ib. (1.1292) 
When a and b take all possible real values, then one gets all possible complex numbers z. The number 
a is the real part, the number b is the imaginary part of the number z: 

a = Re(z), b=Im(z). (1.129b) 
For b — 0 it is z — a, so the real numbers form a subset of the complex numbers. For a = 0 it is z = ib, 
which is a ^pure imaginary number". 
The total set of complex numbers is denoted by C. 


Remark: Functions w = f(z) with complex variable z — x + iy will be discussed in function theory 


(see 14.1, p. 731 ff). 


1.5.2 Geometric Representation 
1.5.2.1 Vector Representation 


Similarly to the representation of the real numbers on the numerical axis, the complex numbers can be 
represented as points in the so-called Gaussian number plane: A number z = a + ib is represented by 
the point whose abscissa is a and ordinate is b (Fig. 1.5). The real numbers are on the axis of abscissae 
which is also called the real axis, the pure imaginary numbers are on the axis of ordinates which is 
also called the imaginary axis. On this plane every point is given uniquely by its position vector or 
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radius vector (see 3.5.1.1, 6., p. 181), so every complex number corresponds to a vector which starts at 
the origin and is directed to the point defined by the complex number. So, complex numbers can be 
represented as points or as vectors (Fig. 1.6). 


1.5.2.2 Equality of Complex Numbers 


Two complex numbers are equal by definition if their real parts and imaginary parts are equal to each 
other. From a geometric viewpoint, two complex numbers are equal if the position vectors correspond- 
ing to them are equal. In the opposite case the complex numbers are not equal. The notions “greater” 
and “smaller” are meaningless for complex numbers. 





Im(z) 
Z τε οιδώμως az 
0 Re(z) Re(z) 
Figure 1.5 Figure 1.6 Figure 1.7 


1.5.2.3 Trigonometric Form of Complex Numbers 
The form 

z—a-ib (1304) 
is called the algebraic form of the complex number. Using polar coordinates yields the trigonometric 
form of the complex numbers (Fig. 1.7): 

z = p(cos y +i sin y). (1.130b) 
The length of the position vector of a point p = |z| is called the absolute value or the magnitude of the 


complex number, the angle ọ, given in radian measure, is called the argument of the complex number 
and is denoted by arg z: 














p=\z|, p =argz =w+2kr with 0< p<œ, =r <w < +r, k=0,+1,2,.... (1.130c) 
One calls the principal value of the argument of the complex mumber. 
The relations between p, p and a, b for a point are the same as between the Cartesian and polar 
coordinates of a point (see 3.5.2.2, p. 192): 


a = pcos f, (1.131a) b = psin p, (1.131b) | p— va? 4-2, (1.131c) 
b 
arctan — for a > 0, 
a 
arccos Í for b > 0, p > 0, +5 fora =0, b> 0, 
= π 
P = 4 — arecos ^ forb « 0, p » 0, Cot = for a — 0, b « 0, 


2 


p 
undefined for p = 0 m +n fora «0, b 7 0, 
a 


(1.131d) b 
arctan- -mr fora « 0, b « 0. 
a 


(1.131e) 
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The complex number z = 0 has absolute value equal to zero; its argument arg 0 is undefined. 


1.5.2.4 Exponential Form of a Complex Number 
The representation 

z= pe? (1.132a) 
is called the exponential form of the complex number, where p is the magnitude and q is the argument. 
The Euler relation is the formula 

d? = coso +i sing. (1.132b) 
W Representation of a complex number in three forms: 
a) z=1+i¥V3 (algebraic form), b) z = 2 (cos ᾽ +i sin 5) (trigonometric form), 
c) z — 2é!$. (exponential form), considering the principal value of it. 
Without restriction to the principal value holds the representation 


d) z = 1+i v3 = 2exp h (5 + 2επ)] — 2 |cos (5 + 2k) +isin ( + 3] (k — 0, 








m 














1.5.2.5 Conjugate Complex Numbers 
Two complex numbers z and z* are called conjugate complex numbers if their real parts are equal and 
their imaginary parts differ only in sign: 

Re(z*) = Re(z), Im(z*) = —Im(z). (1.133a) 
The geometric interpretation of points corresponding to the conjugate complex numbers are points 
symmetric with respect to the real axis. Conjugate complex numbers have the same absolute value, 
their arguments differ only in sign: 


z —a-ib- p(cosq i sing) — pe?, (1.133b) 
2* =a — ib = p(cosp —i sing) = pe”. (1.133c) 


Instead of z* one often uses the notation z for the conjugate of z. 


1.5.3 Calculation with Complex Numbers 
1.5.3.1 Addition and Subtraction 


Addition and subtraction of two or more complex numbers given in algebraic form is defined by the 
formula 


zı + z2 — z3 +-+- = (ay dbi) + (a2 +ib2)— (a3 +ib3) +-+- 
(a, + ag — ag +--+) +i (bi + bo — b3 +--+). (1.134) 
The calculation can be done in the same way as doing with usual binomials. As a geometric interpreta- 
tion of addition and subtraction can be considered the addition and subtraction of the corresponding 
vectors (Fig. 1.8). For these the usual rules for vector calculations are to be used (see 3.5.1.1, p. 181). 
For z and 2*, z + z* is always real, and z — z* is pure imaginary. 











0| 1 Re(z) 


Re(z) 


Figure 1.8 Figure 1.9 Figure 1.10 
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1.5.3.2 Multiplication 


The multiplication of two complex numbers 2; and z3 given in algebraic form is defined by the following 
formula 


2120 — (αι T ibi)(ao + ib) = (aiaz = 109) +i (aib + bya2) ἃ (11354) 
For numbers given in trigonometric form holds 
2129 — |p1(cos oy -- i sin q1)][p»(cos φα 4- i sin q3)] 
= pipe|cos(yi + Ya) +i sin(yi + φο)], (1.135b) 
i.e., the absolute value of the product is equal to the product of the absolute values of the factors, and 
the argument of the product is equal to the sum of the arguments of the factors. The exponential form 
of the product is 


2120 = pipa eee» (1.135c) 


The geometric interpretation of the product of two complex numbers 2, and 2 is a vector (Fig. 1.9). It 
is generated by rotation of the vector corresponding to z; by the argument of the vector 2ο (clockwise or 
counterclockwise according to the sign of this argument), and the length of the vector will be stretched 
by [z2]. 

The product z129 can also be represented with similar triangles (Fig. 1.9). The multiplication of a 
complex number z by i means a rotation by 7/2 and the absolute value does not change (Fig. 1.10). 
For z and z*: 


259 4g, (1.136) 








22. -- ϱ; -- 


1.5.3.3 Division 
Division is defined as the inverse operation of multiplication. For complex numbers given in algebraic 
form holds 


2 


z ib byb bı — ab. 
1 Bc 1  4102-- P yi2” a (1.1372) 
z2 @ +ib ay? + be a3? + be 


For complex numbers given in trigonometric form holds 


zı pi(cosyit+ising:) — pr | 
z2 ρα(οοθφο-15ἰπρϱ) p2 
i.e., the absolute value of the quotient is equal to the ratio of the absolute values of the dividend and 
the divisor; the argument of the quotient is equal to the difference of the arguments. 
For the exponential form follows 


a P1 α(ρι-φα). (1.137c) 
22 p2 
In the geometric representation the vector corresponding to 2/22 can be generated by a rotation of the 
vector representing z; by — arg z2, and then by a contraction by | 29]. 








cos(q1 — v2) 1-1 βἰη(φι -- φα)], (1.137b) 





Remark: Division by zero is impossible. 


1.5.3.4 General Rules for the Basic Operations 
Calculations with complex numbers z = a +ib are to be done in the same way as doing with ordinary 
binomials, but considering i? = —1. Dividing a complex number by a complex number first the imag- 
inary part of the denominator has to be removed by multiplying the numerator and the denominator 
of the fraction by the complex conjugate of the divisor. This is possible because 
(a+ib)(a— ib) =a? +b? (1.138) 

is a real number. 

(3—4i)(-1+5i)? | 10+7i _ (3—4)(1-— 10i — 25) 

1+3i © 5i 1+3i 


(10+7i)i _ —2(3 — 4i )(12 + 5i) 
5ii 1+3i 
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7-100 —2(56—33i)0—3i), 7-100 — —2(—43 — 201i) , 7 — 10i 
T 


= "5 10 5 


ie 
T F191i) — 104-38.2i. 
5 (1-39 3) g (904 91i) — 104-38.2i 


1.5.3.5 Taking Powers of Complex Numbers 
The n-th power of a complex number could be calculated using the binomial formula, but it would 
be very inconvenient. For practical reasons the trigonometric form is to be used and the so-called de 
Moivre formula: 

[p(cosy +i sin y)|”" = p"(cosny +i sinny), (1.139a) 
i.e., the absolute value is raised to the n-th power, and the argument is multiplied by n. In particular, 
holds: 


i?=-1, i8=-i, it=41 (1.139b) in general jte. (1.139c) 


1.5.3.6 Taking the n-th Root of a Complex Number 
Taking of the n-th root is the inverse operation of taking powers. For z = p(cosy +i sin p) £ 0 the 
notation 

2" = 4z (n>0, integer), (1.140a) 
is the shorthand notation for the n different values 


+2k +2k 
we = yp (cos PEZET Li sia 22), 


(k 20,1,2,...,n— 1). (1.140b) 
While addition, subtraction, multiplication, division, and tak- 
ing a power with integer exponent have unique results, taking 
the n-th root has n different solutions wy. 
The geometric interpretations of the points wj are the vertices 
of a regular n-gon whose center is at the origin. In Fig. 1.11 
the six values of 4/z are represented. Figure 1.11 





1.6 Algebraic and Transcendental Equations 


1.6.1 Transforming Algebraic Equations to Normal Form 
1.6.1.1 Definition 


The variable x in the equality 

F(x) = f(x) (1.141) 
is called the unknown if the equality is valid only for certain values x1, 29,...,%, of the variable, and 
these values are called the solutions or the roots of the equation. Two equations are considered equiv- 
alent if they have exactly the same roots. 
An equation is called an algebraic equation if the functions F(x) and f(x) are algebraic, i.e., they are 
rational or irrational expressions; of course one of them can be constant. Every algebraic equation can 
be transformed into the normal form 

P(x) = dnt" + dnt" 3 4. --- az d ag — 0 (1.142) 
by algebraic transformations. The roots of the original equation occur among the roots of the normal 
form, but under certain circumstances some are superfluous. The leading coefficient a, is frequently 
transformed to the value 1. 
The exponent n is called the degree of the equation. 

r—1-4cvyz2—6 r—3 


W Determine the normal form of the equation — 335-3) =1+ 
2 -- z 





. The transformations 


1.6 Algebraic and Transcendental Equations 39 





step by step are: 

g(x — 1 + Væ? -- ϐ) -- δα(α -- 3) ἠ- 3(α -- 2}(α -- 3), a?— x xa? —6 — 3a? — 6x + 32? — 15x 
18, zV/z?—6-5z?—20z--18, az?(z?— 6) — 25x*— 2002? 4-580z? — 720a 4-324, 24r*— 200r? + 
586x? — 720a + 324 = 0. The result is an equation of fourth degree in normal form. 











1.6.1.2 System ofn Algebraic Equations 


Every system of algebraic equations can be transformed to normal form, i.e., into a system of polynomial 
equations: 


P aae) = 0 P e = e Pa esm 9. (1.143) 
The P; (i — 1,2,...,n) are polynomials in z, y, z, .... 





P 1 r-1 
W Determine the normal form of the equation system: 1. cc » 2. Y ενα, à o my-—s. 
y/u 2 y-1 
The normal form is: 1. 32:2 9 — 0, 2. z2—2rz-1—?z-2yz—z —0, 3. ay—z=0. 


1.6.1.3 Extraneous Roots 


After transforming an algebraic equation into the normal form (1.142) it can happen that the equation 
P(x) = 0 has some roots which are not solutions of the original equation (1.141). The roots of the 
equation P(x) = 0 must be substituted into the original equation to check whether they are really 
solutions of (1.141). 
Extraneous solutions can emerge if not invertible transformations are performed: 
1. Vanishing denominator If the equation has the form 

P(x) 

Q(x) 
with polynomials P(x) and Q(x), then the normal form of (1.144a) after multiplying by the denomi- 
nator Q(x) is: 

P(x) =0. (1.144b) 
The roots of (1.144b) are the same as the roots of (1.144a), except the ones which are roots both of the 
numerator and of the denominator, i.e. which satisfy P(x) = 0 and Q(x) = 0. If x — a is a root of 
the denominator, then in the case x = a the multiplication by Q(x) is a multiplication by zero. Every 
time when a non-identical transformation is performed, the checking of the solutions is necessary (see 
also 1.6.3.1, p. 43). 

x 
r-l1 z-i 

the normal form, but it is not a solution of the original equation, since the fractions are not defined for 
αι. 





=0 (1.144a) 








. The corresponding normal form is z^ — 3? — z -- 1 = 0. z4 — 1 isa solution of 


2. Irrational equations If the original equation contains radicals, the normal form is usually 
achieved by powering. E.g. squaring is not an identical transformation (since it is not invertible). 


W JVe+74+1 = 2¢ or Vx +7 = 24 — 1. By squaring both sides of the second form of the equation 
its normal form is 4r? — 5x — 6 — 0, and the roots are αι = 2 and x2 = —3/4. The root a, = 2isa 
solution of the original equation, but the root 3 — —3/4 is not. 


1.6.2 Equations of Degree at Most Four 
1.6.2.1 Equations of Degree One (Linear Equations) 


1. Normal Form 
ax 4b —0 (az 0). (1.145) 
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2. Number of Solutions 
There is a unique solution 
b 
TE d (1.146) 


a 


1.6.2.2 Equations of Degree Two (Quadratic Equations) 


1. Normal Form 


aa? -- bx -- c — 0 (a 0) (1.1472) 
or divided by a: 
x? +pr+q=0. (1.147b) 
2. Number of Real Solutions of a Real Equation Depending on the sign of the discriminant 
p 
D = 4ac— b? for (1.147a) or D =q-— a for (1.147b), (1.148) 
holds: 


e for D « 0, there are two real solutions (two real roots), 

e for D — 0, there is one real solution (two coincident roots), 

e for D > 0, there is no real solution (two complex roots). 

3. Properties of the Roots of a Quadratic Equation If x, and 2 are the roots of the quadratic 

equation (1.147a) or (1.147b), then the following equalities hold: 
b c 

zı +2 = —- = —p, zi:39——-—q. (1.149) 

a a 

4. Solution of Quadratic Equations 


Method 1: Factorization of 
aa? 4 bz d c- a(z— o)(r — B) (1.150a) or a?--pz-Fq- (x —o)(x — B), (1.150b) 
if it is successful, immediately gives the roots 
21—0, z9—f. (1.151) 
E e? +r- 6=0, z? +r- 6 [= (x+3)(x-— 2), £1 = —3, t2 = 2. 


Method 2: Using the solution formula in the cases D < 0: 
a) For (1.147a) the solutions are 








b b\? 
—-+ πλ τας 
—b+ t? — 4ac 2 B 
tp = -- (1.1522) or zip — (152b) 


If b is an even integer the second formula is to be used. 
b) For (1.147b) the solutions are 


p p 
tiz =E EA — - q. 1.153 
112 9*V4-4 ( ) 


1.6.2.3 Equations of Degree Three (Cubic Equations) 
1. Normal Form 
aa? 4- bx? -- ex -- d — 0 (a £0) (11544) 





b 
or after dividing by a and substituting y = x + 3a there is 
a 


y? +3py +2q=0 orin reduced form y? +p*y +q =0, (1.154b) 
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where 
- 2b be d x 3ac — D? 
q'-—29q Xu 3 H E and p* — 3p 3 (1.154c) 
2. Number of Real Solutions Depending on the sign of the discriminant 
D-ghp (1.155) 
holds: 


e for D > 0, one real solution (one real and two complex roots), 

e for D < 0, three real solutions (three different real roots), 

e for D = 0, one real solution (one real root with multiplicity three) in the case p = q = 0; or two real 
solutions (a single and a double real root) in the case p? = —q? 4 0. 

3. Properties of the Roots of a Cubic Equation If x, , x2, and 23 are the roots of the cubic 
equation (1.154a), then the following equalities hold: 


b c d 
T1 + T2 + T3 = a) tate + 21%3 + L923 = go Tm =. (1.156) 


4. Solution of a Cubic Equation 
Method 1: If it is possible to decompose the left-hand side into a product of linear terms 








aa? -- ba? -- cx }- ἆ -- α(ς -- α)(α -- β)(α -- η) (1.157a) 
one immediately gets the roots 
πιξα, Z2—, απ}. (1.157b) 














E r? +x? -6r = 0, r? +r? -— 6r = r(x +3)(x— 2); zı =0, z2 3, 13 — 2. 


Method 2: Using the Formula of Cardano. By substituting y = u + v the equation (1.154b) has the 
form 


u> +03 + (w+ v)(3uv + 3p) + 2g = 0. (1.158a 
This equation is obviously satisfied if 

w+v3 =—2q and uw =-—p (1.158b 
hold. Writing (1.158b) in the form 

u? v) 2 —2q, uu = p, (1.158c 





there are two unknowns u? and vê, the sum and product of which are known. Therefore using the Vieta 
root theorem (see 1.6.3.1, 3., p. 44) the solutions of the quadratic equation 


w? — (ue +08 )w + ube? = w? + 2quw — p’ = 0 (1.158d 
can be calculated: 
w =u =q +y +p, w =v? =q- VÈ +P, (1.158e 


so for the solution y of (1.154b) the Cardano formula results in 


y=u+v=Ņ -q +y + + i-a- VP+. (1.158f 


Since the third root of a complex number means three different numbers (see (1.140b), p. 38) there are 

















nine different cases, but because of uv = —p, the solutions are reduced to the following three: 
yi = Uy + vy (if possible, consider the real third roots uy and v; such that ujv; = —p), (1.158g 
1.3 i d 
pul- a? t 1.1581 
y= (-5+5V3) + (-5-5V3), (1.158h 





1 i 1 i . 
mu (75-38) n (7553). ; 
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E jy?--6y 4 2—0 withp —2, q— 1 and ? 4 p? 2 9 and u — 4/—14 3 = 4/2 — 1.2599, 
v = 4—1—3 — 4/—4 — —1.5874. The real root is yj — u «- v — —0.3275, the complex roots are 

















1 3 
U23 — -3( +u)+ Seu — v) — 0.1638 +i - 2.4659. 


Method 3: For a real equation, the auxiliary values given in Table 1.3 can be used. With p from 
(1.154c) 
τς ΤΡΙ (1.159) 


is substituted where the sign of r is the same as the sign of g. Next, using Table 1.3, one can determine 
the value of the auxiliary variable y and with it the roots y1, yo and ys depending on the signs of p and 
D-gtp. 





Table 1.3 Auxiliary values for the solution of equations of degree three 























p«o 
p»0 
P +p <O +p >O 
q q ; q 
pud. προς ünlu L 
C059 — 8 cosh = -3 sinhy = 3 
γι = —2r cos £ yı = —2r cosh £ yi--2r sinh £ 
y2 = +2r cos (so = ϐ) yo — rcosh 7 + iV3rsinh 7 y2= rsinh 7 +iv3r cosh £ 
y3 = +2r cos (so | £) y3 = r cosh = iV3r sinh Z y3 = r sinh s i 3r cosh £ 














α -9y +4=0. p= 3, q=2, @+p? <0,r= V3, cosy = = 0.3849, y = 67°27. 


2 
3/3 
yı = —2V3 cos 22°27 = —3.201, y2 = 2V3cos(60° — 22°27’) — 2.747, ys — 2/3 cos(60? -- 22927) — 
0.455. 

Checking: yı + y2 + y3 = 0.001 which can be considered 0 for the accuracy of our calculations. 


Method 4: Numerical approximate solution, see 19.1.2, p. 952; numerical approximate solution by 
the help of a nomogram, see 2.19, p. 128. 


1.6.2.4 Equations of Degree Four 


1. Normal Form 








azt + br? + cr? +dr+e=0 (a#0). (1.160) 
If all the coefficients are real, this equation has 0 or 2 or 4 real solutions. 
2. Special Forms If b — d — 0 holds, the roots of the biquadratic equation 

ax’ + cx? +e=0 (1.161a) 
can be calculated by the formulas 








—czx yc — 4ae 
T1234 = EVY, p m ——— — —. 
2a 
For a — e and b — d, the roots of the equation 
ax’ + ba? + cx? + bx +a=0 (1.161c) 
can be calculated by the formulas 


yt -4 —b x Vi? — 4ac + 8a? 


11294 = 2 |) Y= 2a 





(1.161b) 











(1.161d) 
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3. Solution of a General Equation of Degree Four 
Method 1: If somehow the left-hand side of the equation can be factorized 











ax’ + ba? + cx? + dx +e =0= a(x — a)(a — B)(x — y)(x — à) (1.162a) 
then the roots can be immediately determined: 

ασια, %=P, T=J, T=. (1.162b) 
W xt — 22? —2? + 22 =0, r(x? — 1)(x — 2) = x(x — 1)(a@ + 1)(a — 2); 
zı =0, t2 = 1, v3 --ἶ, αι 2. 


Method 2: The roots of the equation (1.162a) for a = 1 coincide with the roots of the equation 











x by — d 
P UFAZ [yt | ed, (1.163a) 
2 A 
where A = +,/8y + b? — 4c and y is one of the real roots of the equation of third degree 
8y> — dey? + (2bd — 8e)y + e(4c — b?) — d? = 0 (1.163b) 
. B bc . Κον b ς s 
with B= pc # 0. The case B — 0 gives by the help of the substitution r — u — 1? biquadratic 


equation of the form (1.161a) for u with a — 1. 
Method 3: Approximate solution, see 19.1.2, p. 952. 


1.6.2.5 Equations of Higher Degree 


It is impossible to give a formula or a finite sequence of formulas which produce the roots of an equation 
of degree five or higher (see also 19.1.2.2,2., p. 954). 


1.6.3 Equations of Degree n 
1.6.3.1 General Properties of Algebraic Equations 


1. Roots 
The left-hand side of the equation 
z" ra, 42" -E...--ag — 0 (1.1642) 


is a polynomial P; (x) of degree n, and a solution of (1.164a) is a root of the polynomial P, (a). If a is 
a root of the polynomial, then P, (x) is divisible by (x — a). Generally 

P, (x) = (a — a) P,-1(x) + P,(a). (1.164b) 
Here P, 4(x) is a polynomial of degree n — 1. If P,(«) is divisible by (a — a)*, but it is not divisible by 
(x — a)** then a is called a root of order k of the equation P,,(x) = 0. In this case a is a common root 
of the polynomial P(x) and its derivatives to order (k — 1). 
2. Fundamental Theorem of Algebra 
Every equation of degree n whose coefficients are real or complex numbers has n real or complex 
roots, where the roots of higher order are counted by their multiplicity. Denoting the roots of P(x) 
by a, 6,y7,... and they have multiplicity k, l, m, .. ., then the product representation of the polynomial 
is 

P(x) = (x — o)*(x — B)'(x —y)".... (1.165a) 
"The solution of the equation P(x) — 0 can be simplified by reducing the equation to another one, which 
has the same roots, but only with multiplicity one (if possible). In order to get this, the polynomial is 
to be composed into a product of two factors 

P(x) = Q(x)T(x), (1.165b) 
such that 


T(x) = (x—a)F (x — 8)71..., Q(z) 2 (r - o)(z — B).... (1.165c) 
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Because the roots of the polynomial P(x) with higher multiplicity are the roots of its derivative P'(a), 
too, T(x) is the greatest common devisor of the polynomial P(x) and its derivative P’(x) (see 1.1.6.5, 
p-14). Dividing P(x) by T(x) yields the polynomial Q(x) which has all the roots of P(x), and each 
root occurs with multiplicity one. 

3. Theorem of Vieta About Roots 


The relations between the n roots z1, 19, ..., Un and the coefficients of the equation (1.164a) are: 


n 
1T 29d... Z4 — Nun ο ds 
i=1 


n 
1132 + T13 +... + Lp1Ly = y Uti = An-2, 
iji 
i<j 


n 
212923 1- Ζι 2904 1-... 1- Ζῃ 92ῃ. 12ῃ = 5 LjL{Lk = —An-3, (1.166) 
i,j,k=1 
i<j<k 


T12... En = (—1)”ao. 


1.6.3.2 Equations with Real Coefficients 
1. Complex Roots 


Polynomial equations with real coefficients can also have complex roots but only pairwise conjugate 
complex numbers, i.e., if a = a + ib is a root, then 8 = a — ib is also a root, and it has the same 








multiplicity. The expressions p = —(a + 8) = —2a and q = af = a? + b? satisfy the unequation 
2 
(5) — q « 0, so that 
(x — o)(x — 8) —a?-- px - q (1.167) 


holds. Substituting the product corresponding to (1.167) for every pair of factors in (1.165a), one gets 
a decomposition of the polynomial with real coefficients into real factors. 


P(x) = (a — ay)" (a — ag)" --- (x — oj) 


(x? + pre + qu) (a? + pow + G2) + (a? + Pet + Gr). (1.168) 
Here a1, Q2,..., a, are the | real roots of the polynomial P(x). It also has r pairs of conjugate complex 
roots, which are the roots of the quadratic factors x? + p;x +q; (i = 1,2,...,r). The numbers a; (j = 


$2 
1,2,..., l), pi and q; (i = 1,2,...,r) are real and the inequalities (2) — qi « 0 hold. 


2. Number of Roots of an Equation with Real Coefficients 

According to (1.167) every equation of odd degree has at least one real root. The number of further 
real roots of (1.164a) between two arbitrary real numbers a < b, can be determined in the following 
way: 


a) Separate the Multiple Roots: Separating the multiple roots of P(x) = 0, yields an equation 
which has all the roots of the original equation, but only with multiplicity one. Then the form mentioned 
in the case of the fundamental theorem must be produced. 

For practical reasons it is a good idea to start with the determination of the Sturm chain (the Sturm 
functions (1.169)). This is almost the same as the Euclidean algorithm for determining the greatest 
common devisor, but it gives some further information. If Pm is not a constant then P(x) has multiple 
roots, which must be separated. Therefore in the following it can be assumed that P(x) = 0 has no 
multiple roots. 
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b) Creating the Sequence of Sturm Functions: 
P(x), P’(x), Py(x), Po(x),..., Pm = const. (1.169) 


Here P(x) is the left-hand side of the equation, P(x) is the first derivative of P(x), P(x) is the re- 
mainder on division of P(x) by P’(x), but with the opposite sign, P2(a) is the remainder on division of 
P'(x) by Pi(x) similarly with the opposite sign, etc.; Pn = const is the last non-zero remainder, but 
it must be a constant, otherwise P(x) and P'(x) have common devisors, and P(x) has multiple roots. 
In order to simplify the calculations the remainders can be multiplied by positive numbers, what does 
not change the result. 


c) Theorem of Sturm: If A is the number of changes in sign, i.e. the number of changes from “+” 
to “—” and vice versa, in the sequence (1.169) for x = a, and B is the number of changes in sign in the 
sequence (1.169) for x — b, then the difference A — B is equal to the number of real roots of P(x) = 0 in 
the interval [a, b]. If in the sequence some numbers are equal to zero, then they should not be considered 
in the sign change count. 


Bl Determination of the number of roots of the equation z* — 5z? -- 8r — 8 — 0 in the interval [0, 2]. 
The calculations by the Sturm functions are: P(x) — x* — ba? -- 8x — 8; P'(r) ^ 4a? — 10x 4- 8; 
Py(x) = 5a? — 12x + 16; Po(x) = —3x + 284; P3 = —1. Substituting z = 0 results in the sequence 
—8, +8, +16, +284, —1 with two changes in sign, substituting z — 2 results in +4, +20, +12, +278, —1 
with one change in sign, so A — B —2— 1 — 1, i.c., between 0 and 2 there is one root. 


d) Descartes Rule: The number of positive roots of the equation P(r) = 0 is not greater than the 
number of changes of sign in the sequence of coefficients of the polynomial P(x), and these two numbers 
can differ from each other only by an even number. 

Bl What can be told about the roots of the equation z* -- 22? — 2? -5x — 1 — 0 ? The coefficients in the 
equation have signs +, +, — , +, — , i.e., there are three changes of sign. By the rule of Descartes the 
equation has either three or one roots. Because on replacing x by —x the roots of the equation change 
their signs, and on replacing x by x + h the roots are shifted by h, the number of negative roots, or 
the roots greater than h can be estimated by the help of the rule of Descartes. In the given example 
replacing z by —z yields z^ — 22? — x? — 5x — 1 = 0, i.e., the equation has at most one negative root. 
Replacing x by x 4- 1 yields z^ 4- 62? 4- 11z? -- 13x -- 6 — 0, i.e., every positive root of the equation (one 
or three) is smaller than 1. 


3. Solution of Equations of Degree n 


Usually equations with n > 4 can be solved only approximately. In practice, approximate methods are 
also used to get solutions of equations of degree three or four (see 19.1.2.3, p. 954). 

In order to determine certain real roots of an algebraic equation the general numerical procedures for 
non-linear equations can be used (see 19.1, p. 949). In order to determine all roots, including the 
complex roots of an algebraic equation of degree n the Brodetsky-Smeal method can be used (see [1.7], 
[19.31]). In order to determine complex roots one can use the Bairstow method (see [19.31]). 


1.6.4 Reducing Transcendental Equations to Algebraic Equations 


1.6.4.1 Definition 
An equation F(x) = f(x) is transcendental if at least one of the functions F(x) or f(x) is not algebraic. 
B A: 37 = 4° ?. 2° MB: 2log; (3a — 1) — log; (127 +1) = 0, lE C: 3cosh x — sinh x + 9, 
1 
E D: 27! = 8%- — 47-2 W E: sing = cos? £ — T B F: rcosr — sin. 


In some cases it is possible to reduce the solution of a transcendental equation to the solution of an 
algebraic equation, for instance by appropriate substitutions. In general, transcendental equations 
can be solved only approximately. In the following sections some special transcendental equations are 
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discussed which can be reduced to algebraic equations. 


1.6.4.2 Exponential Equations 


Exponential equations can be reduced to algebraic equations in the following two cases, if the unknown 
x or a polynomial P(x) is only in the exponent of some quantities a, b,c,...: 


a) If the powers a” 17) pF) are connected by multiplication or division, then the logarithm can 


be taken on an arbitrary base. 

2log4 

B 3" —4*7?.2*: zlog3 2 (x —2)log4 4- zlog2; x 2 ——————— —.. 
τω... log4 — log3 + log 2 


b) If a, b, c, ... are integer (or rational) powers of the same number k, i.e., a = k”, b = k™, c = k}, . 
holds, then by substituting y = k” one can get an algebraic equation for " and after solving it follows 


lo 
the solution z — By 
log 
= ὃ 4 2c 293r 22x 
ΒΟ = gr *— 47"; uu dE. . Substitution of y — 2* results in y? — 4y? — 32y — 0 and 
yı = 8, y2 4 yz = 0; 27 = 8, 2° 4, 273 = 0, so x, = 3 follows. There are no further real 











roots. 
1.6.4.3 Logarithmic Equations 


Logarithmic equations can be reduced to algebraic equations in the following two cases, if the unknown 
x or a polynomial P(x) is only under the logarithm sign: 

a) If the equation contains only the logarithm of the same expression, then by introducing this as a 
new unknown, one can solve the equation with respect to it. The original unknown can be determined 
by using the logarithm. 


HB milog, P(x)}? +n = a\/[log, P(x)|? 4- b. The substitution y — log, P(x) yields the equation my? + 
= ayy? +b. After solving for y one gets the solution for x from the equation P(x) = a”. 


b) If the equation is a linear combination of logarithms of polynomials of x, on the same base a, with 
integer coefficients m, n, ..., i.e., it has the form mlog, Pi(x) 4- nlog, P»(x )4 .. = 0, then the left- 
hand side can be written as the logarithm of a rationale expression. (The original equation may contain 
rational coefficients and rational expressions under the logarithm, or logarithms with different bases, 
if the bases are rational powers of each other.) 
2 3x — 1)? 

d log; 1, £ á ) = 154; = 0,% = 2: 
Substituting z, = 0 in the original equation gives negative values in the logarithm, i.c., this logarithm 
is a complex value, so x = 0 is not a solution. 











E 2log; (3x — 1) — log,(12% + 1) — 0, logs 


1.6.4.4 Trigonometric Equations 


Trigonometric equations can be reduced to algebraic equations if the unknown z or the expression nx +a 
with integer n is only in the argument of the trigonometric functions. After using the trigonometric 
formulas (see 2.7.2, p.81) the equation will contain only one unique function containing z, and after 
replacing it by y an algebraic equation arises. The solution for x is obtained from the solutions for y, 
naturally taking the multi-valuedness of the solution into consideration. 


1 1 3 
E sinz = cos? z — -orsing =1— sin? x — 1: Substituting y = sin x yields y? + y — 17 0 and 


1 3 : 2 : 1 
n=zp =z The result y2 gives no real solution, because | sin x| < 1 for all real x; from yı = 5 
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π 5r . 
follows x = 6 + 2kr and x = € + 2kx with k = 1,2,3,.... 


1.6.4.5 Equations with Hyperbolic Functions 
Equations with hyperbolic functions can be reduced to algebraic equations if the unknown z is only in the 
argument of the hyperbolic functions. Rewriting the hyperbolic functions as exponential expressions, 


1 
then substituting y — c* and — = e *, and the result is an algebraic equation for y. After solving this 
y 


the solution is x = ln y. 














E ou pt __ et 2 
E 3coshr «κο νο E T b = 49; c? 4267-920; y 5-9 — 0,42 994-2 — 0; 
y 
ya = =e T pm 2 2.1716, x4 — we & —14784. 





2 Functions 


2.1 Notion of Functions 
2.1.1 Definition of a Function 
2.1.1.1 Function 


If x and y are two variable quantities, and if there is a rule which assigns a unique value of y to a given 
value of x, then y is called a function of x, using the notation 

y= f(x). (2.1) 
The variable x is called the independent variable or the argument of the function y. The values of x, 
to which a value of y is assigned, form the domain D of the function f(x). The variable y is called the 
dependent variable; the values of y form the range Wof the function f(x). Functions can be represented 
by the points (x, y) as curves, or graphs of the function. 
2.1.1.2 Real Functions 
If both the domain and the range contain only real numbers the function y = f(x) is called a real 
function of a real variable. 
E A: y=? with D: —œ <z <œ, W: 0<y< 0%. 
B B: y— /x with D: 0€ x «oo, W:0<y<om. 
2.1.1.3 Functions of Several Variables 
If the variable y depends on several independent variables c1, 22, . .. , 5, then the notation 

y — f (iriimov ss su) (2.2) 
is used for a function of several variables (see 2.18, p. 118). 
2.1.1.4 Complex Functions 
If the dependent and independent variables are complex numbers w and z respectively, then w = f(z) 
means a complex function of a complex variable, (see 14.1, p. 731). Complex-valued functions w(a) are 
called complex functions even if they have real arguments  . 


2.1.1.5 Further Functions 


In different fields of mathematics, for instance in vector analysis and in vector field theory (see 13.1, 
p. 701), other types of functions are to be considered whose arguments and values are defined as follows: 


1. The arguments are real — the function values are vectors. 


W A: Vector functions (see 13.1.1, p. 701). 
E B: Parameter representations of curves (see 3.6.2, p. 256). 


2. The arguments are vectors — the function values are real numbers. 
E Scalar fields (see 13.1.2, p. 702). 
3. The arguments are vectors — the function values are vectors. 
Wi A: Vector fields (see 13.1.3, p. 704). I B: Parametric representations or vector forms of surfaces 
(see 3.6.3, p. 261). 
2.1.1.6 Functionals 
If a real number is assigned to every function x = x(t) of a given class of functions, then it is called a 
functional. 
b 
W A: Ifx(t)isa given function which is integrable on fa, b], then f(a) = / x(t) dt isa linear functional 
Ja 
defined on the set of continuous functions x integrable on [a, b] (see 12.5, p. 677). 
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W D: Integral expressions in variational problems (see 10.1, p. 610). 


2.1.1.7 Functions and Mappings 
Suppose there are given two non-empty sets X and Y. A mapping, which is denoted by 

f:X OY, (2.3) 
isa rule, by which a uniquely defined element y of Y is assigned to every element x of X. The element 
y is called the image of x, as formula y = f(x). The set Y is called the image space or range of f, the 
set X is called the original space or domain of f. 
W A: If both the original and the image spaces are subsets of real numbers, i.e., X = D C R and 
Y = W C R hold, then (2.3) defines a real function y = f(x) of the real variable 2. 
E B: If f is a matrix A = (aj) (i = 1,2,...,m;j — 1,2,...,n) of type (m,n) and X = R” and 
Y = R”, then (2.3) defines a mapping from R” into R". The rule (2.3) is given by the following 
system of m linear equations: 


Yr = Auti + Arzt + + AinTn 
s Yo = Gi, + A2272 T `° T Amn 

ΥΞ Αχ Or : 
Ym = Gm1%1 + Am2%2 + +++ T Amnn 








ie. Ax means the product of the matrix A and the vector x. 
Remarks: 
1. The notion of mapping is a generalization of the notion of function. So, some mappings are some- 
times called functions. 





2. The important properties of mappings can be found in 5.2.3, 5. p. 333. 

3. A mapping, which assigns to every element from an abstract space X a unique element usually from 
a different abstract space Y, is called an operator. Here an abstract space usually means a function 
space, since the most important spaces in applications consist of functions. Abstract spaces are for in- 
stance linear spaces (see vector spaces 5.3.8, p. 365), metric spaces (see 12.2, p. 662) and normed spaces 
(see 12.3, p. 669). 


2.1.2 Methods for Defining a Real Function 
2.1.2.1 Defining a Function 


A function can be defined in several different ways, for instance by a table of values, by graphical rep- 
resentation, i.e., by a curve, by a formula, which is called an analytic expression, or piece by piece with 
different formulas. Only such values of the independent variable can belong to the domain of an an- 
alytic expression for which the function makes sense, i.e., it takes a unique, finite real value. If the 
domain is not otherwise defined, the domain is considered as the maximal set for which the definition 
makes sense. 

2.1.2.2 Analytic Representation of a Function 

Usually the following three forms are in use: 

1. Explicit Form: 

y = f(x). (2.4) 
E y= vl- r?, —l1<x<1, yÈ 0. Here the graph is the upper half of the unit circle centered at 
the origin. 

2. Implicit Form: 

F(v, y) — 0, (2.5) 
in the case when there is a unique y which satisfies this equation, or it can be told which solution is 
considered to be the value of the function. 

WBi-y-1-0, -l<2x< +l, ων 0. Here the graph is again the upper half of the unit 
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circle centered at the origin. It should be emphasized that z? + y? + 1 = 0 itself does not define a real 
function. 
3. Parametric Form: 

r= y(t), y= v(t). (2.6) 
The corresponding values of x and y are given as functions of an auxiliary variable t, which is called a 
parameter. The functions y(t) and y(t) must have the same domain. This representation defines a real 
function only if x = y(t) defines a one-to-one correspondence between x and t. 
E = olt), y=Ņ(t) with e(t) 2 cost and v(t) 2 sint, 0 € t € v. Here the graph is again the 
upper half of the unit circle centered at the origin. 
Remark: Functions given in parametric form sometimes do not have any explicit or implicit parameter- 
free equation. 
Bc =t+2sint = y(t), y=t—cost = w(t). 


Examples for Functions Given Piece by Piece: 


y y y=x-E(x) BMA: y= E(x) = int(x) = [z] = n for 
— 1 n<u<n+1, n integer. 

y=E(x) + The function E(x) or int(x) (read “integer 
=i part of x”) means the greatest integer less 

than or equal to 2. 
HB: The function y = frac(x) = x — [x] 
(read “fractional part of x” ) gives the differ- 
[123456 X 0|] 12345 "E ence of x and [x] (Fig. 2.1b). Fig. 2.1a,b 
shows the corresponding graphical repre- 
a) b) sentations, where the arrow-heads mean 
that the endpoints do not belong to the 

Figure 2.1 curves. 


= 


— 





5ο! -: Ὁ οὐ ἠ5 σι 


p forga O, 22. 
BC: y= {2 frs >Ó, (Fig. 2.2a). 


—1 for x < 0, 
E D: y = sign(r) — 0 for x = 0, 

+1 for x > 0, 
(Fig. 2.2b). By sign(x) (read “signum 
x”), the sign function is denoted. 








a) 


Figure 2.2 


2.1.3 Certain Types of Functions y y 

2.1.3.1 Monotone Functions uu 

If a function satisfies the relations T | ! 
| 





X [EscscN 


» » 
f(xa) 2 (αι) οἱ f(z) < f(a), (2.7a) of x 2X of Χι Xx 
for arbitrary arguments 7; and x2 with x2 > 2 inits do- a) b) 
main, then it is called monotonically increasing or mo- 
notonically decreasing (Fig. 2.3a,b). Figure 2.3 


If one of the above relations (2.7a) does not hold for every x in the domain of the function, but it is valid, 
e.g., in an interval or on a half-axis, then the function is called monotonic in this domain. Functions 
satisfying the relations 


F(x2) > f(1) or (ας) αι), (2.2) 
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i.e., when the equality never holds in (2.7a), are called strictly monotonically increasing or strictly mono- 
tonically decreasing. In Fig. 2.3a there is a representation of a strictly monotonically increasing func- 
tion; in Fig. 2.3b there is the graph of a monotonically decreasing function being constant between xı 
and x. 


E y = e” is strictly monotonically decreasing, y = ln z is strictly monotonically increasing. 


2.1.3.2 Bounded Functions 


A function is called bounded above if there is a number (called an upper bound) such that the values 
of the function never exceed it. A function is called bounded below if there is a number (called a lower 
bound) such that the values of the function are never less than this number. If a function is bounded 
above and below, it simply is called bounded. 


E A: y= 1-— x? is bounded above (y < 1). W B: y= e" is bounded below (y > 0). 


4 
WB C: y — sinz is bounded (—1 < y < +1). BD: y= ΤΕ is bounded (0 « y < 4). 
x 





2.1.3.3 Extreme Values of Functions 
The function f(a) with domain D has an absolute or global maximum at the point a, if for all x € D 


f(a) > f(x) (2.8a) 
holds. The function f(a) has a relative or local maximum at the point a, if the inequality (2.8a) holds 
only in an environment of the point a, i.e. for all r witha — e «r«a-e,6»0,r€ D. 


In analogy the definition of an absolute or global minimum as well as for a relative or local minimum can 
be given, but the inequality (2.8a) is to be replaced by 

f(a) € f(x). (2.8b) 
Remarks: 
a) The notions maximum and minimum, are called the extreme values, they are not coupled to the 
differentiability of functions, i.e., they hold also for functions which are not differentiable in some points 
of the domain. Examples are discontinuities of curves (see Figs.2.9, p. 58 and 6.10b,c, p. 443). 
b) Criterions for the determination of extreme values of differentiable functions see in 6.1.5.2, p. 443. 


2.1.3.4 Even Functions 

Even functions (Fig. 2.4a) satisfy the relation y y 
I(-2) = f(a). (2.9a) 

If D is the domain of f, then 





(x € D) = (-x € D) (2.9b) " ` 
should hold. 0 x 0 pu x 
BA: y=cosz, MB: y=2*—32?+1. a) b) 
2.1.3.5 Odd Functions Figure 2.4 


Odd functions (Fig. 2.4b) satisfy the relation 

f(-x) = -f (x). (2.10a) 
If D is the domain of f, then 

(z € D) => (—z € D) (2.10b) 
should hold. 
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BA: y=sinz, E B: y= r-r. 
2.1.3.6 Representation with Even and Odd Functions 


If for the domain D of a function f the condition “from x € D it follows that —x € D” holds, then f 
can be written as a sum of an even function g and an odd function u: 


f(x) = 92) + ula) with 9(2) = Ie) +, ua) = ie- a (811) 





1 1 
E f(r) = e= s (e H e*) “7 G e) — cosh x + sinh a (see 2.9.1, p. 89). 


2.1.3.7 Periodic Functions y T 
Periodic functions satisfy the relation 
f(x--T)-—f(x) T const, T Z 0. (2.12) 
Obviously, if the above equality holds for some T, it holds for 0 T x 
any integer multiple of T. The smallest positive number T sat- 
isfying the relation is called the period (Fig. 2.5). Figure 2.5 


2.1.3.8 Inverse Functions 
A function y = f(x) with domain D and range W assigns a unique y € W to every x € D. If reversed, 
to every y € W there belongs only one x € D, then the inverse function of f can be defined. It is 
denoted by y or by f-!. Here f^! is a symbol for a function, not a power of f. 
To find the inverse function of f, the variables x and y are interchanged in the formula of f, then y is 
expressed from x = f(x) in order to get y = y(x). The representations y = f(x) and x = (y) are 
equivalent. The following important formulas come from this relation 

f(e(y) — vy and φί/())--α. (2.13) 
The graph of an inverse function y = v(x) is obtained by reflection of the graph of y = f(x) with 
respect to the line y = x (Fig. 2.6). 
E The function y = f(x) = e”? (D: —oo € xr € oo,W : y » 0) is equivalent Obviously, every strictly 
monotonic function has an inverse function. 


Y 





Figure 2.6 





MA: y=f(x)=2" with D: «> 0, W: y>0; 
= g(x) = yr with D: r0, W: y>0. 
ΒΒ: y=f(r) =e with D: —oo« x « oo, W: y>0; 
y = (x) =Ine with D: «>0, W: =œ < y< œ. 
BC: y=f(x) =sinz with D: -n/2<a<7n/2, W: -1l<y<\1; 
y = (x) =aresing  wih D: -l<a<1l, W: —r/2 < y < 7/2 
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Remarks: 

1. If a function f is strictly monotone in an interval Z C D, then there is an inverse f~! for this interval. 
2. If a non-monotone function can be partitioned in strictly monotone parts, then the corresponding 
inverse exists for each part. 


2.1.4 Limits of Functions 


2.1.4.1 Definition of the Limit of a Function 
The function y = f(x) has the limit A at z = a 

lim f(x) =A or f(t) >A for za, (2.14) 
if as x approaches the value a infinitely closely, the value of f(x) approaches the value A infinitely 
closely. The function f(x) does not have to be defined at a, and even if defined, it does not matter 
whether f(a) is equal to A. 
Precise Definition: The limit (2.14) exists, if for any given positive number e there is a positive 
number y such that for every « 4 a belonging to the domain and satisfying the inequality 

[r—a| «€ n, (2.15a) 


the inequality 

|f(z) - A| « e (2.15b) 
holds eventually with the exception of the point a (Fig. 2.7). 
If a is an endpoint of a connected region, then the inequality 
|a — a| < 1 is reduced either to a — 7 € x or to x < a +n (see 
also 2.1.4.5). 








2.1.4.2 Definition by Limit of Sequences (see ol aaam x 


7.1.2, p. 458) Figure 2.7 

A function f(x) has the limit A at x = a if for every sequence 

£1, T2, . - - , n, ... Of the values of x from the domain and converging to a (but being not equal to a), the 
sequence of the corresponding values of the function f (x1), f (x2), ..., f (x4), ... converges to A. 


2.1.4.3 Cauchy Condition for Convergence 

A necessary and sufficient condition for a function f(x) to have a limit at x = a is that for any two 
values zı Z a and x3 Æ a belonging to the domain and being close enough to a, the values f (11) and 
f (13) are also close enough to each other. 

Precise Definition: A necessary and sufficient condition for a function f(a) to have a limit at x = a 
is that for any given positive number e there is a positive number n such that for arbitrary values xı 
and zz belonging to the domain and satisfying the inequalities 


O«c|ri—a| «9. and 0 «|vo — a| «€ m, (2.164) 
the inequality 
f (x1) — f(x3)| « & (2.16b) 


holds. 


2.1.4.4 Infinity as a Limit of a Function 
The symbol 
lim |f(x)| = co (2.17) 





means that as x approaches a, the absolute value |f (x)| does not have an upper bound. 
Precise Definition: The equality (2.17) holds if for any given positive number K there is a positive 
number 7 such that for any x Z a from the interval 


α-η«αχ«α!ἠ-η (2.184) 
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the corresponding value of | f (x)| is larger than K: 

|f(z)) » K. (2.18b) 
If all the values of f(a) in the interval 

a-n<u“u<a+n (2.18c) 
are positive, one writes 

lim f(x) = +00; (2.18d) 
if they are negative, one writes 


lim f(x) = —oo. (2.156) 


2.1.4.5 Left-Hand and Right-Hand Limit of a Function 
A function f(x) has a left-hand limit A^ at x = a, if as z tends to a from the left, the value f(x) tends 
to A`: 

A` = lim f(x) — f(a — 0). (2.194) 


α-λα-0 


Similarly, a function has a right-hand limit At if as x tends to a from the right, the value f(a) tends to 
At: 


At= lim f(r) = f(a+0). (2.19b) 


r—a-0 


The equality lim f(x) = A is valid only if the left-hand and 


right-hand limits exist, and they are equal: 





At=A =A. (2.19c) ! 
1 1 
E The function f(x) = ———y tends to different values from ' 
l4 ez-t 1 
the left and from the right for x —> 1: f(1—0) = 1, f(1+0) = 0 
(Fig. 2.8). Figure 2.8 


2.1.4.6 Limit of a Function as x Tends to Infinity 


Case a) A number A is called the limit of a function f(x) as x — --oo, and one writes 
= i τ 20a 

A Hm. f(x) (2.20a) 
if for any given positive number £ there is a number N > 0 such that for every x > N, the corresponding 
value f(x) isin the interval A — € < f(x) < A +e. Analogously 

A= εἴπη f(x) (2.20b) 
is the limit of a function f(x) as x — —oo if for any given positive number e there is a positive number 
N > Osuch that for any « < —N the corresponding value of f(a) isin the interval A—e < f(x) < A+e. 
. rcl . rcl E 
WA: lin Ξ1, MB: lin =1, MC: lim e?=0. 


rto m z-—oo0 mp z—-—o00 








Case b) Assume that for any positive number K, there is a positive number N such that if x > N or 
x < —N then the absolute value of the function is larger then K. In this case one writes 


,Um. |f(@)| =co or lim |7(ῳ) — oo. (2.20c) 
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P a?—1 Í 2 —1 
WA: lin = +00, M B: lim — —o0, 
α-!οο 2 z—-—oo 2 
: Q2 04-4 
WB C: lim ——oo, BD: lim = +00. 
z—++00 12 z—-—oo x2 





2.1.4.7 Theorems About Limits of Functions 


1. Limit of a Constant Function The limit of a constant function is the constant itself: 
lim A = A. (2.21) E 
ra 


2. Limit ofa Sum or a Difference If among a finite number of functions each has a limit, then the 
limit of their sum or difference is equal to the sum or difference of their limits (if this last expression 
does not contain oo — oo): 


lim [f() + g(a) - v(2)] — lim f(x) + lim g(x) — lim (2). (2.22) 


3. Limit of Products If among a finite number of functions each has a limit, then the limit of their 
product is equal to the product of their limits (if this last expression does not contain a 0 - oo type): 


lim (f) e) 9G] — [im F(0)] [lim e] [im o] - (223) 


4. Limit of a Quotient The limit of the quotient of two functions is equal to the quotient of their 
limits, in the case when both limits exist and the limit of the denominator is not equal to zero (and this 
last expression is not an oo/oo type): 
A lim f(x) 
x 
im f(z) = moy (2.24) 
zcaqpmp z 
φία) — lim y(«) 


Also if the denominator is equal to zero, usually one can tell if the limit exists or not, checking the sign 
of the denominator (the indeterminate form is 0/0). Similarly, one can calculate the limit of a power 
by taking a suitable power of the limit (if it is not a 0°, 1°, or o0° type). 


5. Pinching If the values of a function f(x) lie between the values of the functions y(x) and y(x), 
i.e., p(x) < f(x) < y(x), and if lim p(x) = A and lim y(x) = A hold, then f(x) has a limit, too, and 
lim f(x) = A. (2.25) 


2.1.4.8 Calculation of Limits 


The calculation of the value of a limit can be made by using the 5 described theorems as well as some 
transformations (see 2.1.4.7). 
1. Suitable Transformations 


For the calculation of limits the expression is to be transformed into a suitable form. There are several 
types of recommended transformations in different cases; here are three of them as examples. 


























z3—1 
E A: lim = lim (£? +x +1) = 3. 
z31 py — z—1 
>. vli+r-1 ,. (Vlctz-l)(yVlctr-cl) , 1 1 
E B: lim lim lim Ξ-- 
20 x 20 z(V/1-4 c1) zo0/1c-r-1 2 
5] T 2 il 2 5] T 
B C: lim 2 2x lim (sin 2x) sin 2x 


— 2 lim 2. Here one can refer to the well-known theorem 
230 m 230 x 2230 2g 
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sina 
im 
a30 Q 


2. Bernoulli-U'Hospital Rule 





—1 (sce B A, 2.1.4.9, p. 57)). 


0 
In the case of indeterminate forms like Ur d 0 - oo, oo — oo, 0? , oo? , 1%, one often applies the 
οο 
Bernoulli-l’Hospital rule (usually called l’Hospital rule for short): 
ea) 


Va) 





0 oo 
Case a) Indeterminate Forms αρ ο First, use the theorem only after checking if for f(a) = 
oo 


the following conditions are fulfilled. 
Suppose lim (a) — 0 and Iim v(z) — 0 or lim p(x) = o0 and lim (x) — oo, and suppose that there 


isan αμα containing a such that the functions y(x) and y(x ` are defined and differentiable in this 





/ 
interval except perhaps at a, and v/(x) Z 0 in this interval, and lim i ς 


x "(a 
lim f(a) = lim el) = lim eG) : (2.26) 
i ray) (x |) ia V (x) 
Remark: If the limit of the ratio of the derivatives does not exist, it does not mean that the original 
limit does not exist. Maybe it does, but one cannot tell this using l'Hospital's rule. 


2 exists. Then 








If lim E Mu J is still an indeterminate form, and the numerator and denominator satisfy the assumptions 


of the above theorem, l'Hospital's rule can be used again. 


























2cos2x 2 
Insin 2x an . 2tanr .,. co . cos?2x 
E lim lim sin 2g = lim lim COS lim 
z>0 Insina x0 «0 tan 2x x0 2 z0 cos?z 
sin g cos? 2x 


Case b) Indeterminate Form 0 - oo: Having f(x) = y(x) W(x) and lim (x) = 0 and 


lim w(x) — oo, then in order to use l'Hospital's rule for lim f (x) itis to be transformed into one of the 
ra 


p(x) (x) 


: x LU "T : : 0 oo. . 
forms lim or lim , 80 it is reduced to an indeterminate form — or — like in case a). 
za 1 z3a 1 0 oo 


φ(α) plz) 











π-2α : -2 











WB lim (7 —2x)tanz = lim = lim =2. 
an /2 z—7/2 cotz z—m-/2 — 1 
sin?z 
Case c) Indeterminate Form oo — oo: If f() = o(r) — (v) and lim p(x) = oo and lim y(x) = o0 


t : $ 0 oo : . 
hold, then this expression can be transformed into the form Ü or — usually in several different ways; 
οο 








t. 1 1 
for instance as yp —  — ( -- 5) / —. Then it is to proceed as in case a). 

v oj] ev 

4 1 rlir—z-l 0 ; : ; ; 
E lim C : ) = lim | == τ — —. Applying l'Hospital rule twice yields 
r—1 lng πο rlnnr—lnzc 0 i 
1 
. rlnir—zrz-4l . lng . " 1 
lim lenin = lim = lim I T Ele: 
zo1A rlir—lnz πε = Itu 
x x 


Case d) Indeterminate Forms 0° , 00° , 1°: If f(x) = v(x)” and lim g(x) = +0 and lim (a) = 
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0 holds, then first the limit A of In f(a) = y(x) ln y(x), is to be calculated, which has the form 0 - co 
(case b)), then lim f (a) = e^ holds. 


The procedures in the cases 00° and 1° are similar. 








Ina 
E lim 2” =X, na*? =alnz, lim rlnz — lim = lim (—7z) = 0, i.e., A = ln X = 0, 
r—40 g r—40 r40 z-— r—40 


so X — 1l, and finally lim x” = 1. 
r—40 


3. Taylor Expansion 
Besides l'Hospital's rule the expansion of functions of indeterminate form into Taylor series can be 
applied (see 6.1.4.5, p. 442). 


a? 2 a? 
. z—[r—l-c^4u—-e 
. £—sinz . 3! Bb . 1 az? 1 
B in — lim — lim +] ==. 
z>0 g? 20 28 2503! — 5! 6 


2.1.4.9 Order of Magnitude of Functions and Landau Order Symbols 


Comparing two functions, often their mutual behavior with respect to a certain argument x = a is to 
be considered. It is also convenient to compare the order of magnitude of the functions. 





1. A function f(a) tends to infinity at a at a higher order (or faster rate) than a function g(x) at a, if 


D 
the quotient Τα) and the absolute values of f(x) exceed any limit as x tends to a. 
2 











2. Afunction f(x) tends to zero at a at a higher order than a function g(x) at a, if the absolute values 
f(x) 
g(x) 
3. Two functions f(r) and g(x) tend to zero or to infinity at s at the same order of magnitude, if 

x 
Λη < M holds for the absolute value of their quotient as x tends to a, where M and m 
g(x 


are constants. 


tends to zero as x tends to a. 





of f(x), g(x) and the quotient 


O<m< 








4. Landau Order Symbols The mutual behavior of two functions at a point x = a can be described 
by the Landau order symbols O (“big O”), or o (“small o”) as follows: If x — a then 











f(x) =O(g(a)) means that lim fr) — A70, Ac-const, (2.272) 
ra g(x) 
and 
f(x) =o(g(@)) means that lim Fla) — 0, (2.27b) 
ra g X 
where a = +00 is also possible. The Landau order symbols have meaning only by assuming x tends to 
a given a. 


sinc 





ΒΑ: sina = O(a) for « + 0, because with f(x) — sin x and g(r) = x holds: lim = 1# 0, i6; 
E x 

sin x behaves like x in the neighborhood of x = 0. 

E B: For f(x) = 1-— cosx and g(x) = sin z the function f(x) vanishes with a higher order than g(x): 

f(x) ΠΝ 1—cosx 

r0 








- = 0, i.e., 1 — cos z = o(sin z) for z — 0. 
sinc 








B C: /(z) and g(x) vanish by the same order for f(x) = 1 — cosx, g(x) = x°: 
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. |1-ecosr 
— lim 
r0 











==, ie., 1 — cosa = O(2?) for > 0. 
x 











5. Polynomial The order of magnitude of polynomials at +00 can be expressed by their degree. So 
the function f(x) = x has order 1, a polynomial of degree n + 1 has an order higher by one than a 
polynomial of degree n. 


6. Exponential Function The exponential function e” tends faster to infinity for vr — oo more 
quickly to infinity than any high power z" (n is a fixed positive number): 
e7 


lim 
Ώ-λοο 





— oo. (2.282) 








Z2 
'The proof follows by applying l'Hospital's rule for a natural number n: 
ct οἵ x 


lim = lim 13.440 lim 
x00 yn zoo nyn—l z> n! 











— oo. (2.28b) 


7. Logarithmic Function The logarithm tends to infinity more slowly than any small positive power 
x (ais a fixed positive number): 


log x 





lim 


=0. (2.29) 








qo 


The proof is with the help of l’Hospital’s rule. 
2.1.5 Continuity of a Function 
2.1.5.1 Notion of Continuity and Discontinuity 


Most functions occurring in practice are continuous, i.e., for small 
changes of the argument x a continuous function y(x) changes 
also only a little. The graphical representation of such a func- 
tion results in a continuous curve. If the curve is broken at some 
points, the corresponding function is discontinuous, and the val- 
ues of the arguments where the breaks are, are the points of dis- 
continuity. Fig. 2.9 shows the curve of a function, which is piece- 
wise continuous. The points of discontinuity are A, B, C, D, E, F 
and G. The arrow-heads show that the endpoints do not belong 
to the curve. 


2.1.5.2 Definition of Continuity 
A function y = f(x) is called continuous at the point x = a if Figure 2.9 


1. f(x) is defined at a; 
2. the limit lim f(a) exists and is equal to f (a). 





B 


This is exactly the case if for an arbitrary ¢ > 0 there is a d(¢) > 0 such that 
f(x)— f(a)| <£ for every x with |x— a| < ð (2.30) 





holds. 

Here also it is to talk about one-sided (left- or right-hand sided) continuity, if instead of lim f(x) = f(a) 
only the one-sided limit lim 5 f(x) (or lm. d f (a)) is to be considered and this is equal to the value f(a). 
Ifa function is continuous for every x in a given interval from a to b, then the function is called continuous 
in this interval, which can be open, half-open, or closed (see 1.1.1.3, 3., p. 2). If a function is defined 
and continuous at every point of the numerical axis, it is called continuous everywhere. 

A function has a point of discontinuity at x — a, which is an interior point or an endpoint of its domain, 
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if the function is not defined here, or f(a) is not equal to the limit lim f(x), or the limit does not exist. 


If the function is defined only on one side of x — a, e.g., --/z for x = 0 and arccos x for x = 1 , then it 
is not a point of discontinuity but it is a termination. 

A function f(x) is called piecewise continuous, if it is continuous at every point of an interval except at 
a finite number of points, and at these points it has finite jumps. 


2.1.5.3 Most Frequent Types of Discontinuities 


1. Values ofthe Function Tend to Infinity 


The most frequent discontinuity occurs if the function tends to +00 (points B, C, and E in Fig. 2.9). 
BA: f(x) =tanz, f ( - 0) = +00, f G + 0) — —oo. The type of discontinuity (see Fig. 2.34, 


p. 78) is the same as at E in Fig. 2.9. For the meaning of the symbols f(a — 0), f(a + 0) see 2.1.4.5, 
p. 54. 
1 


HB: f(x) = ——,> 
F(x) (x — 1)?’ 
the point B in Fig. 2.9. 





f(1—0) = +00, f(1+0) = +00. The type of discontinuity is the same as at 


1 
HC: f(z) — esi, f(1—0) — 0, f(1-- 0) — oo. The type of discontinuity is the same as at C in 
Fig. 2.9, with the difference that this function f(x) is not defined at x — 1. 

2. Finite Jump 


Passing through x = a the function f(a) jumps from a finite value to another finite value (like at the 
points A, F, G in Fig. 2.9, p. 58): The value of the function f(a) for x = a may not be defined here, 
as at point G; or it can coincide with f(a — 0) or with f(a 4- 0) (point F); or it can be different from 
f (a — 0) and f (a 4- 0) (point A). 


BA: f(x) = a f(1—0) — 1, f(1-4- 0) — 0 (Fig. 2.8, p. 54). 
1 γεα- 1 
WB B: /(r) = E(x) (Fig. 2.1c, p. 50) f(n—0) 2n— 1, f(n-- 0) 2n (n integer). 


B ο: f(x) = lim. f(1—0)21, f(14-0) 2 0, 0-2. 


3. Removable Discontinuity 

Assuming that lim f(x) exists, i.e., f(a — 0) = f(a+ 0) holds, but either the function is not defined 
for « = aor there is f(a) A lim f (x) (point D in Fig. 2.9, p. 58). This type of discontinuity is called 
removable, because defining f(a) — lim f (x) the function becomes continuous here. The procedure 


consists of adding only one point to the curve, or changing the place only of one point at D. The 
different indeterminate expressions for x = a, which have a finite limit examined by l'Hospital's rule 
or with other methods, are examples of removable discontinuities. 


yl+zr-1 


1 
B f(x) = is an undetermined τ expression for « = 0, but lim f(x) = 2. the function 
X T 
Vit*z-1 
πο κα for x #0 
πως 5 
1 
5 for r —0 


is continuous. 
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2.1.5.4 Continuity and Discontinuity of Elementary Functions 


The elementary functions are continuous on their domains; the points of discontinuity do not belong 
to their domain. The following theorems hold: 


1. Polynomials are continuous everywhere. 





2. Rational Functions with polynomials P(x) and Q(x) are continuous everywhere except 


(x) 

x) 
the points z, where Q(r) = 0. If at « = a, Q(a) = 0 and P(a) ¥ 0, the function tends to oo on 
both sides of a; this point is called a pole. The function also has a pole if P(a) = 0, but a is a root of 
the denominator with higher multiplicity than for the numerator (see 1.6.3.1, 2., p. 43). Otherwise the 
discontinuity is removable. 





3. Irrational Functions Roots of polynomials are continuous for every x in their domain. At the end 
of the domain they can terminate by a finite value if the radicand changes its sign. Roots of rational 
functions are discontinuous for such values of x where the radicand is discontinuous. 


4. Trigonometric Functions The functions sin x and cos x are continuous everywhere; tan x and 


(2n +1)r 


secx have infinite jumps at the points x — ; the functions cot x and cosec x have infinite 


jumps at the points x = nr (n integer). 


5. Inverse Trigonometric Functions The functions arctan z and arccot x are continuous every- 
where, arcsin x and arccos x terminate at the end of their domain because of —1 < x < +1, and they 
are continuous here from one side. 


6. Exponential Functions e” or a” with a > 0 They are continuous everywhere. 


7. Logarithmic Function log x with Arbitrary Positive Base The function is continuous for all 
positive x and terminates at 2 = 0 because of him log x = —oo by a right-sided limit. 
ᾳ-» 


8. Composite Elementary Functions The continuity is to be checked for every point x of every 
elementary function containing in the composition (see also continuity of composite functions in 2.1.5.5, 


2., p. 61). i 


ez-2 





W Find the points of discontinuity of the function y — The exponent 


zsinyl—zcz t= 


1 1 1 
infinite jump at x — 2; for z — 2 also ez-? has an infinite jump: (c=) =0; (c=) = 00: 
2—2-0 2—240 


has an 
2 


'The function y has a finite denominator at x — 2. Consequently, at r — 2 there is an infinite jump of 
the same type as at point C in Fig. 2.9, p. 58. 

For « = 0 the denominator is also zero, just like for the values of z, for which sin V/1 — x is equal to 
zero. These last ones correspond to the roots of the equation V/1 — z — nz or z — 1 — n?z?, where n 
is an arbitrary integer. The numerator is not equal to zero for these numbers, so at the points x = 0, 
£= 1, x = 17r, x= 18r’, x= 127r’, ...the function has the same type of discontinuity as 
the point E in Fig. 2.9, p. 58. 














2.1.5.5 Properties of Continuous Functions 


1. Continuity of Sum, Difference, Product and Quotient of Continuous Functions 





If f(x) and g(x) are continuous on the interval [a, b], then f(x) + g(a) , f(a) g(x) are also continuous, 


f(x) 


and if g(a) # 0 on this interval, then is also continuous. 
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2. Continuity of Composite Functions y — f(u(x)) 
If u(x) is continuous at x = a and f(u) is continuous at u — u(a) then the composite function y = 
f (u(x)) is continuous at 7 = a, and 


lim f(u()) = F (lim u(0)) = F(u(a)) (2.31) 


is valid. This means that a continuous function of a continuous function is also continuous. 
Remark: The converse sentence is not valid. It is possible that the composite function of discontinuous 
functions is continuous. 
3. Bolzano Theorem 
If a function f(x) is continuous on a finite closed interval [a,b], and f(a) and f(b) have different signs, 
then f(x) has at least one root in this interval, i.e., there exists at least one interior point of this interval 
c such that: 

f(c) 20 with a<c<b. (2.32) 
The geometric interpretation of this statement is that the graph of a continuous function can go from 
one side of the x-axis to the other side only if the curve has an intersection point with the x-axis. 
4. Intermediate Value Theorem 


If a function f(a) is continuous on an interval, and it has different values A and B, at the points a and 
b of this interval, where a « b, i.e., 


fla)=A, f()-B, AZB, (2.332) 
then for any value C between A and B there is at least one point c between a and b such that 
fle) =C, (a<c<b, A<C<BorA>C>B). (2.33b) 


In other words: The function f(x) takes every value between A and B on the interval (a,b) at least 
once. Or: The continuous image of an interval is an interval. 


5. Existence of an Inverse Function y ; y 
If a one-to-one function is continuous on an inter- 
val, it is strictly monotone on this interval. 

If a function f(x) is continuous on a connected do- 
main I, and it is strictly monotone increasing or de- 
creasing, then for this f(x) there also exists a con- 











tinuous, strictly monotone increasing or decreas- αἱ I X οἱ 
ing inverse function o(x) (see also 2.1.3.8, p. 52), a) b) 
which is defined on domain II given by the values 

of f(x) (Fig. 2.10). Figure 2.10 


Remark: In order to make sure that the inverse function of f(a) is continuous, f(a) must be continuous 
on an interval. Supposing only that the function is strictly monotonic on an interval, and continuous 
at an interior point c, and f(c) = C, then the inverse function exists, but may be not continuous at C. 
6. Theorem About the Boundedness of a Function 
If a function f(x) is continuous on a finite, closed interval [a, b] then it is bounded on this interval, i.e., 
there exist two numbers m and M such that 

m<f(a)<M for a<ac<b. (2.34) 


7. Weierstrass Theorem 


If the function f(a) is continuous on the finite, closed interval fa, b] then f(x) has an absolute maximum 
M and an absolute minimum m, i.e., there exists in this interval at least one point c and at least one 
point d. such that for all x with a € x «€ b: 


m = f(d) < f(x) < f(c) = M. (2.35) 
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The difference between the greatest and smallest value of a continuous function is called its variation 
in the given interval. The notion of variation can be extended to the case when the function does not 
have any greatest or smallest value. 


2.2 Elementary Functions 


Elementary functions are defined by formulas containing a finite number of operations on the indepen- 
dent variable and constants. The operations are the four basic arithmetical operations, taking powers 
and roots, the use of an exponential or a logarithm function, or the use of trigonometric functions or 
inverse trigonometric functions. To distinguish are algebraic and transcendental elementary functions. 
As another type of function, can be defined the non-elementary functions (see for instance 8.2.5, p. 513). 


2.2.1 Algebraic Functions 


In an algebraic function the argument x and the function y are connected by an algebraic equation. It 
has the form 


po(x) + pi(x)y + po(a)y? +... + pr(x)y” = 0 (2.36) 
where po, P1,---; Pn are polynomials in x. 
E 3r? — 4ry + z? — 1 = 0, i.e., po(£) 2 23? — 1, (x) 4x , po(x) = 0, p3(x) = 3a. 
If it is possible to solve an algebraic equation (2.36) for y, then there is one of the following types of the 
simplest algebraic functions. 








2.2.1.1 Polynomials 
Performing only addition, subtraction and multiplication on the argument x then: 

y = Ont” + Gna” 1 +... +a. (2.37) 
holds. In particular one can distinguish y = a as a constant, y = ax + b as a linear function, and 
y = ax? + br + casa quadratic function. 
2.2.1.2 Rational Functions 
A rational function can always be written in the form of the ratio of two polynomials: 


αμαῖ -- a4 43771 -- ... 4- ag 





Ξε τ mM 2: a 
y ba d. bu ιαπ 1 +... +b (2-38a) 
The special case 
ax +b 
y= 2.38] 
7 cr--d ΕΘ 





is called a homographic or linear fractional function. 
2.2.1.3 Irrational Functions 


Besides the operations enumerated for rational functions, the argument x also occurs under the radical 
sign. 


MA: y=V2¢+3, E B: y= ὐ(α - ντ. 
2.2.2 Transcendental Functions 


Transcendental functions cannot be given by an algebraic equation like (2.36). In the following para- 
graphs the simplest elementary transcendental functions are introduced. 


2.2.2.1 Exponential Functions 


The variable x or an algebraic function of x is in the exponent of a constant base (see 2.6.1, p. 72). 
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W A: y= et, HB: y= a®, HC: y= 9307—5a_ 
2.2.2.2 Logarithmic Functions 


The function is the logarithm with a constant base of the variable x or an algebraic function of x (see 
2.6.2, p. 73). 
MA: y=Inc, OB: y=lez, HC: y=log,(52? — 32). 


2.2.2.3 Trigonometric Functions 


'The variable z or an algebraic function of z occurs under the symbols sin, cos, tan, cot, sec, cosec (see 
2.7, p. 76). 

BA: y=sinz, MB: y=cos(2e+3), EC: y=tanyz. 

In general, the argument of a trigonometric function is not only an angle or a circular arc as in the 
geometric definition, but an arbitrary quantity. The trigonometric functions can be defined in a purely 


το 


analytic way without any geometry. For instance one can represent them by an expansion in a series, 





I I . . . zu. dA . PT 
or, e.g., the sin function as the solution of the differential equation di E + y = 0 with the initial values 
T 


d : : n boats 
y — 0and ZU — Lat z — 0. The numerical value of the argument of the trigonometric function is equal 
ΜΑ 
to the arc in units of radians. When dealing with trigonometric functions, the argument is considered 
to be given in radian measure (sce 3.1.1.5, p. 131). 
2.2.2.4 Inverse Trigonometric Functions 


The variable x or an algebraic function of x is in the argument of the inverse trigonometric functions 
(see 2.8, p. 85) arcsin, arccos, etc. 


W A: y-—arcsinrz, W B: y= arccos y1 -— x. 
2.2.2.5 Hyperbolic Functions 
(see 2.9, p. 89). 


2.2.2.6 Inverse Hyperbolic Functions 
(see 2.10, p. 93). 


2.2.3 Composite Functions 


Composite functions are all possible compositions of the above algebraic and transcendental functions, 
i.e., if a function has another function as an argument. 


lng + varcsin s 
BA: y=Insinz, HB: y= EN RUND. 
x? + 5e® 


Such composition of a finite number of elementary functions again yields an elementary function. The 
examples C in the previous types of functions are also composite functions. 


2.3 Polynomials 
2.3.1 Linear Function 


The graph of the linear function 

y=ar+b (2.39) 
(polynomial of degree 1) is a line (Fig. 2.11a). The proportional factor is denoted by a, the crossing 
point of the line and the axis of the ordinate by b. 
For a > 0 the function is monotone increasing, for a < 0 it is monotone decreasing; for a = 0 it is a 
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A Xv e ; . b 
polynomial of degree zero, i.e., it is a constant function. The intercepts are at A (-2. o) and B(0,b) 
a 


(for details see 3.5.2.6, 1., p. 195). With b = 0 direct proportionality 
y — ar; (2.40) 
holds, graphically it is a line running through the origin (Fig. 2.11b). 


y 
M 

a) 0 A~ x 
Va 

b) 0 x a 





Figure 2.11 Figure 2.12 


2.3.2 Quadratic Polynomial 
The polynomial of second degree 
y — ax? +br+c (2.41) 


b 
(quadratic polynomial) defines a parabola with a vertical axis of symmetry at x = δα (Fig. 2.12). 
a 


For a > 0 the function is first monotone decreasing, it has a minimum, then it is monotone increasing. 
For a < 0 first it is monotone increasing, it has a maximum, then it is monotone decreasing. In the case 


- VI? — 4ac 


—b 4 
b? — 4ac 7 0: The intersection points Aj, Ag with the x-axis are ( 2 
a 
point B with the y-axis, is at (0, c). In the case 0? — 4ac — 0 there is one intersection point (contact 


point) with the x-axis. In the case b? — 4ac « 0 there is no intersection point. The extremum point of 


b 4ac— ll? 
the curve is at C (x: ; —— 





: o) , the intersection 


1 ) (for more details about the parabola see 3.5.2.10, p. 204). 
a α 





Figure 2.13 


2.3.3 Cubic Polynomials 
The polynomial of third degree 
y = ax? + ba? + ca +d (2.42) 
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defines a cubic parabola (Fig. 2.13a,b,c). Both the shape of the curve and the behavior of the function 
depend on a and the discriminant A = 3ac — ?. If A > 0 holds (Fig. 2.13a,b), then for a > 0 the 
function is monotonically increasing, and for a < 0 it is decreasing. If A < 0 the function has exactly 
one local minimum and one local maximum (Fig. 2.13c). For a > 0 the value of the function rises from 
—oo until the maximum, then falls until the minimum, then it rises again to 4-oo; for a < 0 the value 
of the function falls from +oo until the minimum, then rises until the maximum, then it falls again to 
—oo. The intersection points with the x-axis are at the values of the real roots of (2.42) for y = 0. The 
function can have one, two (then there is a point where the x-axis is the tangent line of the curve) or 
three real roots: A;, Ag and A3. The intersection point with the y-axis is at B(0,d), the extreme points 


bv d 4- 20? — Yabe + (6ac — 2b?) /— 
3a? 27a? ` 











of the curve C and D, if any, are at ( 


b 20? — 9abc 1 
3a’ 27a? 


The inflection point which is also the center of symmetry of the curve is at E ( — 


dy A 

At this point the tangent line has the slope tan y = ( w) 
a 
E 


=a, 


yA — nodd 
- neven 





Figure 2.14 Figure 2.15 


2.3.4 Polynomials of n-th Degree 
The integral rational function of n-th degree 

y = Ont” + ne” 1 +... +a 12 + a9 (2.43) 
defines a curve of n-th degree or n-th order (see 3.5.2.5, p. 195) of parabolic type (Fig. 2.14). 
Case 1, n odd: Fora, > 0 the value of y changes continuously from —oo to +00, and for a, < 0 
from +00 to —oo. The curve can intersect or contact the x-axis up to n times, and there is at least one 
intersection point (for the solution of an equation of n-th degree see 1.6.3.1, p. 43 and 19.1.2, p. 952). 
The function (2.43) has none or an even number up to n — 1 of extreme values, where minima and 
maxima occur alternately; the number of inflection points is odd and is between 1 and n — 2. There 
are no asymptotes or singularities. 
Case 2, n even: Fora, > 0 the value of y changes continuously from +oo through its minimum 
until +00 and for a,, < 0 from —oo through its maximum until —oo. The curve can intersect or contact 
the z-axis up to n times, but it is also possible that it never does that. The number of extrema is odd, 
and maxima and minima alternate; the number of inflection points is even, and it can also be zero. 
There are no asymptotes or singularities. 
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Before sketching the graph of a function, it is recommended first to determine the extreme points, the 
inflection points, the values of the first derivative at these points, then to sketch the tangent lines at 
these points, and finally to connect these points continuously. 


2.3.5 Parabola of n-th Degree 
The graph of the function 

y = ax” (2.44) 
where n > 0, integer, is a parabola of n-th degree, or of n-th order (Fig. 2.15). 
1. Special Case a= 1: The curve y = x” goes through the point (0,0) and (1,1) and contacts or 
intersects the x-axis at the origin. For even n the curve is symmetric with respect to the y-axis, and 
with a minimum at the origin. For odd n the curve is symmetric with respect to the origin, and it has 
an inflection point there. There is no asymptote. 


2. General Case a Z0: The curve of y = az" can be got from the curve of y = x” by stretching the 
ordinates by the factor |a|. For a < 0 the curve y — [a|x" is to be reflected with respect to the x-axis. 


2.4 Rational Functions 


2.4.1 Special Fractional Linear Function (Inverse Proportionality) 


The graph of the function 

ees 2.45 

ae (2.45) 
is an equilateral hyperbola, whose asymptotes are the coordinate axes (Fig. 2.16). The point of dis- 
continuity is at « = 0 with y = too. Ifa > 0 holds, then the function is strictly monotone decreasing 
in the interval (—oo, 0) with values from 0 to —oo and also strictly monotone decreasing in the interval 
(0, --o0) with values from --oo to 0 (curve in the first and third quadrants). If a < 0, then the func- 
tion is increasing in the interval (—oo , 0) with values from 0 to +00 and also increasing in the interval 
(0, +00) with values from —oo to 0 (dotted curve in the second and fourth quadrants). The vertices A 


and B are at (+y [a], -- V [a| ) and (+y lal, —4/ lal) with the same sign for a > 0 and with different sign 


for a < 0. There are no extreme values (for more details about hyperbolas see 3.5.2.9, p. 201). 














Figure 2.16 Figure 2.17 


2.4.2 Linear Fractional Function 
The graph of the function 


r+b 
y= : (ω 0,A= 


agx + bg 


ay by 
az b2 














= αιῦο = biaz z 0) (2.46) 
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is an equilateral hyperbola, whose asymptotes are parallel to the coordinate axes (Fig. 2.17). 














bo a A 
The center is at C (-2 ] 2). The parameter a in the equality (2.45) corresponds here to — — g with 
dg dg 
αι ὦ bo X ΙΔ “tyla 
Δ -- !| The vertices of the hyperbola A # 0 and B are at [- EM 
32 02 2 
bz + bx yIAI αι - αι γΙΔΙ 
(= , where for A < 0 the same signs are taken, for A > 0 the different ones. 
: . PME bg . I ay 
The point of discontinuity is at x = ——. For A < 0 the values of the function are decreasing from — 
ag ag 
a a 
to —oo and from +00 to —. For A > 0 the values of the function are increasing from — to +00 and 
a2 a2 
from —oo to 2. , There is no extremum. 
a2 


2.4.8 Curves of Third Degree, ΤΥΡεΙ 
The graph of the function 
b c ( ax” + bx +c 
a+—+ 
τα 





) (bz 0, cz 0) (2.47) 


(Fig. 2.18) is a curve of third degree (type I). It has two asymptotes x = 0 and y — a and it has two 
branches. One of them corresponds to the monotone changing of y while it takes its values between 
à and 4-oo or —oo; the other branch goes through three characteristic points: the intersection point 


c 2c p? 
with the asymptote y — a at A (-5 D an extreme point at B (ος τᾳ-- 3 and an inflection 
4c 
] 3c 2b? SA ; 
point at C A ,0— del The positions of the branches depend on the signs of b and c, and there 
c 


are four cases (Fig. 2.18). The intersection points D, E with the x-axis, if any, are fora # 0 at 
—b x Vt? — 4ac 

( 2a 

or none, depending on whether 0? — 4ac 7 0, — 0 or « 0 holds. 





ο ; 8 : 
; o) , fora = Oat (-5. a) ; their number can be two, one (the a-axis is a tangent line) 


For b = 0 the function (2.47) becomes the function y = a + = (see (Fig. 2.21) the reciprocal power), 
x 


. i . ax +b . 
and for c = 0 it becomes the homographic function y = ——— , as a special case of (2.46). 
a 


2.4.4 Curves of Third Degree, Type II 
The graph of the function 


1 
y= —— 0 2.48 
Y= a? +br+c (a #0) ( ) 
b 
is a curve of third degree (type II) which is symmetric about the vertical line x = — and the x-axis is 
α 


its asymptote (Fig. 2.19), because lim y = 0. Its shape depends on the signs of a and A — 4ac — b?. 
q—-Loo 
From the two cases a > 0 and a < 0 only the first one is considered, because reflecting the curve of 


E 1 
TIN 


— ——— — with respect to the x-axis one gets the second one. 
—a)r? — br — c 
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Bre 
c>0,b>0 
b) 





Figure 2.18 
Case a) A > 0: The function is positive and continuous for arbitrary values of x and it is increasing 


. b : I . da - . a ; 
on the interval (—oo, 27 Here it takes its maximum, Δ’ then it is decreasing again in the interval 
a 


b b 4a 
(= ,oo). The extreme point A of the curve is at (x à x) , the inflection points B and C are at 
2a α 


D e VA 3a 
2a 2aV3' ^ 


ας 80 
get tan y = Fa? (3) (Fig. 2.192). 








) ; and for the corresponding slopes of the tangent lines (angular coefficients ) we 





Case b) A — 0: The function is positive for arbitrary values of x, its value rises from 0 to --oo, at 
b 

t= = t it has a point of discontinuity (a pole), where lim y = +00. Then its value falls from 
20, 2-}20 

here back to 0 (Fig. 2.19b). 


Case c) A « 0: The value of y rises from 0 to +00, at the point of discontinuity it jumps to —oo, 
and rises to the maximum, then falls back to —oo; at the other point of discontinuity it jumps to +00, 





b 4 
then it falls to 0. The extreme point A of the curve is at (- os x) . The points of discontinuity are 
a 
—b+VJV-A 
at r = —— (Fig. 2.19c). 
α 


2.4.5 Curves of Third Degree, Type III 


The graph of the function 
(a#0,b40,c#0) (2.49) 


T 


US oe +br+e 
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A=0 





b) 
Figure 2.19 


is a curve of third degree (type II) which goes through the origin, and has the x-axis (Fig. 2.20) as an 
asymptote. The behavior of the function depends on the signs of a and of A = 4ac — b?, and for A < 0 
also on the signs of the roots o and f of the equation ax? + br +c = 0, and for A = 0 also on the sign 
of b. From the two cases, a > 0 and a < 0, only the first one is considered because reflecting the curve 


of y= with respect to the x-axis yields the second one. 


x 
(—a)a? — bx —c 
Case a) A > 0: The function is continuous everywhere, its value falls from 0 to the minimum, then 
rises to the maximum, then falls again to 0. 

c —b+2/ac 


The extreme points of the curve, A and B, are at (+ =, E) ; there are three inflection 
α 








points (Fig. 2.20a). 
Case b) A = 0: The behavior of the function depends on the sign of b, so there are two cases. In both 


cases there is a point of discontinuity at £ = x both curves have one inflection point. 
α 


οὐ — 0: The value of the function falls from 0 to —oo, the function has a point of discontinuity, then the 
value of the function rises from —oo to the maximum, then decreases to 0 (Fig. 2.20b;). The extreme 


C 1 
oint A of the curve is at A +E =, |]. 
p ( a’ 2/fac+b 
οὐ «0: The value of the function falls from 0 to the minimum, then rises to 4-oo, running through the 
origin, then the function has a point of discontinuity, then the value of the function falls from +-oo to 0 


6 1 

Fig. 2.20b2). The extreme point A of the curve is at A ΠΗ͂ -------]). 

(Fig 2) p ( a ^ 2γας- z) 
Case c) A « 0: The function has two points of discontinuity, at v = oa and x — f; its behavior 
depends on the signs of a and f. 
e The signs of o and f) are different: The value of the function falls from 0 to —oo, jumps up to +00, 
then falls again from 4-oo to —oo, running through the origin, then jumps again up to 4-oo , then it falls 
tending to 0 (Fig. 2.20c;). The function has no extremum. 
e The signs of « and f are both negative: The value of the function falls from 0 to —oo, jumps up 
to 4-oo, from here it goes through a minimum up to 4-oo again, jumps down to —oo,, then rises to a 
maximum, then falls tending to 0 (Fig. 2.20c;). 
The extremum points A and P can be calculated with the same formula as in case a) of 2.4.5. 
e The signs of a and B are both positive: The value of the function falls from 0 until the minimum, then 
rises to +00, jumps down to —oo, then it rises to the maximum, then it falls again to —oo, then jumps 
up to +00 and then it tends to 0 (Fig. 2.20c3). 
The extremum points A and B can be calculated by the same formula as in case a) of 2.4.5. 
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Figure 2.20 
In all three cases the curve has one inflection point. 
Figure 2.21 Figure 2.22 
2.4.6 Reciprocal Powers 
The graph of the function 
y= E —ar " (n0, integer;a 4 0) (2.50) 
x 


is a curve of hyperbolic type with the coordinate axes as asymptotes. The point of discontinuity is at 
x = 0 (Fig. 2.21). 

Case a) For a > 0 and for even n the value of the function rises from 0 to +00, then it falls tending 
to 0, and it is always positive. For odd n it falls from 0 to —oo, it jumps up to +00, then it falls tending 
to 0. 


2.5 Irrational Functions 71 





Case b) For a < 0 and for even n the value of the function falls from 0 to —oo, then it tends to 0, and 
it is always negative. For odd n it rises from 0 up to 4-oo, jumps down to —oo, then it tends to 0. 

The function does not have any extremum. The larger n is, the faster the curve approaches the x-axis, 
and the slower it approaches the y-axis. For even n the curve is symmetric with respect to the y-axis, 
for odd n it is center-symmetric and its center of symmetry is the origin. The Fig. 2.21 shows the cases 
n — 2 and n — 3 for a — 1. 


2.5 Irrational Functions 
2.5.1 Square Root of a Linear Binomial 


The union of the curve of the two functions 


y=+var+b (a¥0) (2.51) 








j : : : ; b . 
is a parabola with the x-axis as the symmetry axis. The vertex A is at (-2. ο) the semifocal chord 
a 


(see 3.5.2.10, p. 204) is p = s . The domain of the function and the shape of the curve depend on the 


sign of a (Fig. 2.22) (for more details about the parabola see 3.5.2.10, p. 204). 


2.5.2 Square Root ofa Quadratic Polynomial 


The union of the graphs of the two functions 


y = +Var?+br+c (a#0, A= 4ac— b? £0) (2.52) 


is for a < 0 an ellipse, for a > 0 a hyperbola (Fig. 2.23). One of the two symmetry axes is the x-axis, 




















the other one is the line x = ——. 
2a 
ba A ΙΔ 
The vertices A, C and B, D are at (= E 4 , where A = 4ac — B?. 
4a 
y B 
C » 
x 
D 
a) a<0,A<0 a>0,A<0 


Figure 2.23 


The domain of the function and the shape of the curve depend on the signs of a and A (Fig. 2.23). 
For a < 0 and A > 0 the function has only imaginary values, so no curve exists (for more details about 
the ellipse and hyperbola see 3.5.2.8, p. 199 and 3.5.2.9, p. 201). 


2.5.3 Power Function 


The power function 


y — ax? — ay tín (m, n integer, positive, coprime) (2.53) 
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is to be discussed for k > 0 and for k « 0 (Fig. 2.24, Fig. 2.25). The investigation here can be 
restricted to the case a — 1, because for a Z 1 the curve differs from the curve of y — z^ only by a 
stretching in the direction of the y-axis by a factor |a|, and for a negative a by a reflection to the x-axis. 











y a y y 3 
1/3 y=x? 
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1 1 
> + > 
01 x OIN x 
gcc 
b) d) 
Figure 2.24 
Case a) k > 0, y = z""/". The shape of the curve is represented in four characteristic cases 


depending on the numbers m and n in Fig. 2.24. The curve goes through the points (0,0) and (1, 1). 
For k > 1 the x-axis is a tangent line of the curve at the origin (Fig. 2.24d), for k < 1 the y-axis isa 
tangent line also at the origin(Fig. 2.24a,b,c). For even n the union of the graph of functions y — -Ez* 
may be considered: it has two branches symmetric to the x-axis (Fig. 2.24a,d), for even m the curve 
is symmetric to the y-axis (Fig. 2.24c). If m and n are both odd, the curve is symmetric with respect 
to the origin (Fig. 2.24b). So the curves can have a vertex, a cusp or an inflection point at the origin 
(Fig. 2.24). None of them has any asymptote. 
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Figure 2.25 


Case b) k < 0, y — x-""/^; The shape of the curve is represented in three characteristic cases 
depending on m and n in Fig. 2.25. The curve is a hyperbolic type curve, where the asymptotes 
coincide with the coordinate axes (Fig. 2.25). The point of discontinuity is at r — 0. The greater |k| 
is the faster the curve approaches the x-axis, and the slower it approaches the y-axis. The symmetry 
properties of the curves are the same as above for k > 0; they depend on whether m and n are even or 
odd. There is no extreme value. 


2.6 Exponential Functions and Logarithmic Functions 


2.6.1 Exponential Functions 
The function 

y—a*—e&* (a>0, b=Ina), (2.54) 
is called the exponential function and its graphical representation the exponential curve (Fig. 2.26). 
From (2.54) for a = e follows the function of the natural exponential curve 

y=. (2.55) 
The function has only positive values. Its domain is the interval (—o0, +00). For a > 1, i.e., for b > 0, 
the function is strictly monotone increasing and takes its values from 0 until oo. For a < 1, i.e., for 
b « 0, it is strictly monotone decreasing, its value falls from oo until 0. The larger |b| is, the greater is 
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the speed of growth and decay. The curve goes through the point (0, 1) and approaches asymptotically 
the x-axis, for b > 0 on the right and for b < 0 on the left, and faster for greater values of |b|. The 


T 
function y = a” = (9 increases for a < 1 and decreases for a > 1. 
a 






y=log,x=lb x 
y=log.x=In x 
------ y-logux-lgx 





Figure 2.26 Figure 2.27 


2.6.2 Logarithmic Functions 
The function 

y=log,x (a>0, aF1) (2.56) 
gives the logarithmic curve (Fig. 2.27); the curve is the reflection of the exponential curve with respect 
to the line y = x. From (2.56) for a = e follows the curve of the natural logarithm 

y=lInc. (2.57) 
The real logarithmic function is defined only for > 0. For a > 1 it is strictly monotone increasing 
and takes its values from —oo to +00, for a < 1 it is strictly monotone decreasing, and takes its values 
from +00 to —oo, and the greater | In a] is, the faster the growth and decay. The curve goes through the 
point (1,0) and approaches asymptotically the y-axis, for a > 1 down, for a < 1 up, and again faster 
for larger values of |In a|. 


2.6.5 Error Curve 
The function 

y = ear)? (2.58) 
gives the error curve (Gauss error distribution curve) (Fig. 2.28). Since the function is even, the y- 
axis is the symmetry axis of the curve and the larger |a] is, the faster it approaches asymptotically the 
x-axis. It takes its maximum at zero, and it is equal to one, so the extreme point A of the curve is at 





1 1 
(0, 1), the inflection points of the curve B, C are at ( ) . 


ET 


The slopes of the tangent lines are here tan o — xay2/e. 





A very important application of the error curve (2.58) is 
the description ofthe normal distribution properties of the 
observational error (see 16.2.4.1, p. 818.): 

1 x 
y= (2) = e ——|. 2.59 
y= ole) = Ren (- zx) (259) 











Figure 2.28 
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2.6.4 Exponential Sum 


The function 


y = ae" 4. ced? (2.60) 
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Figure 2.29 


is represented in Fig. 2.29 for the characteristic sign relations. The sum of the functions is got by 
adding the ordinates of the curves, i.e., the summands are y1 — ac" and y» — ce^". The function is 
continuous. If none of the numbers a, b, c, d is equal to 0, the curve has one of the four forms represented 
in Fig. 2.29. Depending on the signs of the parameters it is possible, that the graphs are reflected over 
a coordinate axis. 

The intersection points A and B of the curve with the y-axis and with the x-axis are at (0,a +c), and 

In (—a/c 

η (EP 


1 ab 
d-b 9) respectively, the extremum C is at « = Τη In (-3) , and the inflection point D 


cd 
is at L af- Ë) inu hen they exist 
sat © = —— Im =c e case en ey exist. 
1s x d—b j: ce 1n th se when ley XIS 


Case a) The parameters a and c, and b and d have the same signs: The function does not change its 
sign, it is strictly monotone; its value is changing from 0 to +00 or to —oo or it is changing from +00 
or from —oo to 0. There is no inflection point. The asymptote is the z-axis (Fig. 2.29a). 

Case b) The parameters a and c have the same sign, b and d have different signs: The function does 
not change its sign and either comes from 4-oo and arrives at +00 and has a minimum or comes from 
—oo, goes to —oo and has a maximum. There is no inflection point (Fig. 2.29b). 

Case c) The parameters a and c have different signs, b and d have the same signs: The function has 
one extremum and it is strictly monotone before and after. It changes its sign once. Its value changes 
whether from 0 until the extremum, then goes to +00 or —oo or it comes first from 4-oo or —oo, takes 
the extremum, then approaches 0. The x-axis is an asymptote, the extreme point of the curve is at C 
and the inflection point at D (Fig. 2.29c). 

Case d) The parameters a and cand also b and d have different signs: The function is strictly monotone, 
its value rises from —oo to +00 or it falls from +oo to —oo. It has an inflection point D (Fig. 2.29d). 


2.6.5 Generalized Error Function 


The curve of the function 


p b\* 
y = aexp(bx + cx”) = aexp (-i5) exp ( (« + x) | (c 0) (2.61) 
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Figure 2.30 


can be considered as the generalization of the error function (2.58); it results in a symmetric curve with 
respect to the vertical line x = πε it has no intersection point with the z-axis, and the intersection 


c 
point D with the y-axis is at (0, a) (Fig. 2.30a,b). 

The shape of the curve depends on the signs of a and c. Here only the case a > 0 is discussed, because 
the curve for a < 0 is got by reflecting it in the z-axis. 

Case a) c > 0: The value of the function falls from +oo until the minimum, and then rises again to 


b pg 
+oo. It is always positive. The extreme point A of the curve is at τς , a exp (-) and it corre- 
ο 4 


sponds to the minimum of the function; there is no inflection point or asymptote (Fig. 2.30a). 


b 
Case b) c < 0: The z-axis is the asymptote. The extreme point A of the curve is at (-z: 
c 
b2 
a exp (-)) and it corresponds to the maximum of the function. The inflection points B and C 
m 


are at 


(A cs eee (ος )) (Fig 2.30b). 





2c 4c 


2.6.6 Product of Power and Exponential Functions 


The function 

y = axe” (2.62) 
is discussed here only in the case a > 0, because in the case a < 0 the curve is got by reflecting it in the 
x-axis. For a non-integer b the function is defined only for x > 0, and for an integer b the shape of the 
curve for negative x can be deduced also from the following cases (Fig. 2.31). 
Fig. 2.31 shows how the curve behaves for arbitrary parameters. 
For b > 0 the curve passes through the origin. The tangent line at this point for b > 1 is the x-axis, for 
b = 1 the line y = x, for O < b < 1 the y-axis. For b < 0 the y-axis is an asymptote. For c > 0 the 
function is increasing and exceeds any value, for c « 0 it tends asymptotically to 0. For different signs 


: b ] ; 
of b and c the function has an extremum at r — —— (point A on the curve). The curve has either no or 
c 








b 
one or two inflection points at 2 = -———— (points C and D see Fig. 2.31c,e,f,g). 
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Figure 2.31 


2.7 Trigonometric Functions (Functions of Angles) 


2.7.1 Basic Notions 
2.7.1.1 Definition and Representation 
1. Definition 


The trigonometric functions are introduced by geometric considerations. So in their definition and also 
in their arguments degree or radian measure is used (see 3.1.1.5, p. 131). 


2. Sine 
The standard sine function 
y —sinx (2.63) 


is a continuous curve with period T' — 27 (see Fig. 2.32a). 








Figure 2.32 
The intersection points Bo, B4, B. 4, B3, B. 3,..., with By — (kz,0) (k — 0,31, £2,...) ofthe stan- 














dard sine curve and the x-axis are the inflection points of the curve. Here the angle of slope of the tan- 





Y ; "E: i ; 
gent line with the x axis is Ἐπ The extreme points of the curve are at Co, C1, C 4, C2, C_2,... with 














Cy = ((k 3 ln, (-1)*) (k = 0, +1, 2,.. J: For every value of the function y there is —1 < y < 1. 


The general sine function 


y = Asin(wz + Yo) (2.64) 
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with an amplitude |A|, frequency w, and phase shift yo is represented in Fig. 2.32b. 
Comparing the standard and the general sine curve (Fig. 2.32b) it can be seen that in the general 
case the curve is stretched in the direction of y by a factor |A|, in the direction of x it is compressed 


1 2π 
by a factor —, and it is shifted to the left by a segment $0 The period is T = —. The intersection 
ω ω ω 


ο (k = 0, 





kT — po 








= 


,+2,...). The extreme points are Ck = 





points with the z-axis are Bẹ = ( 


en (-1*A| (k-9, 

















1ος]. 
Ζ 39,9) 
3. Cosine 
The standard cosine function 
y = cos x = sin(x + 5) (2.65) x 





is represented in Fig. 2.33. 


The intersection points with the z-axis 


Figure 2.33 
1 
Bo, Bi, Bo, ..., Be = (ο + 5) T, 0) (k = 0, £1, €2,...) are also the inflection points. The angle of 














slope of the tangent line is +—. 








The extreme points are Co, Ci,..., Ck = (km, (—1)*) (k = 0,+1 











The general cosine function 


y = Acos(wa + qo) (2.66a) 
can be transformed into the form 
y = Asin (ων + po + 5) ὃ (2.665) 


i.e., the general sine function shifted left by y = i 


4. Tangent 
The tangent function 
sin x 
y = tant = — (2.67) 
COS X 














1 
has period T = 7 and the asymptotes are x = (ι + 5) T (k = 0,+1,+2,...) (Fig. 2.34). The 
π 


π 

thm, +5 +k) (k=0, 
a 2. ( 
values from —oo to +00. The curve has intersection points with the z-axis at Ao, A1, A. 1, Ae, A_2,..., 


Ay = (k7,0) (& = 0,+1,+2,...), these points are the inflection points and the angle of slope of the 











function is monotone increasing in the intervals 


μα 


2,...) and takes 

















T 
tangent line is T 


5. Cotangent 


The cotangent function 








amy 1 
y — cotz — aS — —tan (s + 5) (2.68) 
sing tang 2 
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Figure 2.34 





Figure 2.35 


has a graph which is the tangent curve reflected with respect to the x-axis and shifted to the left by 











2,...). Between 0 and s the function is 





s (Fig. 2.35). The asymptotes are x = kr (k = 0,1, 


monotone decreasing and takes its values from +00 until —oo; the function has period T = m. The 


il 
intersection points with the x-axis are at Ao, Ay, A_1, Ao, A_2,... with A, = ((x + ;) T, 0) (k= 









































π 
0,+1,+2,...), they are the inflection points of the curve and here the angle of the tangent line is ο... 
Α π 
6. Secant y 2 2n 
The secant function | ! 
1 A i 
y = sec T = (2.69) η f 
COS & it i 
1 Ἡ | 
has period T = 27, the asymptotes are 7 = (t + ;) m (k = 2 | 
0,+1,+2,...); and obviously |y| > 1 holds. The extreme points 
corresponding to the maxima of the function are Ap, Ay, A-4,... 1 i ' 
i | 0 | 
| | s τη 
! 1 Ig 5! 
f i H aa 
| | | dg 
i i aggy B 
| | Γ 8188} 8 
1| q ο ο ο 
| 1 0 90 180 270 360 


Figure 2.36 Figure 2.37 














Figure 2.38 


with Ap = ((2k + 1)r,—1) (k = 0,1, £2,...), the extreme τ ο... to the minima of 








the function are Bo, Bi, B_1,... with By = ο +1) (k =0, +1, ...) (Fig. 2.36). 


7. Cosecant 


The cosecant function 


y = cosec z = 





sin x 











(2.70) 
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T 
has a graph which is the graph of the secant shifted to the right by x = 3° The asymptotes are 











α = kr (k = 0,+1,+2,...). The extreme points corresponding to the maxima of the function are 

















Ak 4 
Ao, Ai, A-i,... with A, = ( ti 2 (k = 0,+£1,+2,...) and the the points corresponding 


(“+ 














to the minima of the function are Bo, B1, B. 4,... with Bj — 








zn) (k = 0,+1,+42,...) 
(Fig. 2.37). 
2.7.1.2 Range and Behavior of the Functions 


1. Angle Domain 0 < z < 360° 

The six trigonometric functions are represented together in Fig. 2.38 in all the four quadrants for a 
complete domain of angles from 0° to 360° or for a complete domain of radians from 0 to 27. 

In Table 2.1 there is a review of the domain and the range of these functions. The signs of the functions 
depend on the quadrant where the argument is taken from, and these are reviewed in Table 2.2. 


Table 2.1 Domain and range of trigonometric functions 





Domain Range Domain Range 





-l<sina <1 x # (2k 17 —oo < tana < 00 
—1 < cosx < 1 2 $ kr —oo « cot z « oo 
(k =0,+1,+2,...) 


-οο κα «αο { 
































2. Function Values for Some Special Arguments (see Table 2.3) 


3. Arbitrary Angle 

Since the trigonometric functions are periodic (period 360° or 180°), the determination of their values 
for an arbitrary argument x can be reduced by the following rules. 

Argument x > 360° or a > 180°: If the angle is greater than 360° or greater than 180°, then it is 
to be reduced for a value a, for which 0 < a < 360° or 0 € a < 180° holds, in the following way (n 
integer): 


sin(360° -n +a) =sina, (2.71) cos(360° - n+ a) = cosa, (2.72) 





tan(180°-n+a)=tana, (2.73) cot(180° - n +a) = cota. (2.74) 


Table 2.2 Signs of trigonometric functions 









































| Quadrant Angle sin cos tan cot sec csc 
I from 0° to 90° + + + + F H 
II from 90° to 180° H I 
III from 180° to 270° I 
IV from 270° to 360° 














Argument æ < 0: If the argument is negative, then the following formulas reduce the calculations to 
functions for positive argument: 


sin(—a) = —sina, (2.75) cos(—a) = cosa, (2.76) 


tan(—a) = — tana, (2.77) cot(—a) = —cota. (2.78) 
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Table 2.3 Values of trigonometric functions for 0°, 30°, 45°, 60° and 90°. 






















































































Angle | Radian | sin | cos | tan | cot | sec | csc | 
0° 0 0/1 | 0 |Τοο] 1 | σου] 
1 1 3 3 273 
30° -π B v3 v3 V3 v3 9 | 
6 2 2 3 3 
1 2 2 
45? m 2 2 1 1 | V2 | v2 | 
4 2 2 
1 η Ἡ 9 2ν 9 
60° o v3 =, |. 4/3 v3 9 2ν5 | 
3 2 2 3 3 
1 
90° ο 1 ϱ σι 0 +00 1 
Table 2.4 Reduction formulas and quadrant relations of trigonometric functions 
Function | x = 90° +a | x = 180° +a | x = 270° +a | x = 360° — a 
sin x + cos a sina — cosa —sino 
cos & Fsina — cosa +sina + cos a 
tan x Fcota + tana F cota —tana 
cot x F tana +E cot a F tana —cota 





























Argument z for 90? « x « 360°: If 90° < x < 360° holds, then the arguments are to be reduced 
for an acute angle a by the reduction formulas given in Table 2.4. The relations between the values 
of the functions belonging to the arguments which differ from each other by 90°, 180° or 270° or which 
complete each other to 90°, 180° or 270° are called quadrant relations. 

The first and second columns of Table 2.4 give the complementary angle formulas, and the first and 
third ones give the supplementary angle formulas. Because « = 90° — a is the complementary angle 
(see 3.1.1.2, p. 130) of a, the following relations 


cosa = sin x = sin(90° — a), (32.Τ94) sin a = cos x = cos(90° — a) (2.79b) 
are called the complementary angle formulas. 
For œ + x = 180° the relations between the trigonometric functions for supplementary angles (see 
3.1.1.2, p. 130) 

sina = sina = sin(180°— a), (2.80a) — cos a = cos % = cos(180° — a) (2.80b) 


are called supplementary angle formulas. 

Argument g for 0° < x < 90°: The values of trigonometric functions for acute angles (0° < x < 
90°) have been taken formerly from tables, today calculators are used. 

Β sin(—1000°) = — sin 1000° = — sin(360° - 2 + 280°) = — sin 280° = + cos 10° = +0.9848. 


4. Angles in Radian Measure 
The arguments given in radian measure, i.e., in units of radians, can be easily converted by formula 


(3.2) (see 3.1.1.5, p. 131). 
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2.7.2 Important Formulas for Trigonometric Functions 


Remark: Trigonometric functions with complex argument z are discussed in 14.5.2, p. 759. 


2.7.2.1 Relations Between the Trigonometric Functions 











sin? a -- cos?a — 1, (2.81) sec? a — tan? a — 1, (2.82) 
cosec? a — cot?a — 1, (2.83) sina: coseca — 1, (2.84) 
cosa seca — 1, (2.85) tana: cota = 1, (2.86) 
SUAE. ia a, (2.87) COS = cota. (2.88) 
cos a sina 


Some important relations are summarized in Table 2.5 for 0 < œ < 7/2 in order to create an easy 
survey. For other intervals in Table 2.5 the square roots are always considered with the sign which 
corresponds to the quadrant where the argument is. 


2.7.2.2 Trigonometric Functions of the Sum and Difference of Two Angles 
(Addition Theorems) 











sin(a 8) = sina cos B+cos asin 8, (2.89) cos(a+) = cos a cos 8 sin asin 0. (2.90) 








tana + tan 





_ cotacot 8 F1 














te + b) = — 2.91 t(a sa 8) = 2.92 
an(a $) 1 F tana tan 8’ (2.91) cot(a £ £) cot B X cota ' (2.92) 
sin(a + 8 + y) = sin a cos 8 cos y + cos a sin 8 cos y 
+ cos a cos sin y — sin a sin 6 sin y, (2.93) 
cos(a + 8 + 7) = cosa cos 8 cos y — sin a sin B cos y 
— sin q cos f sin y — cosa sin f sin y. (2.94) 
2.7.2.3 Trigonometric Functions of an Integer Multiple of an Angle 
sin 2a = 2sinacosa, (2.95) cos 2a = cos? o — sin? a, 2.97) 
sin3a = 3sina — Asin? a, (2.96) cos 3a. — 4cos? a — 3cosa, (2.98) 
sin4a — 8cos! asina — 4cososina, (2.99) cos 4a — 8cos*a — 8cos?a--1, (2.100) 
2tana cot?^a — 1 
tan2a = o 2.101 = CON ek à 
ΠΤ παρα ( ) cot 20 eta”? (2.104) 
3tana — tan? o cot? a — 3cot a 
ta = > 2.102 ^ = 
an 3a {sea (2.102) cot 3a τα. στ (2.105) 
Atana — 4tan? o cotta — 6cot? a 4-1 
tan 4a = —— 2.103 τ ; 
enor 1— 6tan?a 4- tan* a, ( ) cot da Acoiw=dA cota (2.106) 


For larger values of n in order to gain a formula for sin na and cos na the de Moivre formula is to be 
used (see 1.5.3.5, p. 38). 
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Using the binomial theorem (see 1.1.6.4, p. 12) gives: 


ση With this 


cos na — cos" a — ( 


cos na +isinna = 


= cos” œ + in cos” 


n ᾿ [n m Μ᾽ n ὦ . 
(5 cos"? a sin? a — (S) cos"? o sin? a 4- (5 cos"-* o sin* a 4- .... 


it follows: 


k=0 


n 


l asina 


f 
Τι {η 
Σ ^ if cos”™ asin" a = (cosa + isin a)” 


n ] n ο . n = . 
;) cos”? asin? a + (5 cos"-^ o sin* a — (2) cos” o sin? a 4- ... 


: z : n . ud n τ f 
sinna = n cos”! asin a — (5 cos"? o sin? a 4- (5 cos"? asina — .... 
5 


(2.107) 


(2.108) 


(2.109) 


: ' f ; : : π 
Table 2.5 Relations between the trigonometric functions of the same argument in the intervalÜ « œ < 5 


























α sina cosa tana cota 
. 3 tana 1 
sina — 2L "——————— 
l-ctan*a 1+cot*a 
- 1 cota 
cosa V1 —sin? a 3 E 
1 + tan a 1 + cot“ a 
sina y1 — cos? qa 1 
tana Ξ 
νἹ--οἰηζα cos Q cota 
V1 —sin?a cosa 1 
cota - = 
sina V1 —cos? a tana 

















2.7.2.4 Trigonometric Functions of Half-Angles 


In the following formulas the sign of the square root must be chosen positive or negative, according to 
the quadrant where the half-angle is. 








É 1 

sin S - 3 - esa), (2.110) 

& a 1—cosa 1— cosa sina 

απ = E E ; 

2 1+ cosa sina 1+cosa 
a 1+ cosa 1+cosa sina 

cot — — = - = ; 
2 1— cosa sina 1— cosa 


α 
COS — — 


2 


1 
/ σα + cosa), 


(2.111) 


(2.112) 


(2.113) 
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2.7.2.5 Sum and Difference of Two Trigonometric Functions 


























= 3 = 
sin a 4 sin B = asin TP s 5 B (2114) sina — sin B — 2cos sin 5 B (2.115) 
η - -β 
cos a+ cos 8 = 2 cos E 5 B cos 2 5 P (2.116) cosa — cos B — —2 sin * αν E 7 f ; (2117) 





sin(a + 8) 


cosa cos B 





sin(a + £) 














tana + tan -- (2.118) cota cot 8 = (2.119) 


- v 
sin a sin 8 











tana + cot p = κ p). (2.120) cota -— tan 8 = «ΕΒΕ (2.121) 
cos a sin sin a cos 8 
2.7.2.6 Products of Trigonometric Functions 
1 
sino sin 8 — g [costa — 8) — cos(a + 8)], (2.122) 
cosa cos B — 5 lcos(o — 5) + cos(a + £)], (2.123) 
sin a cos 8 = sisin(a — B) + sin(a + £)], (2.124) 
ea n E Τε. : 
sin a sin sin y = ΠΣ c B — v) —- sin(B 4- y — a) 
+sin(y +a — 8) — sin(a 4- B 4- 5], (2.125) 
: le, - 
sina cos 8 cos y = qsin(a T B — y) — sin(B 4- y — o) 
+sin(y +a — 8) 4 sin(a 4- B 4- 3], (2.126) 
sina sin B cos» — ir cos(a + B — y) + cos(B +7 — a) 
+cos(y + a — 8) — cos(a+ 6+), (2.127) 
cos a cos 8 cos y = ‘leos(a 4 B — y) 9- cos(B 4- y — a) 
4- cos(^y 4- a — B) + cos(a + 8 + 4]. (2.128) 
2.7.2.7 Powers of Trigonometric Functions 
--- gc 
sin? a = σα — cos2a), (2.129) cos” a = σα + cos 2a), (2.130) 
23 λα... : 3 1 
sin” a = 18 sina — sin 3a), (3.131) cos? a = 4 (cos 3a--3cosa), (2.132) 


1 1 
sinf a = g (cos 4a — 4cos 2a +3), (2.133) costa = g (cos da + 4cos 2a + 3). (2.134) 





84 2. Functions 





For large values of n sin” a and cos” a , can be expressd by applying the formulas for cos na and sin na 


(see 2.7.2.3, p. 82). 


2.7.3 Description of Oscillations 
2.7.3.1 Formulation of the Problem 
In engineering and physics one often meets quantities depending on time and given in the form 

u(t) = Asin(wt + p). (2.135) 
They are called also sinusoidal quantities. Their dependence on time results in a harmonic oscillation. 
The graphical representation of (2.135) results in a general sine curve, as shown in Fig. 2.39. 





ΖΝ 


Figure 2.39 Figure 2.40 
The general sine curve differs from the simple sine curve y = sin z: 
a) by the amplitude A, i.e., the greatest distance between its points and the time axis t, 


2π 
b) by the period T = —, which corresponds to the wavelength (with w as the frequency of the oscilla- 
ω 
tion, which is called the angular or radial frequency in wave theory), 
c) by the initial phase or phase shift by the initial angle ϱ 4 0. 
The quantity u(t) can also be written in the form 
u(t) = asinwt + b cos wt. (2.136) 
b 
Here for a and b A = Va? 4+ b? and tan y = — holds. The quantities a, b, A and y can be represented 
a 
as sides and angle of a right triangle (Fig. 2.40). 
2.7.3.2 Superposition of Oscillations 
In the simplest case the superposition of oscillations is 
the addition of two oscillations with the same frequency. 
It results again in a harmonic oscillation with the same 
frequency: 
l Aj sin(wt+y1)+ Ag sin(wt+y2) = Asin(wt+y)(2.137a) 
ot with 


ΑΞ ν Ay? + A»? --2A1A5 cos(q» = 1), (2.137b) 


A; sin y1 + Ag sin Ye 














tany = ; 2.137c 
5 AY Aı cos pı + A2 cos p2 ( c) 
) where the quantities A and ọ can be determined by a 
Figure 2.41 vector diagram (Fig. 2.41a). 


A linear combination of several sine functions with the same frequency is also possible and yields a 
general sine function (harmonic oscillation) with the same frequency: 


X cAisin(wt + gi) = Asin(wt + p). (2.138) 
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2.7.3.3 Vector Diagram for Oscillations 

The general sine function (2.135, 2.136) can be represented easily by the polar coordinates p = A, yp 
and by the Cartesian coordinates x = a, y — b (see 3.5.2.1, p. 190) in a plane. The sum of two such 
quantities then behaves as the sum of two summand vectors (Fig. 2.41a). Similarly the sum of several 
vectors results in a linear combination of several general sine functions. This representation is called a 
vector diagram. 

The quantity u for a given time t can be determined from the vector diagram with the help of Fig. 
2.41b: First the time axis OP(t) has to be put through the origin O, which rotates clockwise around 
O by a constant angular velocity w. At start t = 0 the axes y and t coincide. Then at any time t 
the projection ON of the vector ti onto the time axis is equal to the absolute value of the general sine 
function u = Asin(wt + p). For time t = 0 the value ug — Asin q is the projection onto the y-axis 
(Fig. 2.41b). 


2.7.3.4. Damping of Oscillations 
The function u(t) = Ae sin(wt + yo) (a, t > 0) (2.139) 


yields the curve of a damped oscillation 
(Fig. 2.42). 

The oscillation proceeds along the t-axis, 
while the curve asymptotically approaches 
the t-axis. The sine curve is enclosed by the 
exponential curves u(t) = +Ae~“, and it 
contacts them in the points 


1 
(t+3)7 -v 
Apo, Ay, Ao,..., Ap = Sn o oao, 


ω 








, 





1 
(5*3) 6 
(—1)^A exp | -a2——£————— 


Figure 2.42 s 

The intersection points with the coordinate axes are B = (0, Asin qo), Co, C1, Co, ..., Cy — 

{π- ι kr —pota ; ; x 
(A, o) . The extrema Do, Di , Do,... are at th = BE AE ; and the inflection points Eo, F4, 

ω ω 
kr — po+2a . ω 
Ῥο,... 819 αἲ ἐκ ο μονος with tana = —. 
a 
Yi 


bance π . 
The logarithmic decrement of the damping is 6 = In = a—, where y; and yj41 are the ordinates 
ω 


Yi+1 








of two consecutive extrema. 


2.8 Cyclometric or Inverse Trigonometric Functions 

The cyclometric functions or arcus functions are the inverses of the trigonometric functions. For a 
unequivocal definition the domain of the trigonometric functions is to be decomposed into monotony 
intervals, to get an inverse function for every monotony interval. So, there are infinitely many such 
intervals, and for each its inverse is to be defined. In order to distinguish them an index k is to be 
assigned according to the corresponding interval. Obviously the trigonometric inverse functions are 
monotony in these intervals. 


2.8.1 Definition ofthe Inverse Trigonometric Functions 


How to define the inverse trigonometric functions will be shown here for the inverse of the sin function 
(Fig. 2.43). The usual notation for it is arcsin x . The domain of y = sin v will be split into monotony 
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Figure 2.43 Figure 2.44 Figure 2.45 Figure 2.46 











T T 
intervals ky — — € x < kr + — with k = 0,+1,+2,.... Reflecting the curve of y — sin z in the line 





y = x yields the curve of the inverse function 
y = arc, sine (2.140a) 


with the domains and ranges 














-l<a<+l1 and kr- Xy kr, where k=0,+1 ον (2.140b) 


The form y = arc; sin x has the same meaning as x = sin y. 
Similarly, one can get the other inverse trigonometric functions which are represented in Fig. 2.44— 
2.46. The domains and ranges of the inverse functions can be found in Table 2.6. 





2.8.2 Reduction to the Principal 
Value 


In their domain the arcus functions have the so-called 
principal values for k = 0, written usually without an = --------------}|- Ż --ķ------}--------------------- 
index, g.e., as arcsinx — arcosin z. In Fig. 2.47 the * 
principal values of the inverse functions are presented. Sues 








arccot ^ 





The values of the different inverses can be calculated 
from the principal values by the following formulas: -1 


xy 


arcy sin z = kr + (—1)" arcsin z. (2.141) arctan 
T 
ee (k+ 1)r — arccosx (k odd), (2.142) 2 
kr + arccos x (k even). : 
Figure 2.47 
arcy tan zx — kg + arctan x. (2.143) 
arc cot x = kr + arccot x. (2.144) 


α A: arcsin0 = 0, arcsin 0 = kr. 

E B: arccot 1 = T arc, cot 1 = τ + kr. 
1 m 1 π 

E C: arccos zT g cosg 5y + (k + 1)r for odd k, 

= d + kr for even k. 


Remark: Calculators give the principal values of the trigonometric inverses. 
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Table 2.6 Domains and ranges of the inverses of trigonometric functions 






































Trigonometric 
Inverse function Domain Range function with the 

same meaning 
arc sine T T ‘ 

: -l<a< i a pice foe oe con 
Jar EIE ) l<a2<l kr q Suskn 2 r-—siny 
per eae } -l<ar<l km € y € (k 4-1) T cosy 
y — arcy cos 
arc tangent apes gee k ο ο cde QT equa 
y — amp teny co«z«o|kr-y«y 7*5 z = tany 
arc cotangent 

D —oo < T < oo kr < y € (k 4-1) z= cot y 
y — arcg cot az 
k — 0,1, €£2,.... For k = 0 one gets the principal value of the inverse functions, 
which is usually written without an index, e.g., arcsin x = arco sin z. 











2.8.3 Relations Between the Principal Values 








csi T ao — x —arccos V1— z? (—1X r X0), (2.145 
arcsin x arccos x = arctan ————— — 145 
2 V1-— a? arccos V1 — x? (0 € x « 1). 
Eos T οὐ aoi m — arcsin y1 — x? (π-1-«ε-«0), (2.146 
arccosz — — — arcsin x — an 14 
2 Vi-aPO larnsnV/i-z2 (0 € x € 1). 


T 
arctan © = a7 arccot x = arcsin ———— (2.147 


"ET 


1 1 
arccot——a (x <0) = ο ια VES 5 (z <0), 
arctan x = 1 (2.148 
arccot — (x > 0) arccos ————— (x > 0). 
x A 
ta =~ — arct (2.149 
arccot x = — — arctan x — arccos ————— 4 
2 YL "P 








1 n 
7 — arcsin x <0), 
arctan= +m (x< 0 AX (z X 0) 


arccot £ = 1 1 (2.150 
arctan — r0 arcsin ———— — z 0). 
ao ty [ena ean 
2.8.4 Formulas for Negative Arguments 
arcsin(—x) = — arcsin x. (2.151) arccos(—x) = m — arccos x. (2.153) 
arctan(—x) = — arctan 2. 52) arccot(—x) = m — arccot x. (2.154) 
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2.8.5 Sum and Difference of arcsin x and arcsin y 


arcsin x + arcsin y = arcsin CIE + yvi- a) (zy <Oora? +y? <1), 
= 7 — arcsin (γι-ῳ + yvi—@) (a >0,y>0,a*+y?>1), (2.155b 
= —7 — arcsin («A - + yi-@#) (2 <0,y <0, 2? +y? > 1). (2.155¢ 
arcsin x — arcsin y = arcsin («i-e = wi-3) (zy > 0 or z? 4 y? € 1), 
= « — arcsin (ονι-ῳ — wi-s) (a >0,y <0, 2? +y?>1), (2.156b 
= —7 — arcsin (s - — wi-s) (x « 0, y » 0, £? +y? > 1). (2.156c 


2.8.6 Sum and Difference of arccos z and arccos y 


arccos £ + arccos y = arccos (ου —vVl1-z^f1— y) (x 4 y 2 0), 


= 2r — arccos (cy —V1-zaz?^J1— v) (x 4- y « 0). 


arccos @ — arccos y = — arcoos (ry + VT= x?/1 -y) (x 2 y), 
= arceos (xy + VI= 28/1 — y?) (a <y). 


2.8.7 Sum and Difference of arctan z and arctan y 


ty (xy « 1), 
"y 





x 
arctan x + arctan y = arctan Τ 





£i 
= m + arctan 1 ty 


(x > 0, zy > 1), 


zc 
=-—7 + arctan Τ s 





(a <0, xy 1). 


z—y 
arctan x — arctan y = arctan (zy > —1), 
lc cy 





= r + arctan = (z > 0, zy < —1), 


1+ ay 


g= y 
=-—7 + arctan - 
1 ση 








(a <0, xy « —1). 


2.8.8 Special Relations for arcsin z, arccos x, arctan x 


1 
2arcsin zz = arcsin (ον 1-- a?) (i < a] : 


1 
= 7 — arcsin (2x 1 — 2?) συ 


(2.155a 


(2.156a 


(2.157a 
(2.157b 
(2.158a 


(2.158b 


(2.159a 





(2.159b 
(2.159c 
(2.1602. 


(2.160b 





(2.160c 


(2.161a) 


(2.161b) 
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1 
= —r — arcsin (22v1 - a) (4 κας E : (2.161c 








2arccosz — arccos(27? — 1) (0X x «€ 1), (2.162a 
= 2n — arccos(23? — 1) (—1 € x « 0). (2.162b 
2arctan zz = arctan 1 7 (|z| < 1), (2.163a 
] 2x . ' 

= T + arctan Iz (£ > 1), (2.163b 

: 2x . 1 

— —7 - arctan ine (z < —1). (2.163c 

cos(n arccos £) = Ta(£) (n 1), (2.164 


where n 7 1 can also be a fractional number and T; (x) is given by the equation 
(a+ Va®=1)" +(x- az =T)" 
Ξ ; 


For any integer n, Ta(x) is a polynomial of x (a Chebyshev polynomial). To study the properties of the 
Chebyshev polynomials see 19.6.3, p. 988. 


T, (x) = 








(2.165 


2.9 Hyperbolic Functions 
2.9.1 Definition of Hyperbolic Functions 


Hyperbolic sine, hyperbolic cosine and hyperbolic tangent are defined by the following formulas: 
at ο 


e qe egt 


sinh x = E , (2.166) cosh z = E. (2.167) tanh x = I 


The geometric definition (see 3.1.2.2, p. 132), is an analogy to the trigonometric functions. 
Hyperbolic cotangent, hyperbolic secant and hyperbolic cosecant are defined as reciprocal values of the 
above hyperbolic functions: 


(2.168) 


1 P pet 1 2 
cp (2.169) sech ¢ = ——— = —~_ (2.170) 


coshhz  e*-e-*' 





coth z — = - 
tanhz | e?—e-* 


cosech z — E B fs : (2.171) 
sinhr e*—e* 
The shapes of curves of hyperbolic functions are shown in Fig. 2.48—2.52. 
2.9.2 Graphical Representation of the Hyperbolic Functions 
2.9.2.1 Hyperbolic Sine 


y — sinh x (2.166) is an odd strictly monotone increasing function between —oo and +00 (Fig. 2.49). 
The origin is its symmetry center, the inflection point, and here the angle of slope of the tangent line is 


π 
J= There is no asymptote. 


2.9.2.2 Hyperbolic Cosine 


y — cosh x (2.167) is an even function, it is strictly monotone decreasing for x < 0 from +00 to 1, and 
for x > 0 it is strictly monotone increasing from 1 until +00 (Fig. 2.50). The minimum is at z — 0 
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Figure 2.48 Figure 2.49 Figure 2.50 
and it is equal to 1 (point A(0, 1)); it has no asymptote. The curve is symmetric with respect to the 


2 
i : y X E 
y-axis and it always stays above the curve of the quadratic parabola y = 1+ z (the broken-line curve). 


Because the function demonstrates a catenary curve, the curve is called the catenoid (see 2.15.1, p. 107). 


2.9.2.3 Hyperbolic Tangent 


y = tanh x (2.168) is an odd function, for —oo « x < +00 strictly monotone increasing from —1 to +1 
(Fig. 2.51). The origin is the center of symmetry, and the inflection point, and here the angle of slope 





T 
of the tangent line is y = 4 The asymptotes are the lines y = +1. 





Figure 2.51 


2.9.2.4 Hyperbolic Cotangent 


y — cothz (2.169) is an odd function which is not continuous at x = 0 (Fig. 2.52). It is strictly 
monotone decreasing in the interval —oo « x < 0 and it takes its values from —1 until —oo ; in the 
interval 0 «€ r « -oo it is also strictly monotone decreasing with values from 4-oo to --1. It has no 
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inflection point, no extreme value. The asymptotes are the lines x — 0 and y = +1. 


2.9.8 Important Formulas for the Hyperbolic Functions 


There are similar relations between the hyperbolic functions as between trigonometric functions. The 
validity of the following formulas can be shown directly from the definitions of hyperbolic functions, or 
considering the definitions and relations of these functions also for complex arguments, from (2.199)— 
(2.206), they can be calculated from the formulas known for trigonometric functions. 


2.9.3.1 Hyperbolic Functions of One Variable 








cosh? x — sinh? x — 1, (2.172) coth? z — cosech? x — 1, (2.173) 
sech? z + tanh? y — 1, (2.174) tanh z: coth z = 1, (2.175) 
sinh x . cosh x 

= tanhz, (2.176) - = coth z. (2.177) 
cosh x sinh x 


2.9.3.2 Expressing a Hyperbolic Function by Another One with the 
Same Argument 


The corresponding formulas are collected in Table 2.7. 


2.9.3.3 Formulas for Negative Arguments 


sinh(—r) — — sinh z, (2.178) cosh(—a) = cosh z, (2.180) 
tanh(—x) — — tanh z, (2.179) coth(—x) — — coth x. (2.181) 


Table 2.7 Relations between two hyperbolic functions with the same arguments for x > 0 





sinh x cosh x tanh x coth x 
sinh x — cosh? x — 1 -a T =i 
γι- tanh? x y coth? x — 1 

1 coth x 
V1- tanh?z | ycoth? x — 1 
sinh x Vcosh? x — 1 1 





n 


cosh x sinh? £ + 











tanh x 
sinh? x +1 cosh x coth x 
V'sinh? z 4-1 cosh x 1 

coth s — 























sinh x leosh? 2 — 1 tanh x 





2.9.3.4 Hyperbolic Functions of the Sum and Difference of Two Arguments 
(Addition Theorems) 


sinh(x + y) = sinh x cosh y + cosh x sinh y, (2.182) 








cosh(x + y) = cosh z cosh y + sinh x sinh y, (2.183) 
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tanh z + tanh y 1 coth z coth y 

tanh(x + y) = —— 2.184 th(a + y) = ————_. 2.185 

pnie) 1 tanh z tanh y ' ( J eotie y) coth x + coth y ( ) 
2.9.3.5 Hyperbolic Functions of Double Arguments 

2 tanh x 

sinh 2x — 2sinh x cosh z, (2.186) tanh 2x = 14+ tanh? x’ (2.188) 

cosh 2a = sinh? x sh? a: 2. 14 coth? 

cosh 2x = sinh” x + cosh’ x, (2.187) sah = coth’ g (2.189) 

2 coth x 

2.9.3.6 De Moivre Formula for Hyperbolic Functions 

(cosh x + sinh z)” = (ey — e*'"* = coshna + sinh na. (2.190) 
2.9.3.7 Hyperbolic Functions of Half-Argument 

T 1 x 1 

sinh = = +4/=(cosha — 1), (2.191) cosh = = 4/=(cosha + 1), (2.192) 

2 2 2 2 
The sign of the square root in (2.191) is positive for x > 0 and negative for x < 0. 
rx  coshr-1l sinh x x sinh x cosh a 4- 1 

anh l2 —————— -————— : oth — = —_——_ = —____. 2.194 

tank 2 sinh 2; cosha +17 Ἢ 2. coshz-1 sinh x ae) 
2.9.3.8 Sum and Difference of Hyperbolic Functions 

sinh z + sinhy = 2sinh T odi + x i (2.195) 

r-cy u—y 
cosh x + cosh y = 2 cosh cosh ao (2.196) 
cosh x — cosh y = 2 sinh ZEY sinh (2.197) 
σῇ L ct 
tanh x + tanh y = SNEEU) (2.198) 


cosh g cosh y ` 


2.9.3.9 Relation Between Hyperbolic and Trigonometric Functions with 
Complex Arguments z 


sinz = —isinhiz, (2.199) sinh z = —isiniz, (2.203) 
cos z = cosh iz, (2.200) cosh z = cosiz, (2.204) 
tanz = —itanhiz, (2.201) tanh z = —i tan iz, (2.205) 
cot z — icothiz, (2.202) coth z = icotiz. (2.206) 


Every relation between hyperbolic functions, which contains « or az but not az+b, can be derived from 
the corresponding trigonometric relation with the substitution isinh x for sina and cosh z for cos o. 


Bl A: cos?a+sin?a = 1, cosh? x + i? sinh? x = 1 or cosh? x — sinh? x = 1. 
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W B: sin2o — 2sina cosa, isinh 2 — 2isinh x cosh x or sinh 2r — 2sinh z cosh 2. 


2.10 Area Functions 
2.10.1 Definitions 


The area functions are the inverse functions of the hyperbolic functions, i.e., the inverse hyperbolic 
functions. The functions sinh x , tanh z, and coth x are strictly monotone, so they have unique inverses 
without any restriction; the function cosh z has two monotonic intervals so there are to consider two 
inverse functions. The name area refers to the fact that the geometric definition of the functions is the 
area of certain hyperbolic sectors (see 3.1.2.2, p. 132). 


2.10.1.1 Area Sine 
The function y = Arsinh x (2.207) 


(Fig. 2.53) is an odd, strictly monotone increasing function, with domain and range given in Ta- 
ble 2.8. (2.53) is equivalent to the expression r = sinh y. The origin is the center of symmetry and 


T 
the inflection point of the curve, where the angle of slope of the tangent line is o — a 


2.10.1.2 Area Cosine 
The functions y= Arcosha and y= — Arcosh x (2.208) 


(Fig. 2.54) or x = cosh y have the domain and range given in Table 2.8; they are defined only for 
x > 1. The function curve starts at the point A(1,0) with a vertical tangent line and the function 
increases or decreases strictly monotonically respectively. 


y-Arcosh x 


23456x 





y=-Arcosh x 


Figure 2.53 Figure 2.54 


Table 2.8 Domains and ranges of the area functions 





Hyperbolic function 


ros 1 1 £ ye - . 
Area function Domain Range with same meaning 





area sine 


y — Arsinh x =œ < Tr <œ | -œ < y < œ x =sinhy 

area cosine 
= Arcosh x < oo 

(=e LSa< 00 - Be z = cosh y 
area tangent 

y — Artanh x |z|«1 |—-oo«y «oo x = tanh y 
area cotangent 

y = Arcoth x |z|>1 | -coo<y<0 x =cothy 














0<y<o 
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-4 -3 -2 - 





Figure 2.55 Figure 2.56 
2.10.1.3 Area Tangent 
The function y — Artanh x (2.209) 


(Fig. 2.55) or r — tanhy is an odd function, defined only for |x| < 1, with domain and range given in 
Table 2.8. The origin is the center of symmetry and also the inflection point of the curve, and here the 





π 

angle of slope of the tangent lineis o — —. The asymptotes are vertical, their equations are x = +1. 
2.10.1.4 Area Cotangent 

The function y — Arcoth (2.210) 


(Fig. 2.56) or x — coth y is an odd function, defined only for |x| > 1, with domain and range given in 
Table 2.8. In the interval —oo < « < —1 the function is strictly monotone decreasing from 0 until —oo, 
in the interval 1 < 7 < +00 it is strictly monotone decreasing from +00 to 0. It has three asymptotes: 
their equations are y = 0 and x — t1. 





2.10.2 Determination of Area Functions Using Natural Logarithm 


From the definition of hyperbolic functions ((2.166)-(2.171), see 2.9.1, p. 89) follows that the area 
functions can be expressed with the logarithm function: 














Arsinh x = In (x + V2? +1), (2.211) 
1 
Arcosh x — In (x 4- Vx? — 1) =In ( 3 ;) (x 2 1), (2.212) 
α-- γα -- 
1... ee 1. +1 
Artanh z — jl 1 τ (|z| < 1), (2.213) Arcoth x = jn τ i (|[r| 2» 1). (3.214) 
= z- 


2.10.3 Relations Between Different Area Functions 


. . x Vi2+1 
Arsinh x = (sign) Arcosh Va? + 1 = Artanh ——— = Arcoth ———— 
ena) Vx? +1 T 


(|x| « oo), (3.215) 





ya? —1 x 
Arcosh x — Arsinh v x? = Artank — Arcotl £.> 1), 2.216 
rcosh x rsinh Vx rtanh —— rcoth ΕΞ (a > 1) ( ) 
x 1 1 
Artanh x = Arsinh vat = Arcoth τ (sign x) Arcosh Vina (| <4), (2.217) 
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1 
Arcoth z — Artanh — = (sign x) Arsinh 
x 


|z] 


α2-1 


= (sign x) Arcosh (Ix] > 1). (2.218) 


2.10.4 Sum and Difference of Area Functions 
Arsinh z + Arsinh y = Arsinh (ον 1 γην 1 -- 2) ; (2.219) 


























Arcosh « + Arcosh y = Arcosh (cy + 4/ (x? — 1)(y? — 1)) ; (2.220) 

Artanh x + Artanh y = Artanh τ BS f (2.221) 
2.10.5 Formulas for Negative Arguments 

Arsinh(—«) = — Arsinh z, (2.222) 

Artanh(—z) — — Artanh z, (2.223) Arcoth(—z) — — Arcoth x. (2.224) 


The functions Arsinh, Artanh and Arcoth are odd functions, and Arcosh (2.212) is not defined for 
arguments x « 1. 


2.11 Curves of Order Three (Cubic Curves) 
A curve is called an algebraic curve of order n if it can be written in the form of a polynomical equation 
F(xv, y) — 0 of two variables where the left-hand side is a poynomial expression of degree n. 


Wl The cardioid with equation (z? - y?) (a? - y? — 2ax) — a?y? — 0. (a » 0) (see 2.12.2, p. 98) isa curve 
of order four. The well-known conic sections (see 3.5.2.11, p. 206) result in curves of order two. 


2.11.1 Semicubic Parabola 

The equation y=ax*/? (a> 0, x > 0) (2.225a) 
or in parametric form «=t?, y-at? (a0, —oo«t« oo) (2.225b) 
gives the semicubic parabola (Fig. 2.57). It has a cuspidal point at the origin, it has no asymptote. 
The curvature K = τ. m takes all the values between oo and 0. The arclength of the curve 
between the origin and the point P(x,y) is L = l ja + 9a2x)3/? — 8). 


27a? 
2.11.2 Witch of Agnesi 


aè 


The equation y-——— 
y= 2 +r? 


(a » 0, —oo « z < oo) (2.2262) 

determines the curve represented in Fig. 2.58, the witch of Agnesi. It has an asymptote with the 
α 

equation y = 0, it has an extreme point at A(0, a), where the radius of curvature is r = y The inflection 

33 

s 


a 3a 


ΕΙ 








points B and C are at ( ) , where the angles of slope of the tangent lines are tan yp = 4 
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y 
a>0 
0 x 
Figure 2.57 Figure 2.58 Figure 2.59 


The area of the region between the curve and its asymptote is equal to S = ra”. The witch of Agnesi 
(2.226a) is a special case of the Lorentz or Breit-Wigner curve 

ο α 

~ B+ (a — 0)? 
E Ὃν Fourier transform of the damped oscillation is the Lorentz or Breit-Wigner curve (see 15.3.1.4, 
p. 791). 


y (a>0,b#0). (2.226b) 


2.11.3 Cartesian Folium (Folium of Descartes) 


Theequation a?--y? —3axy (a0) or (2.227a) 
] tric f 3at 3at? id 
in parametric form  r — — —, y — ——4 with 
P IFE 148 
t —tanXP0Or (a»0,—o00«t«-—1and —1«t «oo) (2.227b) 


gives the Cartesian folium curve represented in Fig. 2.59. The origin is a double point because the 
curve passes through it twice, and here both coordinate axes are tangent lines. At the origin the radius 


x 3a ς ; 
of curvature for both branches of the curve is r — *' The equation of the asymptote is x +y +a = 0. 


] 3 3 . 3a? 
The vertex A has the coordinates A (50. τα). The area of the loop is Sı = rs The area $5 between 
the curve and the asymptote has the same value. 


2.11.4 Cissoid 


r3 








The equation y? = z (a > 0), (2.228a) 
α- απ 
: tric f at? at? ith 
or in parametric form r — — —;, y = ———; wit 
P 1+2 pg 
t =tanxP0r (a> 0,— < t< oo) (2.228b) 
aid 
or with polar coordinates p = EK (a » 0) (2.228c) 
cos p 


(Fig. 2.60) describes the locus of the points P for which 


0P = MQ (2.229) 
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α 
is valid. Here M is the second intersection point of the line 0P with the drawn circle of radius 2) and 
Q is the intersection point of the line OP with the asymptote x = a. The area between the curve and 


: 3 
the asymptote is equal to S — qm. 





Figure 2.60 Figure 2.61 
2.11.5 Strophoide 


Strophoide is the locus of the points Pj and P», which are on an arbitrary half-line starting at A (A is 
on the negative x-axis) and for which the equalities 

MP, =MP,=0M (2.230) 
are valid. Here M is the intersection point with the y-axis (Fig. 2.61). The equation of the strophoide 
in Cartesian, and in polar coordinates, and in parametric form is: 








bas cos 2 
yog ( + z) (a > 0), (2.231a) p= a ja So), (2.231b) 
α- π COS (p 
Pu d ith t — tan XPOr (a0 «t « oo) (2.231c) 
=a = a4 with = tan X a —Co oo). . ς 
μπε Ἡ +1 ᾿ 
The origin is a double point with tangent lines y = +a. The asymptote has the equation x = a. The 





1 
vertex is A(—a, 0). The area of the loop is Sı = 2a? — σπα”, and the area between the curve and the 


1 
asymptote is Sy = 2a” + σπα”. 


2.12 Curves of Order Four (Quartics) 


2.12.1 Conchoid of Nicomedes 
The Conchoid of Nicomedes (Fig. 2.62) is the locus of the points P, for which 


0P —0M x1 (2.232) 


holds, where M is the intersection point of the line 0/,0P5 with the asymptote r — a. The “+” sign 
belongs to the right branch of the curve, the *—" sign belongs to the left one in relation to the asymptote. 
The equations for the conchoid of Nicomedes are the following in Cartesian coordinates, in parametric 
form and in polar coordinates: 


(a —a)?(a? + y?)— Pa? 20. (a 0,0 0), (2.233a) 








w=a+lcosy, y=atanyg+lsiny 
(a > 0, right branch: — 7 «φς« 5 left branch: z <y< T) (2.233b) 
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p= +1 (“+” sign: right branch, “—” sign: left branch,) (2.233c) 





Figure 2.62 


1. Right Branch: The asymptote is = a. The vertex A is at (a +1, 0), the inflection points B, C 
have as z-coordinate the greatest root of the equation x3 — 3a?x + 2a(a? — I?) = 0. The area between 
the right branch and the asymptote is S = oo. 

2. Left Branch: The asymptote is « = a. The vertex D is at (a — 1,0). The origin is a singular 
point, whose type depends on a and I: 

Case a) For | < a it is an isolated point (Fig. 2.62a). The curve has two further inflection points Æ 
and F, whose abscissa is the second greatest root of the equation z? — 3a?z -- 2a(a? — 1?) — 0. 


Caseb) For! > athe origin is a double point (Fig. 2.62b). The curve has a maximum and a minimum 


tl? — a? 
— — ——— . Here the 





value at z — a — Val? . At the origin the slopes of the tangent lines are tana = 
IVE — a? 

2a ᾿ 
Case c) For l = a the origin is a cuspidal point (Fig. 2.62c). 


2.12.2 General Conchoid 

The conchoid of Nicomedes is a special case of the general conchoid. One gets the conchoid of a given 
curve by elongating the length of the position vector of every point by a given constant segment +1. 
Considering a curve in a polar coordinate system with an equation p = f(y), then the equation of its 
conchoid is 


p— f(e) xt. (2.234) 


So, the conchoid of Nicomedes is the conchoid of the line. 


a 


radius of curvature is rg — 








2.12.3 Pascal’s Limagon 
The conchoid of a circle is called the Pascal limaçon (Fig. 2.63) if in (2.232) the origin is on the 
perimeter of the circle, which is a further special case of the general conchoid (see 2.12.2, p. 98). The 
equations in the Cartesian and in the polar coordinate systems and in parametric form are the following 
(see also (2.246c), p. 105): 

(a? --y? — az)? = P(x? - y?) (a 0,1» 0), (2.235a) 


p=acosy+l (α»Σ0, 1» 0), (2.235b) 
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v —acos!p--lcosp, y-acospsing-Flsing (a0, 12 0, 0 € o « 2x) (2.235c) 





with a as the diameter of the circle. The vertices A, B are at (a 3- 0,0). The shape of the curve depends 

on the quantities a and I, as can be seen in Fig. 2.63 and Fig. 2.64. 

a) Extreme Points and Inflection Points: For a > l the curve has four extreme points C, D, E, F; 

—l+ VI? + 8a? 
4a 





for a € lit has two; they are at (cos g= 


2a? + 1? 
3al ` 


| For a < 1 < 2a there exist two inflection 


points G and H at (s p=- 


PO wás-B 


b) Double Tangent: For | < 2a, at the points J and K at (-z ‘al 





) there is a double 
a 4a 


tangent. 
c) Singular Points: The origin is a singular point: For a < lit is an isolated point, for a > l it is a 





a<l<2a 





b) 

Figure 2.63 

double point and the slopes of the tangent lines are tana = pO here the radius of curvature is 
m= IVETE, 


For a = l the origin is a cuspidal point; then the curve is called a cardioid (see also 2.13.3, p. 103). 





2 
πα 
The area of the limacon is S = we + rl’, where in the case a > | (Fig. 2.63c) the area of the inside 


loop is counted twice. 


2.12.4 Cardioid 
The cardioid (Fig. 2.64) can be defined in two different ways, as: 
1. Special case of the Pascal limagon with 
0P =0M +a, (2.236) 


where a is the diameter of the circle. 





2. Special case of the epicycloid with the same diameter a for the fixed and for the moving circle (see 
2.13.3, p. 103). The equation is 
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(a? + y?)? — 2aa(a? + y*) =a?y? (a> 0), (2.237a) 
and the parametric form, and the equation in polar coordinates are: 
x =acosp(l+cosy), y=asiny(1+ cosy) 








(a > 0,0 < v « 2n), (2.237b) 

p=a(l+cosy) (a> 0). (2.237c) 

The origin is a cuspidal point. The vertex A is at (2a, 0); extreme points 
1 

C and D are at cos y = 3 with coordinates E j κα). The area 


= s 3 : m . : : 
a=1 is S = 370. i.e., six times the area of a circle with diameter a. The 


Figure 2.64 length of the curve is L — 8a. 


2.12.5 Cassinian Curve 

The locus of the points P, for which the product of the distances from two fixed points F} and F> 
with coordinates (c, 0) and (—c, 0) resp., is equal to a constant a? # 0, is called a Cassinian curve 
(Fig. 2.65): 











FP. PP = 0. (2.238) 
The equations in Cartesian and polar coordinates are: 
(2 +y- 2e- y) =at, (a 0,0» 0), (2.2392) 
P = P cos 29 + Aj cf cos? 2p 4- (a — c*) | (a 0,c » 0). (2.239b) 
y a 
G E 
c Q P, A 
0* c x 
K I 
c«a«c42' a«c 





Figure 2.65 
The shape of the curve depends on the quantities a and c : 
Case a > cv/2: For a » cV2 the curve is an oval whose shape resembles an ellipse (Fig. 2.65a). 
The intersection points A, C with the x-axis are (+ Va? + 2 ,0), the intersection points B, D with the 
y-axis are (0, 3-γ/αΣ -- ὦ). 
Case a = cvV2: For a = cV2 the curve is of the same type with A, C (+cV3, 0) and B, D (0,+c), 


where the curvature at the points P and D is equal to 0, i.e., there is a narrow contact with the lines 
y — c. 














o 
I 





Case c « a « cV/2: For c « a < cy/2 the curve is a pressed oval (Fig. 2.65b). The intersection 


points with the axes are the same as in the case a 7 cv/2, also the extreme points B, D, while there are 


v 4Ac* — a a? . ; ; 
2 (t3 and there are four inflection points P, L, 
c 2c 











further extreme points E, G, K, I at (« 
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1 . a* — 
M,N at |ty/<(m—n), + aim +n) J| with n = 


4 


and m = 








Case a = c: For a = c there is the lemniscate. 
Case a < c: Fora < c there are two ovals (Fig. 2.65c). The intersection points A, C and P,Q 





with the x-axis are at (+ Va? + 2, 0 

V4c4 — a4 

at | + y H 
2c 2c 

coordinate representation. 











2.12.6 Lemniscate 





) and ( + Ve — a?,0). The extreme points E, G, K, I are 
242 p 


2 
α ; ; : 
F . The radius of curvature is r — —4— ————;, , where p satisfies the polar 
: c* — a^ 4 3p* 


The lemniscate (Fig. 2.66) is the special case a — c of the Cassinian curve satisfying the condition 





Ba 
; (RR 
nP.EP- ( 1 ) ; 





where the fixed points Fi, F3 are at (£a, 0). The equation 


in Cartesian coordinates is 
(a? + y?)? — 2a?(a? — 4?) =0 (a 
and in polar coordinates 


p=ay/2cos2p (a> 0). 


The origin is a double point and an infl 


(2.240) 


> 0) (2.241a) 


(2.241b) 


ection point at the 








same time, where the tangents are y = 4 


The intersection points A and C with th 
αγ3 
2^" 














G, K, I are at ( 


cs Figure 2.66 





e z-axis are at (+ a/2, 0), the extreme points of the curve E, 





a π 
5) . The polar angle at these points is p — to The radius of curvature is 


2a? 
7 and the area of every loop is S = αὖ. 
p 


2.13 Cycloids 


2.13.1 Common (Standard) Cycloid 


The cycloid is a curve which is describe 
along a line without sliding (Fig. 2.67). 
is the following: 


x — a(t — sint), y — a(1— cost) 

(a0, —oo«t«oo) (2.242a) 
where a is the radius of the circle and t 
is the angle X PC4B in radian measure. 
In Cartesian coordinates 





à —1 
x +4/y(2a — y) = aarcy cos 2 
a 





(a>0, k=0,+1,2,...) (2.242b) 
holds. 











d by a point of the perimeter of a circle while the circle rolls 
The equation of the usual cycloid written in parametric form 








Figure 2.67 


102 2. Functions 





The curve is periodic with period O00, = 2ra. At Oo, O1, Os, .. ., Oy — (2kra,0) there are cusps, the 
vertices are at Ag,4 — ((2k--1)ra,2a. (k = 0,+1,+2,...)). The arc length of OpP is L — 8a sin?(t/4), 
the length of one arch is Lo,4,0, = 8a. The area of one arch is S = 37a?. The radius of curvature 
is r = 4asin it, at the vertices r4 — 4a. The evolute of a cycloid (see 3.6.1.6, p. 254) is a congruent 
cycloid, which is denoted in Fig. 2.67 by the broken line. 


2.13.2 Prolate and Curtate Cycloids or Trochoids 
Prolate and curtate cycloids or trochoids are curves described by a point, which is inside or outside of a 
circle, fixed on a half-line starting from the center of the circle, while the circle rolls along a line without 
sliding (Fig. 2.68). 
The equation of the trochoid in parametric form is 

x — a(t — Asint), (2.2432) 


y — a(1 — Acost), (2.243b) 














where a is the radius of the circle, t is the angle X PC1M, and Aa — Ci P. 

The case À > 1 gives the prolate cycloid and \ < 1 the curtate one. 

The period of the curve is O9O; = 27a, the maximum points are at Ay, Ao,..., Api = ((2k+1)ra, (1+ 
A)a), the minimum points are at Bo, Bi, Bo,..., By = (2kma,(1 — A)a) (k = 0,41,+2,...). The 

















prolate cycloid has double points at Do, Di, Do,..., Dy — [2era τα E — Vr? — to?) | , where to is the 


smallest positive root of the equation t = 
Asint. 

The curtate cycloid has inflection points 
at EA, E», m Exi = 

[a (aro; cos À — AV/1— 12) ,a(1— x). 
The calculation of the length of one cy- 
cle can be done by the integral L = 


2π 
af V 1 4- A? — 2A cost dt. The shaded 
0 


area in Fig. 2.68 is S = «a?(2 4- X2). 
The radius of curvature 
(1 4- M — 2A cos t)? 
A(cost — A) 
(12A)? 


the value ra — —a-——— —— at the max- 


λ 
(lA)? 
λ 





isr—a , which has 








ima and the value rg = a at the 


Figure 2.68 minima. 


2.13.3 Epicycloid 

A curve is called an epicycloid, if it is described by a point of the perimeter of a circle while this circle 
rolls along the outside of another circle without sliding (Fig. 2.69). The equation of the epicycloid in 
parametric form is 








A A 
xz = (A+ a) cosy — acos Lo, y = (A+a)siny — asin ndm (—oo « e « oo), (2.244) 


a 
where A is the radius of the fixed circle, a is the radius of the rolling one, and g is the angle ¢ COx. The 


. A 
shape of the curve depends on the quotient m — —. 
α 


2.13 Cycloids 103 





For m = 1 one gets the cardioid. 





Figure 2.70 


Case m integer: For an integer m the curve consists of m identically shaped branches surrounding the 


2k 
fixed curve (Fig. 2.69a). The cusps A1, A», ..., Ag areat | p— A, y = B (k 20,1,...,m— y) 1 
m 


2n 1 
the vertices B4, B5,..., Bm are at (» =A+2a, p=— (t + 5) | 
m 
Case m a rational fraction: If m is a non-integer rational number, the identically shaped branches 
follow each other around the fixed circle, overlapping the previous ones, until the moving point P re- 
turns back to the starting-point after a finite number of circuits (Fig. 2.69b). 


Case m an irrational: For an irrational m the number of round trips is infinite; the point P never 
returns to the starting-point. 
8(A +a) 
m ` 
curve is Liota = 8(A + a). The area of the sector A4 B4 A541 (without the sector of the fixed circle) is 
3A + 2a 4a(A +a 4a(A +a 
Ἔν, ( + ( ) ( ) 


Ap 
. The radius of curvature is r — —————— sin at the vertices rg = 
A ) 2a+A 2a’ B 2α-- Α 


The length of one branch is LA,g,A, — For an integer m the total length of the closed 





2.13.4 Hypocycloid and Astroid 
A curve is called a hypocycloid if it is described by a point of the perimeter of a circle, while this circle 
rolls along the inside of another circle without sliding (Fig. 2.70). The equation of the hypocycloid, 
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the coordinates of the vertices, the cusps, the formulas for the arc-length, the area, and the radius of 
curvature are similar to the corresponding formulas for the epicycloid, only “++a” is to be replaced by 
“—q”. The number of cusps for integer, rational or irrational m is the same as for the epicycloid (now 
m 2 1 holds). 
Case m = 2: For m = 2 the curve is actually the diameter of the fixed circle. 
Case m — 3: For m = 3 the hypocycloid has three branches (Fig. 2.70a) with the equation 

x = a(2cos q + cos 29), y = a(2sin y — sin 2y) (2.245a) 
and Ltotal = 16a, Stotai = 27a?. 
Case m = 4: For m = 4 (Fig. 2.70b) the hypocycloid has four branches, and it is called an astroid 
(or asteroid). Its equation in Cartesian coordinates and in parametric form is 


aq? +y? = A, (99455) r= Aco p, y= Asin y (0xq«m) (2.245c) 
and Ltotal = 24a = 6A, Stotal = ara, 
ya 











a) 1, a>0 






Figure 2.71 





Figure 2.72 


2.13.5 Prolate and Curtate Epicycloid and Hypocycloid 

The prolate and curtate epicycloid and the prolate and curtate hypocycloid, which are also called the 
epitrochoid and hypotrochoid, are curves (Fig. 2.71 and Fig. 2.72) described by a point, which is inside 
or outside of a circle, fixed on a half-line starting at the center of the circle, while the circle rolls around 


2.14 Spirals 105 





the outside (epitrochoid) or the inside (hypotrochoid) of another circle, without sliding. 
The equation of the epitrochoid in parametric form is 








A Aca 
x = (A + a) cos y — Aa cos ( τ -ο) : y — (Ac a)sin o — Aasin ( e e) ; (2.2462) 
a a 


where A is the radius of the fixed circle and a is the radius of the rolling one. For the hypocycloid “+a” 
is to be replaced by “—a”. For \a = CP one of the inequalities A > 1 or À « 1 is valid, depending on 
whether the prolate or the curtate curve is considered. 


For A = 2a, and for arbitrary À 4 1 the hypocycloid with equation 

z=a(l+A)cosy, y=a(1—A)siny (0< y < 27) (2.246b) 
describes an ellipse with semi-axes a(1 + A) and a(1 — A). For A = a it results in the Pascal limaçon 
(see also 2.12.3, p. 98): 

x =a(2cosy—Acos2y), y=a(2siny— Asin 2y). (2.246c) 


Remark: For the Pascal limacon on 2.12.3, p. 98 the quantity denoted by a there is denoted by 2a 
here, and the / there is the diameter 2a here. Furthermore the coordinate system is different. 


2.14 Spirals 
2.14.1 Archimedean Spiral 


An Archimedean spiralis a curve (Fig. 2.73) described by a point which is moving with constant speed 
v on a ray, while this ray rotates around the origin at a constant angular velocity w. The equation of 
the Archimedean spiral in polar coordinates is 


p=alyl, απο (a »0,—oo0 « 9 « oo). (2.247) 








Figure 2.73 Figure 2.74 


The curve has two branches (y < 0, > 0) in a symmetric position with respect to the y-axis. Every 
ray OK intersects the curve at the points 0, Ai, Ag,...,An,...; their distance is A;Aj,;; = 27a. The 


hs 2L 
arclength OP is L = z (ον q? -- 1-4 Arsinh e). where for large y the expression "m tends to 1. The 


2 (v2 + 1342 


a 
area of the sector POP, is S = (οι — v1). 'The radius of curvature is r — a 212 and at the 
p 


origin ro = 


NIS 


2.14.2 Hyperbolic Spiral 


The equation of the hyperbolic spiralin polar coordinates is 


a (a>0, =œ <p <0, 0< p< %0). (2.248) 
P 
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The curve of the hyperbolic spiral (Fig. 2.74) has two branches (p « 0, > 0) in a symmetric 
position with respect to the y-axis. The line y = a is the asymptote for both branches, and the origin is 


2 2 
a 1 1 a 
an asymptotic point. The area of the sector P,0P, is S = — | — — —],and lim S = — is valid. 
2 Ἆφι Q2 27100 21 


3 
νι 2 
The radius of curvature is r = = 5) À 
Q p 


2.14.3 Logarithmic Spiral 


The logarithmic spiralis a curve (Fig. 2.75) which intersects all the rays starting at the origin 0 at the 
same angle a. The equation of the logarithmic spiral in polar coordinates is 


p-—ac*? (a»0,—oo««o), (2.249) 
where k = cota. The origin is the asymptotic point of the curve. The arclength P,P is L = 
8 ymp P 8 
V1 +k? = v1 +k? 
Y — p1), the limit of the arclength 0^ calculated from the origin is Lo = Y. The 


radius of curvature is r = V1 + k?p = Lok. 


$ 2 π 3 à 
Special case of a circle: For a = 3 holds k = 0, and the curve is a circle. 


2.14.4 Evolvent of the Circle 


The evolvent of the circle is a curve (Fig. 2.76) which is described by the endpoint of a string while 
rolling it off a circle, and always keeping it tight, so that AB= BP. The equation of the evolvent of the 
circle is in parametric form 

1 -acosQ-Fapsing, y-asing — agcos p, (2.250) 


where a is the radius of the circle, and  — X B0x. The curve has two branches in symmetric position 
with respect to the z-axis. It has a cusp at A(a, 0), and the intersection points with the z-axis are at 





α x 1 
x = — ——, where go are the roots of the equation tan -- φ. The arclength of AP is L = zap. The 


COS (po 
radius of curvature is r — a — v2aL ; the centre of curvature P is on the circle, i.e., the circle is the 
evolute of the curve. 





Figure 2.75 Figure 2.76 Figure 2.77 
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2.14.5 Clothoid 


The clothoid is a curve (Fig. 2.77) such that at every point the radius of curvature is inversely propor- 
tional to the arclength between the origin and the considered point: 


2 
r= E. (a » 0). (2.2512) 


The equation of the clothoid in parametric form is 


f nt? f nt? 5 m 
£ = ay T [o — dt, y-ayr [sin — dt with t— ——, s-0P. (2.251b) 
g 2 j 2 αγπ 


The integrals cannot be expressed in terms of elementary functions; but for any given value of the pa- 
rameter t = to, t1,... it is possible to calculate them by numerical integration (see 19.3, p. 963), so the 
clothoid can be drawn pointwise. About calculations with a computer see the literature. 


The curve is centrosymmetric with respect to the origin, which is also the inflection point. At the inflec- 
tion point the x-axis is the tangent line. At A and B the curve has asymptotic points with coordinates 


(+7 +7) and ( EE avr 





2 2 


where the transition between a line and a circular arc is made by a clothoid segment. 


) . The clothoid is applied, for instance in road construction, 


2.15 Various Other Curves 
2.15.1 Catenary Curve 


The catenary curve is a curve wich has the shape of a homogeneous, flexible but inextensible heavy 
chain hung at both ends (Fig. 2.78) represented by a continuous line. The equation of the catenary 
curve is 


ezla + e t/a 


2 
y =acosh— =a 5 (a » 0). (2.252) 
a 
The parameter a determines the vertex A at (0,a). The curve is symmetric to the y-axis, and is always 
2 


higher than the parabola y = a + or , which is represented by the broken line in Fig. 2.78. The 
a 
οἳ/α Z ο-Ώ/α 


arclength of AP is L — asinh 2.5. 5 


. The area of the region (AP M has the value S = 


T y? T L 
aL — a? sinh -. The radius of curvature is r 2 — = a cosh? = =a+—. 
a a a a 





Figure 2.78 Figure 2.79 
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The catenary curve is the evolute of the tractrix (see 3.6.1.6, p. 254), so the tractrix is the evolvent (see 
3.6.1.6, p. 255) of the catenary curve with vertex A at (0, a). 


2.15.2 Tractrix 


The tractrix (the thick line in Fig. 2.79) is a curve such that the length of the segment PM of the 
tangent line between the point of contact P and the intersection point with a given straight line, here 
the x-axis, is a constant a. Fasting one end of an inextensible string of length a to a material point P, 
and dragging the other end along a straight line, here the x-axis, then P draws a tractrix. The equation 
of the tractrix is 


α a X Va? — y? 

x =aArcosh — + \/a? — y? = aln y yat y. (a>0,0<y<a). (2.253) 
y y 

The z-axis is the asymptote. The point A at (0,a) is a cusp. The curve is symmetric with respect to 














A a 
the y-axis. The arclength of AP is L = aln—. For increasing arclength L the difference L — x tends 
y 
to the value a(1 — In2) z 0.307a, where x is the abscissa of the point P. The radius of curvature is 
x -- pe 
r — acot —. The radius of curvature PC and the segment PE = b are inversely proportional: rb = a? . 
y 


The evolute (see 3.6.1.6, p. 254) of the tractrix, i.c., the geometric locus of the centers of circles of 
curvature C, is the catenary curve (2.252), represented by the dotted line in Fig. 2.79. 


2.16 Determination of Empirical Curves 


2.16.1 Procedure 
2.16.1.1 Curve-Shape Comparison 


If there are only empirical data for a function y = f(x), it is possible to get an approximate formula in 
two steps. First choose a formula for an approximation which contains free parameters. Then calculate 
the values of the parameters. If there is no theoretical description for the type of formula, then first 
choose the approximate formula which is the simplest among the possible functions, comparing their 
curves with the curve of empirical data. Estimation of similarity by eye can be deceptive. Therefore, 
after the choice of an approximate formula, and before the determination of the parameters, it is to 
check whether it is appropriate. 


2.16.1.2 Rectification 


Supposing there is a definite relation between x and y and in the chosen approximate formula two 
functions X = (x,y) and Y = y(x, y) are introduced such that a linear relation of the form 


Y=AX+B (2.254) 


holds, where A and B are constant. Calculating the corresponding X and Y values for the given x and 
y values, and considering their graphical representation, it is easy to check if they are approximately 
on a straight line, or not. Then it can be decided whether the chosen formula is appropriate. 


@ A: Ifthe approximate formulais y = 





i 
,thensubstituting X — z, Y = —, one gets Y = aX +b. 
ax +b y 


1 1 
Another possible substitution is ¥ = —, Y = —. Then Y = a + bX follows. 
x y 


W B: Using semi-logarithmic paper, 2.17.2.1, p. 116. 

W C: Using double logarithmic paper, 2.17.2.2, p. 117. 

In order to decide whether empirical data satisfy a linear relation Y = AX + B or not, one can use 
linear regression or correlation (see 16.3.4, p. 839). The reduction of a functional relationship to a linear 
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relation is called rectification. Examples of rectification of some formulas are given in 2.16.2, p. 109, 
and for an example discussed in detail, see in 2.16, p. 114. 


2.16.1.3 Determination of Parameters 

The most important and most accurate method of determining the parameters is the least squares 
method (see 16.3.4.2, p. 841). In several cases, however, even simpler methods can be used with success, 
for instance the mean value method. 


1. Mean Value Method 

The mean value method uses the linear dependence of the “rectified” variables X and Y, i.e., Y = 
AX +B as follows: The conditional equations Y; — AX;4- B for the given values Y;, X; are to be divided 
into two groups, which have the same size, or approximately the same size. By adding the equations 
in the groups one gets two equations, from which A and B can be determined. Then replacing X and 
Y by the original variables x and y again, one gets the connection between x and y, which is what one 
was looking for. 

If not all the parameters are to be determined, one can apply the mean value method again with a 
rectification by other amounts X and Y (see, e.g., 2.16.2.11, p. 113). 

Rectification and the mean value method are used above all when certain parameters occur in non-linear 
relations in an approximate formula, as for instance in (2.267b), (2.267c). 

2. Least Squares Method 

When certain parameters occur in non-linear relations in the approximation formula, the least squares 
method usually leads to a non-linear fitting problem. Their solution needs a lot of numerical calculations 
and also a good initial approximation. These approximations can be determined by the rectification 
and mean value method. 


2.16.2 Useful Empirical Formulas 

In this paragraph some of the simplest cases of empirical functional dependence are discussed, and 
the corresponding graphs are presented. Each figure shows several curves corresponding to different, 
parameter values involved in the formula. The influence of the parameters upon the forms of the curves 
is discussed in the following sections. 

For the choice of the appropriate function, usually only that part of the corresponding graph is to be 
considered, which is used for the reproduction of the empirical data. Therefore, e.g., one should not 
think that the formula y = az? + bx + c is suitable only in the case when the curve of the empirical 
data have a maximum or minimum. 


y y y 


0 x 0 x 0 x 
Figure 2.80 Figure 2.81 Figure 2.82 


2.16.2.1 Power Functions 


1. Type y — az*: 
Typical shapes of curve for different values of the exponent b 
y = az? (2.255a) 
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are shown in Fig. 2.80. The curves for different values of the exponent are also represented in Figs. 2.15, 
2.21, 2.24, 2.25 and Fig. 2.26. The functions are discussed on pages 66, 70 and 71 for the formula 
(2.44) as a parabola of order n, formula (2.45) as a reciprocal proportionality and formula (2.50) as a 
reciprocal power function. The rectification is made by taking the logarithm 

X=logr, Y=logy: Y=loga+bX. (2.255b 
2. Ίγρου -- ααὖ -- c: 
The formula 

y =ar? +c (2.256a. 
produces the same curve as in (2.255a), but it is shifted by c in the direction of y (Fig. 2.82). If b is 
given, the following rectification can be used: 


X=a2, Y=y: Y=aX+e. (2.256b 
If b is not known first c is to be determined, then the rectification can be done in accordance with 
X —logz, Y —log(y— c): Y —loga-br. (2.256c 


In order to determine a first approach of c, one can choose two arbitrary abscissae 4, x3 and a third 
one, 13 — /%1£2 as well as the corresponding ordinates 1, yo, ya, so that now 
2 
UP — Y3 ς 
Dc (2.256d 
Vi - ya — 2ys 
holds. After having determined a and b, one can get a corrected c, namely as the average of the amounts 
b 
y — ax’. 





y y 


Figure 2.83 Figure 2.84 


2.16.2.2 Exponential Functions 
1. Type y = ae”: 


The characteristic shapes of the curves of the function 

y — ae* (2.2572) 
are shown in Fig. 2.81. The discussion of the exponential function (2.54) and its graph (Fig. 2.26) is 
presented in 2.6.1, p. 72. For the rectification one takes 

X=2, Y=logy: Y =loga+bloge:X. (2.257b) 
2. Typey =ae*+c: 
The formula 

y =a +c (2.258a) 
produce the same curve as (2.257a), but it is shifted by c in the direction of y (Fig. 2.83). Begin with 
the determination of a first approach c; for c then the rectification by logarithm: 

Y =log(y—a), X=a2: Y —loga-bloge- X. (2.258b) 
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T+ 22 
2 





In order to determine c as for (2.256d) two arbitrary abscissae x1, £2 are to be chosen and x3 = 


] 2 
Y1yYy2 — Y3 
Yı + Y2 — 2y3 
and b one can get a corrected c, namely as the average of the amounts y — az. 


2.16.2.3 Quadratic Polynomial 
Possible shapes of curves of the quadratic polynomial 
y — ax? bx 4 c (2.2592) 


are shown in Fig. 2.84. For the discussion of quadratic polynomials (2.41) and their curves Fig. 2.12 
(see 2.3.2, p. 64). Usually the coefficients a, b and c are determined by the least squares method; but 
in this case also rectification is possible. Choosing an arbitrary point of data (21, y1) one rectifies 


as well as the corresponding ordinates yi, yo, ya to get c — . After having determined a 


y—uU 
Z—Zi 





X=2, Y= > Y=(b+ar,)+ax. (2.259b) 


If the given x values form an arithmetical sequence with a difference h, one rectifies 

Y=Ay, X=a: Y =(bh+ah?) + 2ahX. (2.259c) 
In both cases after the determination of a and b from the equation 

» γα» αἲ ΕὐΣδ απο (2.259d) 
cis to be calculated; n is the number of the given x values, for which the sum is calculated. 
2.16.2.4 Rational Linear Functions 


The rational linear function 








x+b 
y= ax (2.260a) 
cz 4d- d 
is discussed in 2.4 with (2.46) and graphical representation Fig. 2.17 (see p. 66). 
Choosing an arbitrary data point (24, y1) the rectification is done by the formulas 
2 -- 21 
Y =——, X=2: Y=A+BX. (2.260b) 
y—wu 


After determining the values A and B the relation can be written in the form (2.260c). Sometimes 
instead of (2.260a) the form (2.260d)is sufficient: 
g= T} x 1 


, (2.2 E -- | 
A+ Ba | me V—7ume-d 7 UT. (2.2604) 











y=y + 





1 1 T 
Then in the first case the rectification can be made by X = — and Y = — or X 2 rand Y = Z and by 
T y y 


1 
X =x and Y = — in the second case. 
y 


2.16.2.5 Square Root of a Quadratic Polynomial 
Several possible shapes of curves of the equation 

yY = ax" + ba +e (2.261) 
are shown in Fig. 2.85. The discussion of the function (2.52) and its graph Fig. 2.23 (see p. 71). 


If introducing the new variable Y = y?, the problem can be reduced to the case of the quadratic poly- 
nomial in 2.16.2.3, p. 111. 
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Figure 2.85 Figure 2.86 


2.16.2.6 General Error Curve 
The typical shapes of curves of the functions 


ba+cx? 


y = ae or logy = loga + bz loge + cz’ loge (2.262) 
are shown in Fig. 2.86. The discussion of the function with equation (2.61) and its graph Fig. 2.31 
(see p. 75). 
Introducing the new variable Y = log y, the problem is reduced to the case of the quadratic polynomial 
in 2.16.2.3, p. 111. 


2.16.2.7 Curve of Order Three, Type II 
The possible shapes of graphs of the function 

1 
ax? + ba +c 
are represented in Fig. 2.87. The discussion of the function with equation (2.48) and with graphs 
Fig. 2.19 (see p. 67). 


y= (2.263) 


: : ' 1 : ; . 
By introducing the new variable Y = —, the problem is reduced to the case of the quadratic polynomial 
y 


in 2.16.2.3, p. 111. 


| | 
0 x 
0 x 
Figure 2.87 Figure 2.88 
2.16.2.8 Curve of Order Three, Type III 
Typical shapes of curves of functions of the type 


T 


= = 2.264 
az? + bu +ce (2.264) 


2.16 Determination of Empirical Curves 113 





are represented in Fig. 2.88. The discussion of the function with equation (2.49) and with graphs 
Fig. 2.20 (see p. 68). 


Introducing the new variable Y = 7 the problem can be reduced to the case of the quadratic polynomial 
y 

in 2.16.2.3, p. 111. 

2.16.2.9. Curve of Order Three, TypeI 


Typical shapes of curves of functions of the type 
b ς 
y-act-t-3 (2.265) 
T T 


are represented in Fig. 2.89. The discussion of the function with equation (2.47) and with graphs 
Fig. 2.18 (see p. 67). 


1 

Introducing the new variable X — — the problem can be reduced to the case of the quadratic polynomial 
x 

in 2.16.2.3, p. 111. 


y y 


Figure 2.89 Figure 2.90 


2.16.2.10 Product of Power and Exponential Functions 
Typical shapes of curves of functions of the type 

y =arre™ (2.266a) 
are represented in Fig. 2.90. The discussion of the function with equation (2.62) and with graphs 
Fig. 2.31 (see p. 75). 
If the empirical values of x form an arithmetical sequence with difference h, the rectification follows in 
accordance with 

Y = Alogy, X =Alogx: Y = hcloge+bX. (2.266b) 
Here A log y and A log x denote the difference of two subsequent values of log y and log x respectively. 
If the x values form a geometric sequence with quotient q, then the rectification follows by 

X-—z, Y —Alogy: Y —blogq- c(q — 1)X log e. (2.266c) 
After b and c are determined the logarithm of the given equation is taken, and the value of log a is 
calculated like in (2.259d). 
If the given x values do not form a geometric sequence, but one can choose pairs of two values of x such 
that their quotient q is the same constant, then the rectification is the same as in the case of a geometric 
sequence of x values with the substitution Y = A; log y. Here A, log y denotes the difference of the two 
values of log y whose corresponding x values result in the constant quotient q (see 2.16.2.12, p. 114). 


2.16.2.11 Exponential Sum 
Typical shapes of curves of the exponential sum 
y = ae” + ce™ (2.267a) 
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are represented in Fig. 2.91. The discussion of the function with equation (2.60) and with graphs 
Fig. 2.29 (see p. 74). 

If the values of x form an arithmetical sequence with difference h, and y, yi, y2 are any three consecutive 
values of the given function, then the rectification is made by 





Y=, x=”. ya +e uhi un (2.267b) 
y y 
After b and d are determined, follows again a rectification by 
Y-yet, Χ-εύς. Υ..αΧ {6 (2.267c) 


2.16.2.12 Numerical Example 

Find an empirical formula to describe the relation between x and y for given values in Table 2.9. 

Choice of the Approximation Function: Comparing the graph prepared from the given data (Fig. 

2.92) with the curves discussed before, one can see that formulas (2.264) or (2.266a) with curves in 

Fig. 2.88 and Fig. 2.90 can fit the considered case. 

Determination of Parameters: Using the formula (2.264) to rectify are AŽ and x. The calculation 
1 


shows, however, the relationship between x and A- is far from linear. To verify whether the formula 
y 


(2.266a) is suitable one plots the graph of the relation between Aloga and Alogy for h = 0,1 in 
Fig. 2.93, and also between A, logy and x for g = 2 in Fig. 2.94. In both cases the points fit a 


straight line well enough, so the formula y = ax°e® can be used. 


Table 2.9 For the approximate determination of an empirically given function relation 





s| y 3 A α | Ίαν | Alex | Algy | Ailey | ger 
0.1 | 1.78 0.056 0.007 | —1.000 0.250 0.301 0.252 0.252 1.78 
0.2 | 3.18 0.063 0.031 | —0.699 0.502 0.176 +0.002 —0.097 3.15 
0.3 | 3.19 0.094 0.063 | —0.523 0.504 0.125 —0.099 —0,447 3.16 
0.4 | 2.54 0.157 0.125 | —0.398 0.405 0.097 —0.157 —0.803 2.52 
0.5 | 1.77 0.282 0.244 | —0.301 0.248 0.079 —0.191 —1.134 1.76 
0.6 | 1.14 0.526 0.488 | —0.222 0.057 0.067 —0.218 —1.455 1.14 





0.7 | 0.69} 1.014 | 0.986 | —0.155 | —0.161 | 0.058 | —0.237 = 0.70 
0.8 | 0.40 | 2.000 | 1.913 | —0.097 | —0.398 | 0.051 | —0.240 = 0.41 
0.9 | 0.23 | 3.913 | 3.78 | —0.046 | —0.638 | 0.046 | —0.248 = 0.23 
1.0 | 0.13 | 7.69 8.02 0.000 | —0.886 | 0.041 | —0.269 = 0.13 
1.1 |0.07 | 15.71 | 14.29 0.041 | —1.155 | 0.038 | —0.243 = 0.07 
1.2 | 0.04 | 30.0 = 0.079 | —1.398 = = = 0.04 






































In order to determine the constants a, b and c, a linear relation between x and A, log y is to be searched 
by the method of mean values. Adding the conditional equations A; log y = blog 2 + cx log e in groups 
of three equations each, yields 


—0.292 = 0.903b + 0.2606c, —3.392 = 0.903b + 0.6514c, 


and from here b = 1.966 and c = —7.932 holds. To determine a, the equations of the form logy = 
loga + blogz + cloge- x are to be added, which yields —2.670 = 12 log a — 6.529 — 26.87, so from 
loga = 2.561, a = 364 follows. The values of y calculated from the formula y = 364x1%%e-7-2" are 
given in the last column of Table 2.9; they are denoted by Yer as an approximation of y. The error 
sum of squares is 0.0024. 

Using the parameters determined by rectification as initial values for the iterative solution of the non- 
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Figure 2.91 Figure 2.92 





Figure 2.93 Figure 2.94 


linear least squares problem (see 19.6.2.4, p. 987) 


12 
ΣῚνι -- ααὐε΄”η]} -- min! 
il 


yields a = 396.601 986, b — 1.998 098, c = —8.000 0916 with the small error sum of squares 0.000 0916. 


2.17 Scales and Graph Paper 
2.17.1 Scales 


The base of a scale is a function y — f(x). The task is to construct a scale from this function so that 
on a curve, e.g. on a line, the function values of y are to be inserted as the values of the argument x. A 
scale can be considered as a one-dimensional representation of a table of values. 

The scale equation for the function y = f(x) is: 


y = Iļf(e) — f(2o)]. (2.268) 
The starting point of the scale is fixed at πρ. The scale factor l takes into consideration that for a con- 
crete scale there it is only one given scale length. 


E A Logarithmic Scale: For l= 10 cm and po = 1 the scale equation is 
y — 10[lg x — lg 1] = 10 lg x (in cm). For the table of values 
x |1| 2 | 3 | 4] 5 | 6 | 7 | 8 | 9 |10 
y =Iga | 0 | 0.30 | 0.48 | 0.60 | 0.70 | 0.78 | 0.85 | 0.90 | 0.95 | 1 


one gets the scale shown in Fig. 2.95. 
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Figure 2.95 


W B Slide Rule: The most important application of the logarithmic scale, from a historical view- 
point, was the slide rule. Here, for instance, multiplication and division were performed with the help 
of two identically calibrated logarithmic scales, which can be shifted along each other. 
From Fig. 2.96 one can read: y3 = yi + Ya, ie., lg v3 = lg xı +lg £2 = lg 122, hence 73 = T1: £2; Yı = 

x 
Ja — ya, LC. Ign = les — Iga — 1g 77, so aS. 

22 
E C Volume Scale on the lateral he ae of a conical shaped funnel: A scale is to be marked on the 
funnel, so that the volume inside could be read from it. The data of the funnel are: Height H = 15 cm, 
upper diameter D = 10 cm. 


1 
Taking in mind Fig. 2.97a gives the scale equation as follows: Volume V = gr mh apothem s 





r D/2 1 
Vh? - r?, tana — 5 = ΄ zug From these h — 3r, s 2 ry10, V — ( Jm follows, so the scale 
equation is s — c VI T e κ: 2.164 V. The following table of values contains the calibration marks on 


the funnel as in the figure: 


























V|0| 50 | 100 | 150 | 200 | 250 | 300 | 350 
s [01 7.96 | 10.03 | 11.48 | 12.63 | 13.61 | 14.46 | 15.22 | 
30077 
1 X Xs 10 
I ] I ] 
Y 1 X; 10 
I I ] 
2 
Ys b) 
Figure 2.96 Figure 2.97 


2.17.2 Graph Paper 


The most useful graph paper is prepared so that the axes of a right-angle coordinate system are cali- 
brated by the scale equations 


2 --ι[ϑ(ω) -- σ(υο)]. w-la[f(v) — f(vo)]. (2.269) 
Here lı and lz are the scale factors; uo and vo are the initial points of the scale. 
2.17.2.1 Semilogarithmic Paper 
If the x-axis has an equidistant subdivision, and the y-axis has a logarithmic one, then one talks about 
semilogarithmic paper or about a semilogarithmic coordinate system. 
Scale Equations: 

x = lju — uo] (linear scale), y = lllgv — lgvo] (logarithmic scale). (2.270) 
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The Fig. 2.98 shows an example of semilogarithmic paper. 


Representation of Exponential Functions: On semilogarithmic paper the graph of the exponential 
function 

y =ae** (o, B const) (2.271a) 
is a straight line (see rectification in 2.16.2.2, p. 110). This property can be used in the following way: 
If the measuring points, introduced on semilogarithmic paper, lie approximately on a line, it can be 
supposed a relation between the variables as in (2.271a). With this line, estimated by eye, one can 
determine the approximate values of a and 8: Considering two points P; (x1, y1) and P»(x2, y) from 
this line one gets 
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2.17.2.2 Double Logarithmic Paper 
If both axes of aright-angle x, y coordinate system are calibrated with respect to the logarithm function, 
then one talks about double logarithmic paper or log-log paper or a double logarithmic coordinate system. 
Scale Equations: The scale equations are 

x = hflg u — lg uo], y — la[lg v — lg vol, (2.272) 
where {1 5 are the scale factors and ug, vo are the initial points. 
Representation of Power Functions (see 2.5.3, p. 71): Log-log paper has a similar arrangement 
to semilogarithmic paper, but the x-axis also has a logarithmic subdivision. In this coordinate system 
the graph of the power function 

y aa^ (a, const) (2.273) 
is a straight line (see rectification of a power function in 2.16.2.1, p. 109). This property can be used in 
the same way as in the case of semilogarithmic paper. 
2.17.2.3 Graph Paper with a Reciprocal Scale 
The subdivisions of the scales on the coordinate axes follow from (2.45) for the function of inverse 
proportionality (see 2.4.1, p. 66). 


a a 

Scale Equations: x = liju — uo], y= l2 | - =] (a const), (2.274) 
U U9 

where l; and lz are the scale factors, and αρ, vo are the starting points. 


W Concentration in à Chemical Reaction: For a chemical reaction, the concentration denoted by 
c = c(t), has been measured as a function of the time t, giving the following results for c: 
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t/min | 5 | 10 | 20 | 40 
c- 107 /mol/I | 15.53 | 11.26 [7.27 | 4.25 





It can be supposed that the reaction is of second order, i.e., the relation should be 


aa Ὁ ee 
c(t) = Irak (co, k const). (2.275) 


1 1 
Taking the reciprocal value of both sides, one gets = = — + kt, i.e., (2.275) can be represented as a 
€ © 
line, if the corresponding graph paper has a reciprocal subdivision on the y-axis and a linear one on the 
1 
x-axis. The scale equation for the y-axis is, e.g., y = 10: -- cm. 
υ 
It is obvious from the corresponding Fig. 2.99 that the measuring points lie approximately on a line, 
i.e., the supposed relation (2.275) is acceptable. 
From these points the approximate values of both parameters k (reaction rate) and co (initial concen- 
tration) can be determined. Choosing two points, e.g., P, (10, 10) and P3(30, 5), one gets: 
1/ει --1/ο 
— dad 


2.17.2.4 Remark 

There are several other possibilities for constructing and using graph papers. Although today in most 
cases there are high-capacity computers to analyze empirical data and measurement results, in every- 
day laboratory practice, when getting only a few data, graph papers are used quite often to show the 
functional relations and approximate parameter values needed as initial data for applied numerical 
methods (see the non-linear least squares method in 19.6.2.4, p. 987). 


k & 0.005, co =~ 20 - 107. 


2.18 Functions of Several Variables 
2.18.1 Definition and Representation 


2.18.1.1 Representation of Functions of Several Variables 
A variable value u is called a function of n independent variables τι, 29, ... v4, if for given values of the 
independent variables, u is a uniquely defined value. Depending on how many variables there are, two, 
three, or n, one writes 

u-—f(rzy), w= fyz) U= f (Etga): (2.276) 
Substituting given numbers for the n independent variables yields a value system of the variables, which 
can be considered as a point of the n-dimensional space. The single independent variables are called 
arguments; sometimes the entire n tuple together is called the argument of the function. 
Examples of Values of Functions: 
E A: u= f(x,y) = xy? has for the value system z = 2, y = 3 the value f(2,3) = 2-3? = 18. 
WB D: u — f(x,y,z,t) = x ln(y — zt) takes for the value system x = 3, y = 4, z = 3, t = 1 the value 
f(3,4,3,1) = 3ln(4— 3- 1) = 0. 


2.18.1.2 Geometric Representation of Functions of Several Variables 


1. Representation of the Value System of the Variables 

The value system of an argument of two variables x and y can be represented as a point of the plane 
given by Cartesian coordinates x and y. A value system of three variables x, y, z corresponds to a point 
given by the coordinates z, y, z in a three-dimensional Cartesian coordinate system. Systems of four 
or more coordinates cannot be represented obviously in our three-dimensional imagination. 


Similarly to the three-dimensional case the system of n variables £1, £2,. . . r4 is to be considered as a 
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point of the n-dimensional space given by Cartesian coordinates £1, %2,...,Un. In the above example 
B, the four variables define a point in four-dimensional space, with coordinates x = 3, y = 4,2 = 3 
and t — 1. 


u 2. Representation of the Function u — f(x,y) of Two 
Variables 
a) A function of two independent variables can be represented by a sur- 
0 face in three-dimensional space, similarly to the graph representation 
of functions of one variable (Fig. 2.100, see also 3.6.3, p. 261). Consid- 
ering the values of the independent variables of the domain as the first 
y two coordinates, and the value of the function u — f(x, y) as the third 
coordinate of a point in a Cartesian coordinate system, these points 
Figure 2.100 form a surface in three-dimensional space. 





Examples of Surfaces of Functions: 
MA: u=1- i — i represents a plane (Fig. 2.101a, see also 3.5.3.10, p. 218). 


2 2 
E B: u= = + ^ represents an elliptic paraboloid (Fig. 2.101b, see also 3.5.3.13, 5., p. 226). 

E C: u= 16 — 2? — y? represents a hemisphere with r — 4 (Fig. 2.101c). 

b) The shape of the surface of the function u = f(x,y) can be pictured with the help of intersection 
curves, which can be get by intersecting the surface parallel to the coordinate planes. The intersection 
curves u = const are called level curves. 

Β In Fig. 2.101b,c the level curves are ellipses or concentric circles (not denoted in the figure). 
Remark: A function with an argument of three or more variables cannot be represented in three- 
dimensional space. Similarly to surfaces in three-dimensional space also the notion of a hyper surface 
in n-dimensional space is in use. 





Figure 2.101 


2.18.2 Different Domains in the Plane 
2.18.2.1 Domain of a Function 


The domain of definition of a function (or domain of a function) is the set of the system of values or points 
which can be taken by the variables of the argument of the function. The domains defined in this way 
can be very different. Mostly they are bounded or unbounded connected sets of points. Depending on 
whether the boundary belongs to the domain or not, the domain is closed or open. An open, connected 
set of points is called a domain. If the boundary belongs to the domain, it is called a closed domain, if 
it does not, sometimes it is called an open domain. 


2.18.2.2 Two-Dimensional Domains 
Fig. 2.102 shows the simplest cases of connected sets of points of two variables and their notation. Do- 
mains are represented here as the shaded part; closed domains, i.e., domains whose boundary belongs to 
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them, are bounded by thick curves in the figures; open domains are bounded by dotted curves. Includ- 
ing the entire plane there are only simply connected domains or simply connected regions in Fig. 2.102. 


2.18.2.3 Three or Multidimensional Domains 

These are handled similarly to the two-dimensional case. It concerns also the distinction between sim- 
ply and multiply connected domains. Functions of more than three variables will be geometrically 
represented in the corresponding n-dimensional space. 


unbounded y 
closed 
domain 


unbounded 
open domain 











bounded 
open domain 


Figure 2.102 


2.18.2.4 Methods to Determine a Function 


1. Definition by Table of Values Functions of several variables can be defined by a table of values. 
An example of functions of two independent variables are the tables of values of elliptic integrals (see 
21.9, p. 1103). The values of the independent variables are denoted on the top and on the left-hand 
side of the table. The required value of the function is in the intersection of the corresponding row and 
column. It is called a table with double entry. 
2. Definition by Formulas Functions of several variables can be defined by one or more formulas. 

x+y fora >0, y>0, 

r—yforr20,gy«0, 

^^y forrc«0,9y20, 
y forr«0,gy«0. 

3. Domain of a Function Given by One Formula In the analysis mostly such functions are 
considered which are defined by formulas. Here the union of all value systems for which the analytical 
expression has a meaning is considered to be the domain, i.e., for which the expression has a unique, 
finite, real value. 
Examples for Domains: 
α A: u — 2? - y?: The domain is the entire plane. 


1 : ; ae ; i 
BB: u = ——: The domain consists of all value systems x,y, satisfying the inequality 


/A16 — x? — 3? 


z? 4- y? « 16. Geometrically this domain is the interior of the circle in Fig. 2.103a, an open domain. 





A: u= xy. ο. 
B: u — z In(y — zt). NS E a 
=T 








E C: u= arcsin(x +y): The domain consists of all value systems x, y, satisfying the inequality —1 < 
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x+y < +1, i.e., the domain of the function is a closed domain, the stripe between the two parallel lines 
in Fig. 2.103b. 

E D: u = arcsin(2x — 1) + V1 — y? + \V/y + Inz: The domain consists of the value system z, y, z, 
satisfying the inequalities 0 < x < 1,0 < y < 1, z > 0, i.e., it consists of all points of the three 
dimensional x, y, z space lying above a square with side-length 1 shown in Fig. 2.103c. 
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Figure 2.105 


y 
If from the interior of the considered part of the plane a point or a 
bounded, simply connected point set is missing, as shown in Fig. 2.104, 
it is called a doubly-connected domain or doubly-connected region. Mul- 
tiply connected domains are represented in Fig. 2.105. A non-connected 
6 > region is shown in Fig. 2.106. 


Figure 2.106 


2.18.2.5 Various Forms for the Analytical Representation of a Function 


Functions of several variables can be defined in different ways, just as functions of one variable. 
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1. Explicit Representation 
A function is given or defined in an explicit way if its value (the dependent variable) can be expressed 
by the independent variables: 

u — f(zi,23,..., 24). (2.277) 


2. Implicit Representation 
A function is given or defined in an implicit way if the relation between its value and the independent 
variables is given in the form: 

F(ri,22,...,24,u) — 0, (2.278) 
if there is a unique value of u satisfying this equality. 
3. Parametric Representation 
A function is given in parametric form if the n arguments and the function are defined by n new vari- 
ables, the parameters, in an explicit way, supposing there is a one-to-one correspondence between the 
parameters and the arguments. For a two-variable function, for instance 


αφ), y-—w(rns) uc-x(rs), (2.27792) 
and for a three-variable function 
s-9(nst, y-9(nst, z-x(nst, u-k(nst) (2.2795) 


etc. 


4. Homogeneous Functions 


A function f (21,32, ..., x4) of several variables is called a homogeneous function if the relation 
f (Az, Az2,... , Am) — A" f(mi,zo,. .., v4) (2.280) 
holds for arbitrary A. The number m is the degree of homogeneity. 


T3 
zy + L , the degree of homogeneity is m = 2. 
y 





Wb A: Foru(z,y) —a?—3zy y? x 


2 2 
ο ος , the degree of homogeneity ism = 0. 
2x — 3y $ 


2.18.2.6 Dependence of Functions 
1. Special Case of Two Functions 


Two functions of two variables u = f(x,y) and v = g(x,y), with the same domain G, are called 
dependent functions if one of them can be expressed as a function of the other one u — F(v). For every 
point of the domain G of the functions the identity 

f(y) = F(e(v,y) or 4(f,o)—0 (2.281) 
holds. If there is no such function F'(p) or 8(f, p) , one speaks about independent functions. 
Bl u(z,y) — (a? 4- y?)?, v — Va? 4 y? are defined in the domain x? + y? > 0, i.e., in the whole x, y 
plain, and they are dependent, because u — v* holds. 


WB DB: Foru(r,y) — 


2. General Case of Several Functions 
Similarly to the case of two functions, the m functions 14, u2,. . . , Um of n variables 21, 15,. .. , x, in their 
common domain G are called dependent if one of them can be expressed as a function of the others, 
i.e., if for every point of the domain G the identity 

ui = f (u1, U2, .. . , Ui—1, Ui+l;---; Um) Or (u1, U2, ..., Um) =0 (2.282) 
is valid. If there is no such functional relationship, they are independent functions. 
E The functions u = £1 + £2 +-+ En, V = T1? + £2 +- + Tn? and w = T182 + T183 +: + 3134 + 
1333 d- * * - 7, 11,4 are dependent because v — u? — 2w holds. 


2.18 Functions of Several Variables 125 





3. Analytical Conditions for Independence 
Suppose every partial derivative mentioned below exists. Two functions u — f(x, y) and 




















v — q(r,y) are independent on a 
Of Of domain if their functional determi- 
De Ar Dlu.v nant or Jacobian determinant is not 
o ps , short A Φ) or = - ) (2.283a) identically zero here. Analogously, 
pee ie (x,y) (x,y) in the case of n functions of n vari- 
Or Oy ables u4 — fi(xi,...,24),..., Un = 
fasces): 
Of Of of 
Or, δα | Oan If the number m of the func- 
ὃ ὃ ὃ tions uj, u2, ..., uj is smaller 
Of, Of ed ofa D fay faye ees Fa) than the number of variables 
Ox, Ox Ox, | z —— — ———7- 7 0.(2.283b) ; 
πα 2 U^ 7 Dn aa,..- T4) ad 11, 12, ..., Xn, these functions 
: : ; : : are independent if at least one 
Of, Of, Of, subdeterminant of order m of 
Or, Ox ^ Óm, the matrix (2.283c) is not iden- 
tically zero: 
Ou; Qui Qui The number of independent func- 
αι δι  δϑε tions is equal to the rank r of the ma- 
; Ln : 
ðuz ðw P» trix (2.283c) (see 4.1.4, 7., p. 274). 
ο ος πρ T Here these functions are indepen- 
Ti v2 Tn |. (2.283c) dent, whose derivatives are the ele- 
: : : : ments of the non-vanishing determi- 
OUm OUm Ou, nant of order r. If m > n holds, then 
Ox, Ot ^" m, among the given m functions at most 


n can be independent. 


2.18.3 Limits 
2.18.3.1 Definition 


A function of two variables u = f(x,y) has a limit A at x — a, y = b if when x and y are arbitrarily 
close to a and b, respectively, then the value of the function f(x,y) approaches arbitrarily closely the 
value A. Then one writes: 


lim f(x,y) =A. (2.284) 


yb 


The function may not be defined at (a, b), or if it is defined here, may not have the value A. 


2.18.3.2 Exact Definition 
A function of two variables u = f(x,y) has at the point (a,b) a limit 
A= lim f(x,y) (2.285a) 


yb 


if for arbitrary positive € there is a positive r) such that (see Fig. 2.107) 





|f(z,.y) - AJ «s (2.285b) 
*" X holds for every point (x, y) of the square 
Figure 2.107 lr-a|«m |y-t «m. (2.285c) 





2.18.3.3 Generalization for Several Variables 
a) The notion of limit of a function of several variables can be defined analogously to the case of two 
variables. 
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b) The criteria for the existence of a limit of a function of several variables can be obtained by gener- 

alization of the criterion for functions of one variable, i.e., by reducing to the limit of a sequence or by 

applying the Cauchy condition for convergence (see 2.1.4.3, p. 53). 

2.18.3.4 Iterated Limit 

If for a function of two variables f(x, y) first the limit for x — a has been determined, i.e., lim f(x,y) 
z—a 


for constant y, then for the function obtained, which is now a function only of y, one determines the 
limit for y — b, then the resulting number 


B = lim {7 f(a, y) (2.2862) 
y—b Nr—a 
is called an iterated limit. Changing the order of calculations generally yields an other limit: 
C = lim (im f(e, y) : (2.286b) 
r—a Ny—b 


In general B ¥ C holds, even if both limits exist. 
a? —j? p a a? 


E For the function f(x,y) = 2l for x + 0, y > 0 one gets the iterated limits B = —1 
X Vy 
and C = +1. 
Remark: If the function f (v, y) has a limit A = lim f(x, y), and both B and C exist, then B = C = A 
yb 


is valid. The existence of B and C does not follow from the existence of A. From the equality of the 
limits B — C the existence of the limit A docs not follow. 


2.18.4 Continuity 
A function of two variables f(a, y) is continuous at r — a, y — b, i.e., at the point (a, b), if 1. the point 
(a, b) belongs to the domain of the function and 2. the limit for z — a, y — b exists and is equal to the 
value, i.e., 
lim f(x,y) = f(a,b). (2.287) 
yb 
Otherwise the function has a discontinuity at x — a, y — b. If a function is defined and continuous at 
every point of a connected domain, it is called continuous on this domain. Similarly can be defined the 
continuity of functions of more than two variables. 


2.18.5 Properties of Continuous Functions 


2.18.5.1 Theorem on Zeros of Bolzano 

If a function f(x, y) is defined and continuous in a connected domain, and at two points (z1, y1) and 
(x2, ye) of this domain the values have different signs, then there exists at least one point (3, y3) in this 
domain such that f(x, y) is equal to zero there: 


f(xa.ys) — 0, if f(zı,yı)>0 and f(a2,y2) <0. (2.288) 
2.18.5.2 Intermediate Value Theorem 


If a function f(x, y) is defined and continuous in a connected domain, and at two points (z1, y1) and 
(12, y) it has different values A = f (x1, y1) and B — f(x», y»), then for an arbitrary value C between 
A and B there is at least one point (x3, y3) such that: 

f(z3,9)- C, A«C«B or B«C«A. (2.289) 


2.18.5.3 Theorem About the Boundedness of a Function 


If a function f(x,y) is continuous on a bounded and closed domain, it is bounded in this domain, i.e., 
there are two numbers m and M such that for every point (x, y) in this domain: 


m < f(x,y) < M. (2.290) 
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2.18.5.4 Weierstrass Theorem (About the Existence of Maximum and 
Minimum) 

If a function f(x, y) is continuous on a bounded and closed domain, then it takes its maximum and 

minimum here, i.e., there is at least one point (2^, y") such that all the values f(x, y) in this domain are 

less than or equal to the value f(x’, y’), and there is at least one point (x", y") such that all the values 

f(x,y) in this domain are greater than or equal to f(x", y"): For any point (x, y) of this domain 


Fx y) 2 f(x) 2 f(x" y") (2.291) 


is valid. 
2.19 Nomography 
2.19.1 Nomograms 


Nomograms are graphical representations of a functional correspondence between three or more vari- 
ables. From the nomogram, the corresponding values of the variables of a given formula — the key 
formula — in a given domain of the variables can be immediately read directly. Important examples of 
nomograms are net charts and alignment charts. 

Nomograms are still used in laboratories, even in the computer age, for instance to get approximate 
values or starting guesses for iterations. 


2.19.2 Net Charts 
To represent a correspondence between the variables x, y, z given by the equation 

F(x,y,z) 20 (2.292) 
(or in many cases explicitly by z = f(x, y)), the variables can be considered as coordinates in space. 
The equation (2.292) defines a surface which can be visualized on two-dimensional paper by its level 
curves (see 2.18.1.2, p. 118). Here, a family of curves is assigned to each variable. These curves form a 
net: The variables x and y are represented by lines parallel to the axis, the variable z is represented by 
the family of level curves. 
H Ohm’s law is U = R- I. The voltage U can be represented by its level curves depending on two 
variables. If R and J are chosen as Cartesian coordinates, then the equation U = const for every con- 
stant corresponds to a hyperbola (Fig. 2.108). By looking at the figure one can tell the corresponding 
value of U for every pair of values R and J, and also J corresponding to every R, U, and also R corre- 
sponding to every J and U. Of course, the investigation is always to be restricted to the domain which 
is interpreted: In Fig. 2.108 holds 0 < R<10,0<I1<10and0<U < 100. 


Remarks: 
1. By changing the calibration, the nomogram can be used for other domains. If, e.g., in (Fig. 2.108) 
the domain 0 < J < 1 should be represented but R should remain the same, then the hyperbolas of U 
are to be marked by U/10. 
2. By application of scales (see 2.17.1, p. 115) it is possible to transform nomograms with complicated 
curves into straight-line nomograms. Using uniform scales on the x and y axis, every equation of the 
form 

αφί2) + yp(z) + x(z) = 0 (2.293) 
can be represented by a nomogram consisting of straight lines. If function scales x = f(z.) and y = 
g(z2) are used, the equation of the form 

F(z2)p(21) + gl(22)Y(21) + x(21) = 0 (2.294) 
has a representation for the variables 21, 2ο and z3 as two families of curves parallel to the axis and an 
arbitrary family of straight lines. 
W By applying a logarithmic scale (see 2.17.1, p. 115), Ohm's law can be represented by a straight-line 
nomogram. Taking the logarithm of R- I = U gives log R + log I = logU . Substituting x — log R 
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and y = log/ results in « + y = logU, i.e., a special form of (2.294). The corresponding nomogram is 
shown in Fig. 2.109. 


2.19.3 Alignment Charts 


A graphical representation of a relation between three variables 21, 22 and z3 can be given by assigning 
a scale (see 2.17.1, p. 115) to each variable. The z; scale has the equation 

Tti = pilzi) yi = Yilzi) (i-1,2,3). (2.295) 
The functions y; and y; are chosen in such a manner that the values of the three variables 21, z2 and 
z3 satisfying the nomogram equation should lie on a straight line. To satisfy this condition, the area of 
the triangle, given by the points (x1, y1), (v2, ye) and (3, y3) , must be zero (see (3.301), p. 195), i.e., 


zı yı 1 φι(σι) φιίσι) 1 
za ya 1| = | alza) alza) 1| =0 (2.296) 
T3 Us l t23(23) Va(23) 1 














must hold. Every relation between three variables z;, 29 and 23, which can be transformed into the 
form (2.296), can be represented by an alignment nomogram . 
Next, follows the description of some important special cases of (2.296). 


2.19.3.1 Alignment Charts with ThreeStraight-Line Scales Through a Point 


If the zero point is chosen for the common point of the lines having the three scales z1, z2 or z3 , then 
(2.296) has the form 


£103) mii(a) 1 
q2(22) maqx(22) 1 
t3(23) mata(23) 1 
since the equation of a line passing through the origin has the equation y = ma. Evaluating the 
determinant (2.297), yields 


= hi: (2.297) 








m3—ma ma—m;,  m;-— ma 


c "ee wee. (2.2982) 
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or 
Οι C2 C3 

φι(ι) «φα(α) «Φα(ζα) 

Here C, C5 and C3 are constants. 











=0 with σι + C3 + C3 7 (3.298) 


T. 4^ 2 
E The equation = + "md f is a special case of (2.298b) and it is an important relation, for instance in 
α 


optics or for the parallel connection of resistances. The corresponding alignment nomogram consists 
of three uniformly scaled lines. 


2.19.3.2 πο. Charts with Two Parallel Inclined Straight-Line Scales 
and One Inclined Straight-Line Scale 


One of the scales is put on the y-axis, the other one on another line parallel to it at a distance d. The 
third scale is put on a line y = ma. In this case (2.296) has the form 





0 ψι(σι) 1 
p3(z3) mps(z3) 1 





Evaluation of the determinant by the first column yields 
d (mip3(z3) — W1(21)) + y3(23) (Y1 (21) — Y2(z2)) = 0. (2.300a) 
Consequently: 
z3) —d 
φι(αι) t — (Y2(22) — md) =0 oder f(z1) - g(z3) — h(z2) — 0. (2.300b) 
3(23 
It is often useful to introduce measure scales E, and E; of the form 


Exfle) gale) — Egh(z2) = 0. (2.300c) 


d 
Then, 3(z3) = t—z——_ holds. The relation E : EF; can be chosen so that the third scale is pulled 
2 
1- —g(z 
BE! 3) 
near a certain point or it is gathered. Substituting m = 0, then E2h(z2) = w2(z2) and in this case, 
the line of the third scale passes through both the starting points of the first and of the second scale. 
Consequently, these two scales must be placed with a scale division in opposite directions, while the 
third one will be between them. 
The relation between the Cartesian coordinates x and y of a point in the x, y plane and the corre- 
sponding angle ¢ in polar coordinates is: 
y? = tan? o. (2.301) 
The corresponding nomogram is shown in Fig. 2.110. The scale division is the same for the scales of x 
and y but they are oriented in opposite directions. In order to get a better intersection with the third 
scale between them, their initial points are shifted by a suitable amount. The intersection points of the 
third scale with the first or with the second one are marked by y = 0 or y = 90° respectively. 
W For instance: x — 3 y = 3.5, gives p © 49.5°. 
2.19.3.3 Alignment Charts with Two Parallel Straight Lines anda 
Curved Scale 


If one of the straight-line scales is placed on the y-axis and the other one is placed at a distance d from 
it, then equation (2.296) has the form 





0 Va) 1 
d polza) 1|= 0. (2.302) 
Φα(5α) Ψα(24) 1 
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Consequently: 
ah (on) also (p3(23) ψη(ζα) 
hn) | vela) T palea) CEA E 


Choosing the scale E for the first scale and E» for the second one, then (2.303a) is transformed into 


(2.303a) 


E 
Ey f(a) + Fxg (22) F ha) + Eyk(z3) = 0, (2.303b) 
where ψι(2ι) = Ei f(z1), Y2(22) = Eag(22) and 
LE, h(z E ES k(2: 
p3(z3) = NC and w»(z3) = LES (s) (2.303c) 


- δι Fjh(za) "MBs FAh(za) 


holds. 

HM The reduced third-degree equation 2° + p*z + q* — 0 (see 1.6.2.3, p. 40) is of the form (2.303b). 
After the substitutions E1 — E» = 1 and f(z) = q@*, g(z2) = p*, h(z3) = z, the formulas to calculate 
d:z 2 

τες and y -—ws(z)— Do 

In Fig. 2.111 the curved scale is shown only for positive values of z. The negative values one gets by 
replacing z by —z and by the determination of the positive roots from the equation 2? 4- p*z — q* — 0. 
The complex roots u 4- iv can also be determined by nomograms. Denoting the real root, which always 
exists, by z1, then the real part of the complex root is u — —24/2, and the imaginary part v can be 





the coordinates of the curved scale are r — qa(z) = 


2 


. . 3 
determined from the equation 3u? — v? + p* = 1a — v* -p*-—0. 


W For instance: y? -- 2y — 5 — 0, i.e., p* — 2,q* — —5. One reads z & 1.3. 


2.19.4 Net Charts for More Than Three Variables 


To construct a chart for formulas containing more than three variables, the expression is to decompose 
by the help of auxiliary variables into several formulas, each containing only three variables. Here, every 
auxiliary variable must be contained in exactly two of the new equations. Each of these equations is to 
be represented by an alignment chart so that the common auxiliary variable has the same scale. 


3 Geometry 


3.1 Plane Geometry 


3.1.1 Basic Notations 
3.1.1.1 Point, Line, Ray, Segment 
1. Point and Line 


Points and straight lines are not defined in today’s mathematics. The relations between them are de- 
termined only by axioms. The line is graphically imaginable as a trace of a point moving in a plane 
along the shortest route between two different points without changing its direction. 
A point is the intersection of two lines. 
2. Closed Half-Line or Ray, and Segment 
A ray is the set of points of a line which are exactly on one side of a given point O, including this point O. 
A ray is imaginable as the trace of a point which starts at O and moves along the line without changing 
its direction, like a beam of light after its emission until it is not led out of its way. 
A segment AB is the set of points of a line lying between two given points A and B of this line, including 
the points A and B. The segment is the shortest connection between the two points A and B in a 
— 

plane. The direction class of a segment is denoted by an arrowhead AB, or its direction starts at the 
first mentioned point A, and ends at the second B. 
3. Parallel and Orthogonal Lines 
Parallel lines run in the same direction; they have no common points, i.e., they do not move off and do 
not approach each other, and they do not have any intersection point. The parallelism of two lines g 
and g' is denoted by g||g' . 
Orthogonal lines form a right angle at their intersection, i.e., they are perpendicular to each other. 
Orthogonality and parallelism are mutual positions of two lines. 
3.1.1.2 Angle 

b 1. Notion of Angle 


B An angle is defined by two rays a and b starting at the same point S, so they can 
be transformed into each other by a rotation (Fig. 3.1). If A is a point on the 
5 a ray a and B is on the ray b, then the angle in the direction given in Fig. 3.1 is 
A denoted by the symbols (a, b) or by 4: AS B, or by a Greek letter. The point S is 
Figure 3.1 called the verter, the rays a and b are called sides or legs of the angle. 


In mathematics, an angle is called positive or negative depending on the rotation being counterclock- 
wise or clockwise respectively. It is important to distinguish the angle X AS B from the angle X: BS A. 
Actually, x ASB = — X BSA (0 € X ASB x 180?) and X ASB — 360? — X BSA (180? € X ASB «€ 
360?) holds. 

Remark: In geodesy the positive direction of rotation is defined by the clockwise direction (see 3.2.2.1, 
p. 144). 

2. Names of Angles 

Angles have different names according to the different positions of their legs. The names given in Ta- 
ble 3.1 are used for angles a in the interval 0 € a < 360° (see also Fig. 3.2). 
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Table 3.1 Names of angles in degree and radian measure 
Names of angles| Degree | Radian |Names of angles Degree Radian | 
round (full) angle [a° = 360°] a=27 [right angle a? = 90° a=r/2 | 
convex angle a° > 180° |m < a < 2rllacute angle 0° < a° < 90° [0° < a < 7/2] 
straight angle a = 180° απ obtuse angle 90° <a < 180°) 7/2<a<7 | 
ct DN δν Z5 gua (> 
acute angle right obtuse straight convex round (full) 
angle angle angle angle angle 


Figure 3.2 
3.1.1.3 Angle Between Two Intersecting Lines 


At the intersection point of two lines g1, g2 there are four angles a, 3, y, 6 (Fig. 3.3). There are to be 
distinguished adjacent angles, vertex angles, complementary angles, and supplementary angles. 


1. Adjacent Angles are neighboring angles at the intersection point 

of two lines with a common vertex S, and with a common leg; both non- 

common legs are on the same line, they are rays starting from S but in gı 
opposite directions, so adjacent angles sum to 180°. 


W In Fig. 3.3 the pairs (a, 8), (8,7), (y, ð) and (a, 5) are adjacent angles. ῥα 

2. Vertex Angles are the angles at the intersection point of two lines, οσο & 
opposite to each other, having the same vertex S but no common leg, and 

being equal. They sum to 180° by the same adjacent angle. Figure 3.3 


E In Fig. 3.3 (a, y) and (4,6) are vertex angles. 


3. Complementary Angles are two angles summing to 90°. 
4. Supplementary Angles are two angles summing to 180°. 
Mi In Fig. 3.3 the pairs of angles (a, B) or (7,6) are supplementary angles. 


3.1.1.4 Pairs of Angles with Intersecting Parallels 

Intersecting two parallel lines p;, pa by a third one g yields eight angles 
(Fig. 3.4). Besides the adjacent and vertex angles with the same vertex 
S , alternate angles, opposite angles, and corresponding (exterior-interior) 
angles are to be distinguished with different vertices. Figure 3.4 





1. Alternate Angles have the same size. They are on the opposite sides of the intersecting line g 
and of the parallel lines p1, po. The legs of alternate angles are in pairs oppositely oriented. 

W In Fig.3.4, e.g., (a1, 52), (£1, 02), (91, 02); (01, £2) are alternate angles. 

2. Corresponding or Exterior-Interior Angles have the same size. They are on the same sides 
of the intersecting line g and of the parallel lines p;, po. The legs of corresponding angles are oriented 
in pairs in the same direction. 

Hf In Fig. 3.4 the pairs of angles (a1, 2), (41, 8), (^1, 2), and (01,62) are corresponding angles. 

3. Opposite Angles are on the same side of the intersecting line g but on different sides of the 
parallel lines p1, pa. They sum to 180°. One pair of legs has the same orientation, the other one is 
oppositely oriented. 

E In Fig. 3.4, e.g., the pairs of angles (01,02), (£1, 72), (V1, 82), and (01, 03) are opposite angles. 
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3.1.1.5 Angles Measured in Degrees and in Radians 

In geometry, the measurement of angles is based on the division of the full angle into 360 equal parts 
or 360° (degrees). This is called measure in degrees. Further division of degrees is not a decimal one, 
it is sexagesimal: 1° = 60’ (minute, or sexagesimal minute), 1’ = 60” (second, or sexagesimal second). 
For grade measure see 3.2.2.2, p. 146 and the remark below. 

Besides measure in degrees the radian measure is used to define the size of an angle. The size of the 








central angle o of an arbitrary circle (Fig. 3.5a) is given as the ratio of the corresponding arc-length | 
and the radius of the circle r: 
a= T (3.1) | 1 rad = 57° 17' 44.8" = 57.2958°, 
T 1° = 0.017453 rad, 
The unit of radian measure is the radian (rad), i-e., the central | 1’ = 0.000291 rad, 
angle belonging to an arc with arclength / equal to the radius | 1" — — 0.000005 rad. 








r. In the table you will find approximate conversion values. 
If the measure of the angle is o? in degrees and a in radian measure, then for conversion 
o o 
α α π . 180 
a° = pa = 180°-, αξ --- o? with o= ; (3.2) 
T T 


o 180 


holds. In particular: 360° = 2m, 180?— m, 90° = 7/2, 270° = 37/2, etc. Formulas (3.2) refer to 
decimal fractions and the following examples show how to make calculations with minutes and seconds. 





W A: Conversion of an angle given in degrees into radian measure: 
52^ 37' 23" = 52 - 0.017453 + 37 - 0.000291 + 23 - 0.000005 — 0.918447 rad. 
W B: Conversion of an angle given in radians into degrees: 
5.645 rad = 323 - 0.017453 + 26 - 0.000291 + 5 - 0.000005 = 323° 26’ 05”. 
The result is getting from: 
5.645:0.017453 = 323 + 0.007611 
0.007611 : 0.000291 = 26 + 0.000025 
0.000025 : 0.000005 = 5. 
The notation rad is usually omitted if it is obvious from the text that the number refers to the radian 
measure of an angle (see also p. 1055). 
Remark: In geodesy a full angle is divided into 400 equal parts, called grades. This is called measure 
in grades. A right angle is 100 gon. A gon is divided into 1000 mgon. 
On calculators the notation DEG is used for degree, GRAD for grade, and RAD for radian. For con- 
version between the different measures see Table 3.5, p. 146. 


3.1.2 Geometrical Definition of Circular and Hyperbolic 
Functions 


3.1.2.1 Definition of Circular or Trigonometric Functions 


1. Definition by the Unit Circle 
The trigonometric functions of an an- 
gle a are defined for both the unit circle 
with radius R = 1 and the acute angles 
of a right-angled triangle (Fig. 3.5a,b) 
with the help of the adjacent side b, op- 
posite side a, and hypotenuse c. In the 
unit circle the measurement of an angle 
is made between a fixed radius O A (with 
length 1) and a moving radius OC coun- 
terclockwise (positive direction): Figure 3.5 
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: è pa 4 s D 
sine : sina = BC =~, (3.3) cosine : cosa=OB=-, (3.4) 
] c 
a -— b 
tangent: tana = AD = b (3.5) cotangent: cota = EF = +, (3.6) 
a 
secant : seca = OD = ; j (3.7) cosecant: csca=OF =<. (3.8) 
a 


2. Signs of Trigonometric Functions =, 
Depending in which quadrant of the unit circle (Fig. 3.5a) the moving radius OC is, the functions 
have well-defined signs which can be taken from Table 2.2 (see p. 79). 


3. Definition of Trigonometric Functions by Area of Circular Sectors 
The functions sin a, cos a, tan a, cot a are de- 
fined by the segments BC, OB, AD of the 
unit circle with R = 1 (Fig. 3.6), where the 
argument is the central angle a = + AOC. 
For this definition one could use also the area 
x of the sector COK, which is denoted in 
Fig. 3.6 by the shaded area. With the cen- 
tral angle 2a measured in radians, one gets 











1 
oS 520 = a for the area with R = 1. 


Therefore, one has the same equations for 
sing = BC, cosx = OB, tanx = AD as 
in (3.3, 3.4, 3.5). Figure 3.6 Figure 3.7 


3.1.2.2 Definitions of the Hyperbolic Functions 

In analogy with the definition of the trigonometric functions in (3.3), (3.4), (3.5) now instead of the 
area of a sector of the unit circle with the equation x? +y? = 1 here the corresponding area of the sector 
of the hyperbola is used with the equation z? — y? — 1 (only the right branch in Fig. 3.7). Denoting 
by x the area of COK, the shaded area in Fig. 3.7, the defining equations of the hyperbolic functions 
are: 














sinha = BC, (3.9) cosha = OB, (3.10) tanha = AD. (3.11) 
Calculating the area x by integration and expressing the results in terms of BC, OB, and AD yields 
— Á = 1, 1+AD 
z= In(BC + VBC’ +1) =In(OB + YOR? 1) = 5 n> + T (3.12) 


and so, from now on, the hyperbolic functions can be expressed in terms of exponential functions: 


BC = — — sinh z, (3.13a) OB= τς = cosh z, (3.13b) 

ccce ET 

AD — — ——  — tanh x. (3.13c) 
ee 


These equations represent the most popular definition of the hyperbolic functions. 


3.1.3 Plane Triangles 
3.1.3.1 Statements about Plane Triangles 
1. The Sum of Two Sides ofa plane triangle is greater than the third one (Fig. 3.8): 
b+c>a. (3.14) 
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2. TheSum ofthe Angles of a plane triangle is 
a+B+7= 180°. (3.15) 


3. Unique Determination of Triangles A triangle is uniquely determined by the following data: 
e by three sides or 

e by two sides and the angle between them or 

e by one side and the two angles on it. 

If two sides and the angle opposite one of them are given, then they define two, one or no triangle (see 
the third basic problem in Table 3.4, p. 145). 


4 





b 


Figure 3.8 Figure 3.9 Figure 3.10 


4. Median ofa Triangle is a line connecting a vertex of the triangle with the midpoint of the 
opposite side. The medians of the triangle intersect each other at one point, at the center of gravity of 
the triangle (Fig. 3.10), which divides them in the ratio 2 : 1 counting from the vertex. 

5. Bisector ofa Triangle isa line which divides one of the interior angles into two equal parts. 
The bisectors intersect each other at one point. 

6. Incircle isthe circle inscribed in a triangle, i.e., all the sides are tangents of the circle. Its center 
is the intersection point of the bisectors (Fig. 3.9). The radius of the inscribed circle is called the 
apothem or the short radius. 

7. Circumcircle is the circle drawn around a triangle, i.e., passing 

through the vertices of the triangle (Fig. 3.11). Its center is the inter- 


section point of the three right bisectors of the triangle. 
8. Altitude of a Triangle is the perpendicular line that starts at a ; 


vertex and is perpendicular to the opposite side. The altitudes intersect 
each other at one point, the orthocenter. E 
9. Isosceles Triangle In an isosceles triangle two sides have equal 


length. 'The altitude, median, and bisector of the third side coincide. For a 
triangle the equality of any two of these sides is enough to make it isosceles. Figure 3.11 


10. Equilateral Triangle In an equilateral triangle with a — b — c the centers of the incircle and 
the circumcircle, the center of gravity, and the orthocenter coincide. 

11. Median isa line connecting two midpoints of sides of a triangle; it is parallel to the third side 
and has the half of length as that side. 

12. Right-Angled Triangle is a triangle that has a right angle (an angle of 90°) (Fig. 3.31, see 
p. 142). 


3.1.3.2 Symmetry 
1. Central Symmetry 


A plane figure is called center symmetric if by a rotation of the plane by 180° around the central point 
or the center of symmetry S it exactly covers itself (Fig. 3.12). Because the size and shape of the 
figure do not change during this transformation, it is called a congruent mapping. Also the sense class 
or orientation class of the plane figure remains the same (Fig. 3.12). Because of the same sense class 
such figures are called directly congruent. 
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The orientation of a figure means the traverse of the boundary of a figure in a direction: positive direc- 
tion, hence counterclockwise, negative direction, hence clockwise (Fig. 3.12, Fig. 3.13). 


Cc’ 





Figure 3.12 Figure 3.13 
2. Axial Symmetry 


A plane figure is called axially symmetric if the corresponding points cover each other after a rotation in 
space by 180° around a line g (Fig. 3.13). The corresponding points have the same distance from the 
axis g, the axis of symmetry. The orientation of the figure is reversed for axial symmetry with respect 
to the line g. Therefore, such figures are called indirectly congruent. This transformation is called a 
reflection in g. Because the size and the shape of the figures do not change, it is also called a indirect 
congruent mapping. The orientation class of the plane figure is reversed under this transformation 
(Fig. 3.13). 

Remark: For space figures hold the analogous statements. 


3. Congruent Triangles, Congruence Theorems 

a) Congruence: Plane figures are called congruent if their size and shape coincide. Congruent fig- 
ures can be transformed into a position of superimposition by the following three transformations, by 
translation, rotation, and reflection, and combinations of these. 

It is to distinguish between directly congruent figures and indirectly congruent figures. Directly con- 
gruent figures can be transformed into a covering position by translation and rotation. Because the 
indirectly congruent figures have a reversed sense class, an axially symmetric transformation with re- 
spect to a line is also needed to transform them into a covering position. 

E Axially symmetric figures are indirectly congruent. To transform them into each other all three 
transformations are needed. 


b) Laws of Congruence for Triangles: Two triangles are congruent if they coincide for: 

e three sides (SSS) or 

e two sides and the angle between them (SAS), or 

e one side and the interior angles on this side (ASA), or 

e two sides and the interior angle being opposite to the longer one (SSA). 

4. Similar Triangles, Similarity Theorems 

Plane figures are called similar if they have the same shape without having the same size. For similar 
figures there is a one-to-one mapping between their points such that every angle in one figure is the 
same as the corresponding angle in the other figure. An equivalent definition is the following: In similar 
figures the length of segments corresponding to each other are proportional. 





a) Similarity of figures requires either the equality of all the corresponding angles or the equality of 
the ratio of all corresponding segments. 

b) Area The areas of similar plane figures are proportional to the square of the ratio of corresponding 
linear elements such as sides, altitudes, diagonals, etc. 

c) Laws of Similarity For triangles the following laws of similarity hold. Triangles are similar if they 
coincide for: 
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e two ratios of sides, 

e two interior angles, 

e the ratio of two sides and the interior angle between them, 

e the ratio of two sides and the interior angle opposite of the longer one. 

Because in the laws of similarity only the equality of ratios of sides is required and not the equality of 
length of sides, therefore the laws of similarity require less than the corresponding laws of congruence. 


5. Intercept Theorems 
The intercept theorems are a consequence of the 
laws of similarity of a triangle. 
1. First Intercept Theorem If two rays start- 
ing at the same point S are intersected by two par- 
allels p;,p2, then the segments of one of the rays 
(Fig. 3.14a) have the same ratio as the correspond- 
ing segments on the other one: 


SP, SP, 
5Qi| 159 
Consequently, every segment on one of the rays is 


proportional to the corresponding segment on the 
other ray. Figure 3.14 


(3.16) 

















2. Second Intercept Theorem If two rays starting at the same point S are intersected by two par- 
allels pı, p2 , then the segments of the parallels have the same ratio as the corresponding segments on 
the rays (Fig. 3.14a): 

| SP,| _ | PP SP, | [AB 
SQI Q1Q» 5Q» QiQ» 


The intercept theorems are also valid in the case of intersecting lines at the point S, if the point S is 
between the parallels (Fig. 3.14b). 








(3.17) 











3.1.4 Plane Quadrangles 


3.1.4.1 Parallelogram 

A quadrangle is called a parallelogram (Fig. 3.15) if it has the following properties: 

e the sides opposite to each other have the same length, 

e the sides opposite to each other are parallel, 

e the diagonals intersect each other at their midpoints, 

e the angles opposite to each other are equal. 

Supposing only one of the previous properties for a quadrangle, or supposing the equality and the 
parallelism of one pair of opposite sides, then all the other properties follow from it. 

The relations between diagonals, sides, and area are the following: 


d + d3 = 2(a° +b°), (3.18) h = bsina, (3.19) S=ah. (3.20) 
>X p? 
[ox X | [^ E 
a A z 


Figure 3.15 Figure 3.16 Figure 3.17 
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3.1.4.2 Rectangle and Square 

A parallelogram is a rectangle (Fig. 3.16), if it has: 

e only right angles, or 

e the diagonals have the same length. 

Only one of these properties is enough, because either of them follows from the other. It is sufficient 
to show that one angle of the parallelogram is a right angle, then all the angles are right angles. If a 
quadrangle has four right angles, it is a rectangle. 

The perimeter U and the area S of a rectangle are: 


U — 2(a 4- b), (3.21a) S — ab. (3.21b) 


If a — b holds (Fig. 3.17), the rectangle is called a square, and the following formulas 


d? 
d=aV2~1.414a, (3.22) a= e 0.707d, (3.33) S=@= y: (321) 


3.1.4.3 Rhombus 

A rhombus (Fig. 3.18) is a parallelogram in which 

e all the sides have the same length, or 

e the diagonals are perpendicular to each other, or 

e the diagonals are bisectors of the angles of the parallelogram. 

Any of the previous properties is enough alone; all the others follow from it. For the rhombus 


d — 2a cos 5 (3.25) dy = asin 5, (3.26) @+d=4a. — (327) 


dida 





(3.28) 


S = ah = aà?sina — 


3.1.4.4 Trapezoid 

A quadrangle is called trapezoid if it has two parallel sides (Fig. 3.19). The parallel sides are called 
bases. With the notation a and b for the bases, h for the altitude and m for the median of the trapezoid 
which connects the midpoints of the two non-parallel sides 


_ h(a+2b) 


a+b (a+ b)h 
AR eae hs = ——_~.. 
mh, (3.30) vs 3(a +b) 


(4339  $- 775 





(3.31) 


m= 


The centroid is on the connecting segment of the midpoints of the parallel basis a and b, at a distance hg 
(3.31) from the base a. For the calculation of the coordinates of the centroid by integration see 8.2.2.3, 
p. 506. 


For an isosceles trapezoid with d — c 





S = (a—ccosy)csiny = (b+ ccosy)csiny. (3.32) 
bE ΓΝ 
XX 

a a 

Figure 3.18 Figure 3.19 Figure 3.20 


3.1.4.5 General Quadrangle 
A closed plane figure bounded by four straight line segments is called a general quadrangle. If the 
diagonals lie fully inside the quadrangle, it is convex, otherwise concave. The general quadrangle is 
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divisible by two diagonals dj, dz in two triangles (Fig. 3.20). Therefore, in every quadrangle the sum 
of the interior angles is 2 - 180° = 360°: 


4 
So a; = 360°. (3.33) 
ici 


The length of the segment m connecting the midpoints of the diagonals (Fig. 3.20) is given by 
a? ὁ) ed? — d? diu Am. (3.34) 
The area of the general quadrangle is 


i 
S= dido sina. (3.35) 


3.1.4.6 Inscribed Quadrangle 

A quadrangle which can be circumscribed by a circumcircle is called an inscribed quadrangle (Fig. 
3.21a) and its sides are chords of this circle. A quadrangle is an inscribed quadrangle if and only if the 
sums of its opposite angles are 180°: 





aty=6+6= 180°. (3.36) 
Ptolemy’s theorem is valid for the insribed quadrangle: 
ac + bd = didə. (3.37) 
The radius of the circumcircle of an inscribed quadrangle is 
a 
1 
4 R= 15 (ab + cd)(ac + bd)(ad + cb) . (3.38) 
The diagonals can be calculated by the formulas 

__| (ac + bd)(ab + ed) 
c dı = IADE ; (5.394) 

b) (ac + bd) (ad + bc) 
dy = BERE. ^ (3.39b) 

Figure 3.21 ασε 


1 
The area can be expressed in terms of the half-perimeter of the quadrangle s = σία +b+c+d): 


S = y (s — a)(s — b)(s — c)(s — d). (3.40) 
If the inscribed quadrangle is also a circumscribing quadrangle (see Fig. 3.21 and 3.1.4.7), then 

S = Vabcd. (3.41) 
3.1.4.7 Circumscribing Quadrangle 
If a quadrangle has an inscribed circle (Fig. 3.21b), then it is called a circumscribing quadrangle, and 
the sides are tangents to the circle. A quadrangle has an inscribed circle if and only if the sum of the 
lengths of the opposite sides are equal, and this sum is also equal to the half-perimeter s: 

1 
s=p(atbtetd)=ate=bt+d. (3.42) 


The area of the circumscribing quadrangle is 


S — (a 4- c)r — (b 4- d)r, (3.43) 
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where r is the radius of the inscribed circle. 
3.1.5 Polygonsinthe Plane 
3.1.5.1 General Polygon 


A closed plane figure bounded by straight-line segments as its sides, can be decomposed into n — 2 
triangles (Fig. 3.22). The sums of the exterior angles ;, and of the interior angles 7;, and the number 
of diagonals D are 

Z - n(n — 3) 


$28;—3005, (3.44) S324 =180°(n— 2), (3.45) D- 
i=l 


3.46 
2 7 (3.46) 


(7 





Figure 3.22 Figure 3.23 


3.1.5.2 Regular Convex Polygons 

Regular convex polygons (Fig. 3.23) have n equal sides and n equal angles. The intersection point of 
the mid-perpendiculars of the sides is the center M of the inscribed and of the circumscribed circle with 
radii r and R, respectively. The sides of these polygons are tangents to the inscribed circle and chords 
of the the circumcircle. They form a circumscribing polygon or tangent polygon for the inscribed circle 
and a inscribed polygon in the circumcircle. The decomposition of a regular convex n gon (regular 
convex polygon) results in n isosceles congruent triangles around the center M. 


























Sane 9 
Central Angle yn = 300 . (347) Base Angle Qn = (1 - 3) - 90°. (3.48 
n 
2 . 360° : o κ 
Exterior Angle 8, = ——. (3.49) Interior Angle γης 1805 -- Αη. (3.50 
n 
i 1 
Circumcircle Radius R= E , Rar?+-a?. (3.51 
. 180 45 
2811 
n 
n 180* 1805 
Inscribed Circle Radius r= = cot zu = Rcos 20 : (3.52 
2 n n 
Side Length a, = 2V R? — r? = 2Rsin m = 2r tan r .(3.53) | Perimeter U — na, . (3.54 
Gn M? az 
Side Length of the 2n gon Gon = Ry} 2— 24/1 (=) 5 An = Agny| 4 (8) : (3.55 
1 2 gn l pa 1 2 Pn 
Area of the n gon Sa = grat =nr* tan a jnE sin Yn = gran cot a (3.56 
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nR? 452 52, 
Area of the 2n gon Son = A {1 ip S4 — So M1 T . (3.57) 


3.1.5.3 Some Regular Convex Polygons 


The properties of some regular convex polygons are collected in Table 3.2. 








The pentagon and the pentagram deserve special attention since it is presumed that Hippasos of Meta- 
pontum (ca. 450 BC) recognized irrational numbers by the properties of these polygons (see 1.1.1.2, 
p. 2). A discussion follows in the example: 


Β The diagonals of a regular pentagon (Fig. 3.24) form an inscribed 
pentagram. Its sides enclose a regular pentagon again. In a regular pen- A 
tagon, the proportion of a diagonal and a side is equal to the proportion 
of a side and the (diagonal minus side): ag : a1 = a1 : (ao — a1) = a1 : a2, 
where a5 — ag — ai. 

Considering smaller and smaller nested pentagons with ag = a1 — a2, a4 = 


dg — à3,... and à € d41,03 < 43,04 € à3..., yields ao : a1 αι : a2 = 
2 : a3 = a3 : d4 — ::-. 'The Euclidean algorithm for αρ and a; never 
breaks off, since ao = 1 -a1 +42, a1 = 1- a2 +43, a2 = 1- a3 +44, . . . , hence 


dn = 1. The side a, and diagonal ap of the regular pentagon are incom- 
mensurable. The continued fraction determined by ag : a4 is identical to 
the golden section in BI B, 1.1.1.4, 3., p. 4, i.e., it results in an irrational Figure 3.24 
number. 





3.1.6 The Circle and Related Shapes 
3.1.6.1 Circle 


The circle is the locus of the points in a plane which are at the same given distance from a given point, 
the center of the circle. The distance itself, and also the line segment connecting the center with any 
point of the circle, is called the radius. The circumference or periphery of the circle encloses the area of 
the circle. A line segment connecting two points of the circle is called a chord. A line passing through 
two points of a circle is called a secant. Lines having exactly one common point with the circle are called 
tangent lines or tangents of the circle. 


Chord Theorem (Fig. 3.26) AC -AD = AB- AE — r? — m?. 

Secant Theorem (Fig. 3.27) ΑΒ: ΑΕ -- ΑΟ. ΔΡ =m? — 1”. 

Secant-Tangent Theorem (Fig. 3.27) AT° = AB- AE = AC - AD — m? — r?. 
Perimeter U =2nr © 6,283r, U — dez 3,142d, U =2VnS & 3,545VS. 


ΤΑ 
A Ç g LA 
ον, κ 7 


D 


Figure 3.25 Figure 3.26 Figure 3.27 
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Table 3.2 Properties of some regular polygons 
απ R r Sn 
2 2 
3 gon | a3 RYV3 = 2rV/3 Ἕν 2r τη A3 4 zi AS a v3 
h= 2y3= 5R - 303 
2 
5gon |as— 5 V10-2V5 — 10 V50--10V5 | = τ 25+10V5 | = LE 10V5 
R 5R? 
=2ry5- 2V5 | --τ(νὸ -- 1) mA) zx 10 - 25 
= 5r?y/5 — 2/5 
6 gon | aş = 23 = =rv3 £ 3 -y EE 
= AS 
8gon | as = Ry2— V2 η = F(v2+1) — 2a2(v2 4- 1) 
—2r(y2—1) |-—ry4—2V2 £ 24 2 = 2R?V2 
= 8r?(/2 — 1) 
R 010 10 5a? 
10 gon a 7 5 (V5 — 1) = (V5.1) —— 54+ 2V5 = 5 +2V5 
R -1 
25-10/5 |-.v50-10/5 |=7V10+2v5 10 — 2/5 
=2r aE — 105 
2 
Area Smiles S= T ~ 0,785, S= - (3.62 
ς U 
Radius r= a 0.159U. (3.63) Diameter d—2r— ΤΗ e 1.238V S. (3.64 
π 


For the following formulas with angles see the definition of the angle in 3.1.1.2, p. 129. 


Angle of Circumference (Fig. 3.25) 
A Special Case is the Theorem of Thales (see p. 142) y = 


i os. 2 
Angle Between a Chord and a Tangent (Fig. 3.25) 6 = 3 AC= E xC0A. 


Interior Angle (Fig. 3.26) 


Exterior 


Angle (Fig. 3.27) 


180°, 


~ ~ 


7 =5(CB + ED) = 


La =X 
a = 5(DE - BC) = 


1 απ ας 
Angle Between Secant and Tangent (Fig. 3.27) 8 = (TE -TB)- 


qp Ex oll 1 
= 9 BC= 5k BOC = 59. 


ie, a=90°. 


τ BOC + XE0D). 


SEC — X COB). 


(3.65a 
(3.65b 


(3.66 
(3.67 


(3.68 


τ TOE — X BOT).(3.69 
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Inscribed Angle (Fig. 3.28), D and E are arbitrary points on the arcs to the left and to the right. 
IN is 1 
a= (BDC — CEB) = 3 (X B0C — XCOB) 








= 5 (360° XCOB — XCOB) = 180° — XCOB. (3.70) 


3.1.6.2 Circular Segment and Circular Sector 
Defining quantities: Radius r and central angle a (Fig. 3.29). The amounts to determine are: 











Chord a = 2V 2hr — h? = 2r sin 5 . (3.71) 
Central Angle a = 2arcsin z (a measured in degrees). (3.72) 
. 
2 
Height of the Circular Segment. h —r r? ^ =r (1 cos 5) = 5 tans 5 (3.73) 
2πτα . n 
Arc Length l= EE 0.01745ra (ain radian measure) (3.74a) 
b—a 1 
[nas læ a+ Epe, 


i p τν 





Figure 3.28 Figure 3.29 Figure 3.30 
ara 2 
Area of the Sector g= "mp ~ 0.00873r*a. (3.75) 
Areaót Circular Segment S=. ( TA eij ) Stir —alr—h)], S~t (6a+8b). (3.76) 
— —|———smnaoaj-—-—ur—air—hn 55 —(6a i y 
5 2 V180 2 15 


3.1.6.3 Annulus 


Defining quantities of an annulus: Exterior radius R, interior radius r and central angle (Fig. 3.30). 





Exterior Diameter D =2 R. (3.77) Interior Diameter d=2r. (3.78) 
: R+r 

Mean Radius poc (3.79) Breadth ofthe Annulus  ó— R—r. (3.80) 

Area of the Annulus 5 — 7(R? — r?) = qur — d?) 2 2n pà. (3.81) 


Area of an Annulus Sector for a Central Angle ¢ (shaded area in Fig. 3.30) 


Sy = = (m p = τη (D? d = D (yin radian measure). (3.82) 
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3.2 Plane Trigonometry 


3.2.1 Triangles 
3.2.1.1 Calculations in Right-Angled Triangles in the Plane 


1. Basic Formulas 
Notation (Fig. 3.31): 
c hypotenuse; a, b other sides, or legs of the right angle; œ and 8 


the angles opposite to the sides a and b respectively; h altitude; ON 
Pp, q hypotenuse segments; S area. 
Sum of Angles a+ 8+7= 180° with 7 = 90°, (3.83) 
Calculation of Sides a — csina — ccos B 
— btana — bcot B, (3.84) 
Pythagoras Theorem +P =e. (3.85) 





Thales Theorem The vertex angles of all triangles in a semicir- 


cle with hypotenuse as the base are right angles, i.e., all angles ISX 


at circumference in this semicircle are right angles (see Fig. 3.32 


and (3.65b), p. 140). S 
Euclidean Theorems hh?=pq, a?=pc, b? = qc, (3.86) Ze 78M 


b 2 2 
Area s= > = S tan B= τσι 20. (3.87) Figure 3.32 


2. Calculation of Sides and Angles of a Right-Angled Triangle in the Plane 
In a right-angled triangle among the 
six defining quantities (three angles 
o, B,^; and the sides opposite to them 
à, b, c, which are not all independent, of | 
course), one angle, in Fig. 3.31 the an- 
gle 7, is given as 90°. e.g. aa | 8 = 90° — a | b= acota = | 
A plane triangle can be determined by sina | 


Table 3.3 Defining quantities of a right-angled triangle 
in the plane 





Given Calculation of the other mm 














c= 
three defining quantities but these can- P 2 — ane i, c b 
not be given arbitrarily (see 3.1.3.1, ρα σσ οσα nr cosa 





p. 133). So, in the case of a right-angled 
triangle only two more quantities can be 
given. The remaining three quantities nat Su i B= mE 
can be determined from Table 3.3 and SE S LE uS ^7 sna Mee 
(3.15) and (3.83). 


3.2.1.2 Calculations in General (Oblique) Triangles in the Plane 


1. Basic Formulas 
Notation (Fig. 3.33): a, b, c sides; a, 3, y the angles opposite to them; S area; R radius of the circum- 


] ; decet a+b+c 
circle; r radius of the incircle; s = DIL half of the perimeter. 


e.g. ca | 8 = 90° — a | a= csina | b= ccosa 


























Cyclic Permutation 
Because an oblique triangle has no distinguishing side or angle, from every formula containing the 
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Κ᾽ 
Se 


C 
Figure 3.33 Figure 3.34 Figure 3.35 ΚΕ 


sides and angles it is possible to get two further formulas by cyclic permutation the sides and angles 
according to Fig. 3.34. 


o 


S 





TW. 





( 






































S b sing : ji 
E From £ = i (sine law) one gets by cyclic permutation: - = ο. ] f= =L 
b sing c sing’ a sina 
b c 
Sine Law -A a aR (3.88 
sina sinf siny 
Projection Rule (see Fig. 3.35) c= acos 8 +bcosa. (3.89 
Cosine Law or Pythagoras Theorem in General Triangles ° = a? + b? — 2ab cos y . (3.90 
Mollweide Equations 
— B -β 
(a+ b) sin T eos | , (391a) (a — 5) cos T esin ( ©]. (3.91b 
2 2 2 2 
mae B 
a+b “ND 
T: = ; : 
angent Law a2 a-g (3.92 
tan 
2 
s—b)(s—c 
Half-Angle Formulas tan a= ΕΞ f (3.93 
2 s(s—a) 
ing 5 
Tangent Formula tana = ο. a, (3.94 
c—acosB b—acosy 
Additional Relations 
(3.95a) ο ο (9.950 
2 bc 
Height Corresponding to the Side a ha = bsiny — csinB. (3.96 
1 
Median of the Side a Ma = pv b? + c2 + 2becosa. (3.97 
2be cos Č 
Bisector of the Angle a ly = ——4 (3.98 
b+e 
7 s ἡ α b c 
Radius of the Circumcircle R= (3.99 


2sna  2sinf  2siny' 
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s— a)(s—b)(s—c ν 9 
Radius of theIncircle r = ἘΞ stan : tan ; tan ; (3.100) 
s 
adhere au 3.101 
= 4Rsin 5 sin 5 sin 5 - (3.101) 
1 
Area $= je siny — 2R? sinosin f siny — rs — y s(s — a) (s — b)(s— c). (3.102) 


The formula S — ,/s(s — a)(s — b)(s — c) is called Heron's formula. 





2. Calculation of Sides, Angles, and Area in General Triangles 
According to the congruence theorems (see 3.1.3.2, p. 134) a triangle is determined by three independent 
quantities, among which there must be at least one side. 


From here follow the four so-called basic problems. If from the six defining quantities (three angles 
a, B, y and the sides opposite to them a, b, c) three independent quantities are given, one can calculate 
the remaining three with the equations in Table 3.4, p. 145. 
In contrast to spherical trigonometry (see the second basic problem, Table 3.9, p. 172) in a plane 
triangle there is no way to get any side only from the angles. 


3.2.2 Geodesic Applications 
3.2.2.1 Geodesic Coordinates 


In geometry usually right-handed coordinate systems are used to determine points in plane or space 
(Fig. 3.170). In contrast with this, in geodesy left-handed coordinate systems are in use. 


1. Geodesic Rectangular Coordinates 

In a plane left-handed rectangular coordinate system (Fig. 3.37) the z-axis of abscissae is shown up- 
ward, the y-axis of ordinates is shown to the right. A point P has coordinates yp, xp. The orientation 
of the x-axis follows from practical reasons. When measuring long distances, for which mostly the 
Soldner- or the Gauss-Krueger coordinate systems are in use (see 3.4.1.2, p. 162), the positive x-axis 
points to grid North, the y-axis oriented to the right points to East. The enumeration of the quadrants 
follows a clockwise direction in contrast with the usual practice in geometry (Fig. 3.37, Fig. 3.38). 
If besides the position of a point in the plane also its altitude is to be considered, one can use a three- 
dimensional left-handed rectangular coordinate system (y, x, z), where the z-axis points to the zenith 
(Fig. 3.36). 





Figure 3.36 Figure 3.37 Figure 3.38 


2. Geodesic Polar Coordinates 

In the left-handed plane polar coordinate system of geodesy (Fig. 3.38) a point P is given by the 
directional (azimuthal) angle t between the axis of abscissae and the line segment s, and by the length of 
the line segment s between the point and the origin (called the pole). In geodesy the positive orientation 
of an angle is the clockwise direction. 

To determine the altitude the zenith distance ¢ can be used or the vertical angle respectively the angle 
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of tilt a. In Fig. 3.36 is shown that in a three-dimensional rectangular left-handed coordinate system 
(see also left and right-handed coordinate systems 3.5.3.1, 2., p. 209), the zenith distance is measured 
between the zenith axis z and the line segment s, the angle of tilt between the line segment s and its 
perpendicular projection on the y, x plane. 


Table 3.4 Defining quantities of a general triangle, basic problems 





























Given Formulas for calculating the other quantities 
asin 
1. lsideand2 y — 180? — o — fi, b= ο... 
angles (a, a, B) asiny HE 
e—— 5 S=-—absiny. 
sina 2 
α-- a—b a+ 1 
2. 2 sides and the tan B 5 cot 1 E 90* ——5 
angle between 2 at 2 2 2 
them (a,b, y) o and f come from a 4- B and a — B, 
37 1 
pec ο S=-absiny. 
sina 2 
; 5 bsina 
3. 2 sides and the sin 8 = : 
angle opposite . | . 
one of them If a > b holds, then 8 < 90° and is uniquely determined. If 
(a, b, o») a <b holds, the following cases occur: 


1. 6 has two values for bsina < a (By = 180° — βι). 
2. B has exactly one value (90?) for bsina = a. 


3. For bsina > a there is no such triangle. 


si 1 
η = 180? — (a 4- B), ie S = -a bsing. 
sina 2 








4. 3 sides (a,b,c) 

















3. Scale 
In cartography the scale factor M is the ratio of a segment sj in a coordinate system K with respect 
to the corresponding segment sgo in another coordinate system Ko. 
1. Conversion of Segments With m as a modulus or scale and N as an index for the nature and K 
as the index of the map holds: 

M-—1:mc-sk:sy. (3.103a) 
For two segments sx, 5x2 With different moduli m4, mg yields: 

SK1 | SK2 = M2 : Mı. (3.103b) 
2. Conversion of Areas If the areas are calculated according to the formulas Fx = axbx, Fy = 
ayby, then: 

Fy = Fem. (3.104a) 





146 3. Geometry 





For two areas F1, F5 with different moduli mi, m» : 
Figs: Feo = m : m2. (3.104b) 
3.2.2.2 Angles in Geodesy 


1. Grade or Gon Division 

In geodesy, in contrast to mathematics (see 3.1.1.5, p. 131) the gon measure is in use as a unit for angles. 
The perigon or full angle corresponds here to 400 grade or gon. The conversion between degrees and 
gons can be performed by the formulas in Table 3.5: 


Table 3.5 Grade and Gon Division 





1 Full angle = 360° =27rad = 400 gon 
1 right angle =90° = 5d — 100 gon 
π 
1gon = 200 rad = 1000 mgon 











2. Directional Angle 

The directional angle t at a point P gives the direction of an oriented line segment with respect to a 
line passing through the point P parallel to the x-axis (see point A and the directional angle tag in 
Fig. 3.39). Because the measuring of angles in geodesy is made in a clockwise direction (Fig. 3.37, 
Fig. 3.38), the quadrants are enumerated in the opposite order to the right-handed Cartesian coordi- 
nate system of plane trigonometry (Table 3.6). The formulas of plane trigonometry are valid without 
change. 


Table 3.6 Directional angle in a segment with correct sign for arctan 



































| Quadrant I II ΠῚ IV 

| Sign of numerator T = . + 
N 

| n tan > 0 tan < 0 tan > 0 tan < 0 
Δα 

| Directional angle t to gon | to + 200 gon | to + 200 gon | to + 400 gon 





3. Coordinate Transformations 


1. Calculation of Polar Coordinates from Rectangular Coordinates For two points A(yA, r4) 
and B(yg, ag) in a rectangular coordinate system (Fig. 3.39) with the segment s4g oriented from A 
to B and the directional angles tap, tga the following formulas are valid: 


UB—VA _ AYAB 
TB— TA i Avan’ (5:1054) 8AB —N AyAp t Aváp , (3.105b) 


AYAB 
ATAB 








tantap = (3.105c) tga = tap + 200 gon. (3.105d) 


N 
The quadrant of the angle t depends on the signs of AyAp and AzAg. If using a calculator A is 
x 


punched in with the correct signs for Ay and Ax, then one gets an angle to by pressing the arctan 
button to which is to add the gon-value given in Table 3.6 according to the corresponding quadrant. 
2. Calculation of Rectangular Coordinates from Distances and Angles In a rectangular co- 
ordinate system are to determine the coordinates of a point C by measuring in a local polar system 
(Fig. 3.40). 


3.2 Plane Trigonometry 147 





Given: y4, %4; YB, &B. Measured: a, sgc. Find: yc, zo. 








Solution: 
Δι 
tantas = 22 , (3.106a) tso = tag +a +200gon, (3.106b) 
ATAB 
UC — yB + sge Sin tge, (3.106c) £o = £g + spc cos tge- (3.106d) 


If also sAp is measured, then the difference between the locally measured distance and the distance 
computed from the coordinates can be considered by multiplying with the scale factor q, where q must 
be very close to 1: 


Pa calculated distance y Ay? pg - Δαλῃ (3.107) 


- ; 
measured distance SAB 





Yo — yp ^ Sgoqsintpo, (3.107b) £o = £g + spoqcostpo. (3.107c) 





Figure 3.39 


3. Coordinate Transformation Between Two Rectangular Coordinate Systems In order to 
locate a given point on a country-map the local system y’, x’ is to be transformed into the system y, x of 
coordinates of the map (Fig. 3.41). The system y’, x’ is rotated into y, x by an angle y and is translated 
parallel by yo, πο. The directional angles in the system y’, 2’ are denoted by J. The coordinates of A and 
B are given in both systems and the coordinates of a point C in the 2’, y’-system. The transformation 


is given by the following relations: 
Sp — y Ay ig - Aag , (3.108b 


Sap = Ayip t+ Ariz, (3.1082) 


Figure 3.40 


_ SAB 








q= FE (3.108c) p = taB — VAB, (3.108d 
A Ay. 

tantas = =, (3.108e) tan Dan = 42, (3.108f 
ATAB Axe 





Yo = Ya—Quasinyg—qyacosy, (3.108g) % =ta+qyasiny—qzacosy, (3.108h 


yo = Ya + qsin Y(T — 2^4) -- qcos e(ye — y). (3.108i 
zo = xta + qcosq(xo — x4) — qsing(ye — y4). (3.108j 
Remark: The following two formulas can be used as a check. 


yo = JA qs4c Sin(p--UA4c), (3.108k) 





to = wAdqsaccos(p--04c). (3.1081 
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If the segment AB is on the z'-axis, the formulas reduce to 





A Ας 

α-Ξ545-γεῖπῳ, (3.1092) b = AP . qcosq, (3.109b) 
Up Tp 

Yo = Ya + azg + byo, (3.109c) zo = T4 + bto — ayo, (3.109d) 

Yo = Ayach — Axaca, (3.109e) to = Avacb + Ayaca. (3. 109f) 








Figure 3.41 Figure 3.42 


3.2.2.3 Applications in Surveying 

The determination of the coordinates of a point N to be fixed by triangulation is a frequent measuring 
problem in geodesy. The methods of solving it are intersection, three-point resection, arc intersection, 
free stationing and traversing. The last two methods are not discussed here. 

1. Intersection 

1. Intersection of Two Oriented Lines or first fundamental problem of triangulation: To deter- 
mine a point N by two given points A and B with the help of a triangle ABN (Fig. 3.42). 

Given: y4, x4; yp, xp. Measured: o, D, if it is possible also y or y = 200 gon —a — f. Find: yy, zy. 
Solution: 


A 
tantas = m (3.1102) sap = y ^yĵp + Az} p = |Ayaeg sin tag| + Aag costaa| ,(3.110b) 








TAB 
sina sina (3.1100) sin 8 sin 8 (3.110d) 
5 — SAB——— — S$AB——— — — : 5 = g = = SAB = s : c 
HN AAB sin y ^B gin (a+ 8) αμ sin y "AB Sin (a 4- B) 
tan = tAB — Q, (3.110e) tgN —tpgA- B — tap - B 200 gon, (3.110f) 
Yn = Ya + san SiN tAn = YB + Spy SÌN tBN, (3.110g) 
UN — £A + san COStAN = £B + Spy COStBN- (3.110h) 


2. Intersection Problem for Non-Visible B Ifthe point B cannot be seen from A, the directional 
angles tan and tgy are to be determined with respect to reference directions to other visible points D 
and E whose coordinates are known (Fig. 3.43). 

Given: ya, t4; Up, tp; UD; Xp; UE, te. Measured: ó in A, £ in B, and if it is possible, also y. 

Find: YN; TN. 
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Figure 3.43 Figure 3.44 


Solution: Reducing to the first fundamental problem, calculating tant4g, according to (3.110b) 
yields: 














^ ^r 
tantap = πε (3.111a) tantpg — n (3.111b) 
LAD TEB 
tan =tap +6, (3.111c) tpn =tpeté, (3.111d) 
a = tgp — tan, (3.111e) B —tgv — tpa, (3.111f) 
ΔΙ Δ 
tantay = UNA | (3.11 1g) tantgy = E, (3.111h) 
ATNA Arne 


_ AynpA t xAtantaw — vp tantgny 


IN , (3.1111) UN = YB + (zy — zg)tantgw. (3.111j) 


tantan — tantgn 


2. Three-Point Resection 


1. Snellius Problem of Three-Point Resection or to determine a point N by three given points 
A, B,C; also called the second fundamental problem of triangulation (Fig. 3.44): 


Given: ya, U4; YB, UB; Yc,tc. Measured: 61,62 in N. Find: yy, oy. 




















Solution: 
tan tac = uu (3.1122) tantge = — , (3.112b) 
ga Siac _ Ata (3.112c) υ- DUsc _ Areco (3.112d) 
sin tac cos tac sin tge cos tge 
qoia imcie-cim (3.1120) E = 180° — m (3.1121) 


The equality (3.112f) is a first condition to determine y and v. A second condition one gets from 
(2.114) and (2.115), p. 83: 


s sind d ο 
τ = tan PTY -cot © ae i 
sin y — sin Y 2 2 





(3.112g) 
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With the sine law (3.88) follows 


sin y _ SON sin v _ SON 





























= = —, 112] 
sin 6) a’ sindg Ὅν (3 i) 
putting into (3.112g) gives 
i5 w tan © +y. sin Q— Ein Vo ied +Y. bein δι --ᾱ sin δ (3.1121) 
2 2 sin Y + sin Y 2 bsin 6; + asin 62 
From (3.112i) one gets ov and together with (3.112f) 
pty -Y otv φ-Ψ ‘ 
ge +t 28 y= E gos (3.1125) 
From here one can determine the following line segments and points: 
aq, b 
Saw sin(d 3.112k are g 
SAN = & sin( 4- p) ( ) SBN = 5 sin(ôz + Y), (3.1121) 
b 
SCN = B sing = — sinw, (3.112m) 
sin 6, sin 69 
£y = £A + SAN COSÍAN — tp t+ Spy costpn, (3.112n) 
UN = Ya + San SiN tAN = Yg + Sgn SiN tgy. (3.1120) 


2. Three-Point Resection by Cassini 
Given: ya, x4; Up, tp; yc, tc. Measured: 01,05 in N. Find: yy, zw. 


For this method two reference points P and Q are to be used, which are on the reference circles passing 
through A, C, P and B, C, Q so that both are on a line containing N (Fig. 3.45). The centers of the 
circles Hy and Hp are at. the intersection points of the mid perpendicular of AC and of BC with the 
segments PC and QC’. The angles 61, 62 measured at N appear also at P and Q as angles of circum- 
ference. 











Solution: 

yp = Ya + (eo — a) cotd;, — (3.113a) tp =x4+(yo— ya) cotd;, — (3.113b) 

UQ — yB t (xg — zc)cotós, (9.196) zQ = £g + (yg — yc) cot ôz,  (3.113d) 
ΔΙ yo — 7ο -- tt 

"sg (3.113e) ay map 5ο EP (ro Br) cottra (3 1139) 
Ax pQ tan tpo + cot tpo 

YN = yp + (tn — xp) tan tpo (tantpg « cot tpo), (3.113g) 

UN — yc — (xw — zo) cot tpo (cot tpo < tan tpo), (3.113h) 


Dangerous Circle: When choosing the points it is to be ensured that they do not lie on one circle, 
because then there is no solution; one talks about a so-called dangerous circle. The closer the points 
are near to the dangerous circle, the less gets the accuracy of the method. 

3. Arc Intersection 

With this method a so-called new point N is to be determined as intersection point of two arcs around 
two points A and B with known coordinates and with measured radii say and sgy (Fig. 3.46). The 
unknown line segment s 4, is calculated and the angles are to be calculated from the now already known 
three sides of the triangle ABN. 

A second solution — not discussed here — starts from the decomposition of the general triangle into two 
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Figure 3.45 


right-angled triangles. 


Figure 3.46 


Given: ya, x4 : yg, pg. Measured: say; spy. Find: sAp, yy, tw. 














Solution: 
AYAB 
sap = J Ag - AxÀg, (3.114a) tant4g — ————, (3.114b) 
ATAB 
tga = tasg + 200 gon, (3.114c) 
«2 2 g2 2 23 2 
cosa = San + SAB — SBN . (3114d) cos B = SBN T SAB ~ SAN , (3.114e) 
2sANSAB 2SBNSAB 
tan =t4B— Q, (3.114f) ten = tsa- P, (3.114g) 
Yn = Ya + san SÌN tAN, (3.114h) £N = TA + SAn COStAN, (3.114i) 
UN — UB t SBN SÌN İBN, (3.114j) TN = TBT SBN COStBN. (3.114k) 
3.3 Stereometry 
3.3.1 Lines and Planes in Space 
Z. 1. Two Lines Two linesin the same plane have either one or no common 
point. In the second case they are parallel. If there is no plane such that 


it contains both lines, they are skew lines. The angle between two skew 
lines is defined by the angle between two lines parallel to them and passing 
through a common point (Fig. 3.47). The distance between two skew 
lines is defined by the segment which is perpendicular to both of them. 
(There is always a unique transversal line which is perpendicular to both 
Figure 3.47 skew lines and intersects them, too.) 


2. Two Planes Two planes intersect each other in a line or they have no common point. In this 
second case they are parallel. If two planes are perpendicular to the same line or if both are parallel to 
every intersecting pair of lines in the other, the planes are parallel. 
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3. Line and Plane A line can be incident to a plane, it 
can have one or no common point with the plane. In this last 
case it is parallel to the plane. The angle between a line and a 


Z\ plane is measured by the angle between the line and its orthog- 
UU eu onal projection on the plane (Fig. 3.48). If this projection is 
only a point, i.e., if the line is perpendicular to two different 


intersecting lines in the plane, then the line is perpendicular or 
Figure 3.48 orthogonal to the plane. 


| 3.3.2 Edge, Corner, Solid Angle 


1. Edge or dihedral angle is a figure formed by two infinite half-planes starting at the 
same line (Fig.3.49). In everyday terminology the word edge is used for the intersec- 
tion line of the two half-planes. As a measure of edges, one uses the edge-angle ABC, 
Ci. p the angle between two half-lines lying in the half-planes and being perpendicular to 
^. the intersection line DE at a point B. 
2. Corner or polyhedral angle OABCDE (Fig. 3.50) is a figure formed by several 
E planes, the lateral faces, which go through a common point, the vertex 0, and intersect 
L^ each other at the lines 0A, 0B, .... 
Two lines which bound the same lateral face form a plane angle, while neighboring 
Figure3.49 faces form a dihedral angle. 








Polyhedra are equal to each other, i.e., they are congruent, if they are superposable. For this the corre- 
sponding elements, i.e., the edges and plane angles at the vertex must be coincident. If the correspond- 
ing elements at the vertex are equal, but they have an opposite order of sequence, the corners are not 
superposable, and they are called symmetric corners, because they can be brought into a symmetric 
position as shown in Fig. 3.51. 

A convex polyhedral angle lies completely on one side of each of its faces. 

The sum of the plane angles AOB + BOC +...+ E0A (Fig. 3.50) is less than 360° for every convex 
polyhedron. 


3. Trihedral Angles are congruent if they coincide in the following elements: 
in two faces and the corresponding dihedral angle, 

in one face and both dihedral angles belonging to it, 

in three corresponding faces in the same order of sequence, 


9 
9 
9 
e inthree corresponding dihedral angles in the same order of sequence. 


> 


Solid Angle A pencil of rays starting from the same point (and intersecting a closed curve) forms 
a solid angle in space (Fig. 3.52). It is denoted by Q and calculated by the equality 


ucc (3.115a) 


pon 

Here S means the piece of the spherical surface cut out by the solid angle from a sphere whose radius is 
r, and whose center is at the vertex of the solid angle. The unit of solid angle is the steradian (sr) (see 
also p. 1055): 

E n? 

~ Lm?’ 

i.e., a solid angle of 1 sr cuts out a surface area of 1 m? of the unit sphere (r = 1m). 
Bl A: The full solid angle is Q — Azr?/r? — 4s. 

W B: Acone with a vertex angle (also called an apex angle) a — 120? defines(determines) a solid angle 
Q = 2rr?(1 — cos(a /2))/r? = r, where the formula for a spherical cap (3.163)has been used. 


lsr 





(3.115b) 
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Figure 3.50 Figure 3.51 Figure 3.52 Figure 3.53 
3.3.3 Polyeder or Polyhedron 


In this paragraph the following notations are used : V volume, S total surface, M lateral area, h alti- 
tude, Ag base area. 





1. Polyhedron is a solid bounded by plane polygons. 
2. Prism (Fig. 3.53) is a polyhedron with two congruent bases, and having parallelograms as ad- 
ditional faces. A right prism has edges perpendicular to the base, a regular prism is a right prism with 
a regular polygon as the base. For the prism 

V — Ach, (3.116) M — pl, (3.117) S=M+2Ag. (3.118) 
holds. Here p is the perimeter of the section perpendicular to the edges, and / is the length of the edges. 
If the edges are still parallel to each other but the bases are not, the lateral faces are trapezoids. If the 
bases of a triangular prism are not parallel to each other, its volume can be calculated by the formula 
(Fig. 3.54): 
(a+b+0)Q 

3 ; 
where Q is a perpendicular cut, a, b, and c are the lengths of the parallel edges. If the bases of the prism 
are not parallel, then its volume is 

V=/Q, (3.120) 
where [ is the length of the line segment BC connecting the centers of gravity of the bases, and Q is the 
cross-cut perpendicular to this line. 


qu 
, 9] 
A 


Figure 3.54 Figure 3.55 Figure 3.56 


V= (3.119) 





3. Parallelepiped is a prism with parallelograms as bases (Fig. 3.55), i.e., it is bounded by six 
parallelograms. In a parallelepiped all the four body diagonals intersect each other at the same point, 
the midpoint, and halve each other. 

4. Rectangular Parallelepiped or block is a right parallelepiped with rectangles as bases. In a 
block (Fig. 3.56) the body diagonals have the same length. If a, b, and c are the edge lengths of the 
block and d is the length of the diagonal, then 
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d^-a* .b?.d0, (3421) V=abe, (3122) S —2(ab--bc--ca). (3.123) 
5. Cube (or regular hexahedron) is a block with equal edge lengths: a =b=c, 
d? — 3a?, (3.124) V-a, (325) S — 6a?. (3.126) 


6. Pyramid (Fig. 3.57) is a polyhedron whose base is a polygon and its lateral faces are triangles 
with a common point, the vertex. A pyramid is called right if the foot of the perpendicular from the 
vertex to the base Ag is at the midpoint of the base. It is called regular if it is right and the base is a 
regular polygon (Fig. 3.58), and n faced if the base is an n gon. Together with the base the pyramid 
has (n 4- 1) faces. For the volume 


Αα 





V= 3.127 
s (3127) 
holds. For the lateral area of the regular pyramid 
1 
M= zP’ (3.128) 


holds with p as the perimeter of the base and h, as the altitude of a face. 


7. Frustum ofa Pyramid or truncated pyramid is a pyramid whose vertex is cut away by a plane 
parallel to the base (Fig. 3.57, Fig. 3.59). Denoting by S0 the altitude of the pyramid, i.e., the 
perpendicular from the vertex to the base, then 











πα Be QC ta? li 
Area ABCDEF (WV SE 
Area A,B,C, D, EF, n S0, ` ( . ) 


holds. If Ap and Ag are the upper and lower bases, resp., h is the altitude of the truncated pyramid, 
i.e., the distance between the bases, and ap and ag are corresponding sides of the bases, then 


sai 1 ap ap\? 
V - jh [Ac +Ap+ VAcAp| = FAG h +24 P3 | (3.131) 


The lateral surface of a regular truncated pyramid is 


Pp po, 
ΤΣ δι 


where pp and pg are the perimeters of the bases, and As is the altitude of the faces. 


B 


M (3.132) 








Figure 3.57 Figure 3.58 Figure 3.59 
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Figure 3.60 Figure 3.61 Figure 3.62 


8. Tetrahedron isa triangular pyramid (Fig. 3.60). With the notation 
0A =a, 0B =b, 0C — c, CA — q, BC — p and AB — r the following holds: 











0 r? g a1 

1 ο 021 
pel q poc il. (3.133) 

a?b?201 

11110 


9. Obelisk is a polyhedron whose lateral faces are all trapezoids. In the special case in Fig. 3.61 
the bases are rectangles, the opposite edges have the same inclination to the base but they do not have 
a common point. If a, b and aj, bı are the sides of the bases of the obelisk and A is the altitude of it, 
then: 


/ h 
V= : ((2a + a1) b+ (2a, + a) by] = s [ab + (a + ay) (b+ by) - abi]. (3.134) 


10. Wedge is a polyhedron whose base is a rectangle, its lateral faces are two opposite isosceles 
triangles and two isosceles trapezoids (Fig. 3.62). For the volume holds 


1 
V= g (24 +a,)bh. (3.135) 


SAH Ὁ ο ελ 


8) d) 
Figure 3.63 


11. Regular Polyeder have congruent regular polyeders as faces and congruent regular corners. 
The five possible regular polyheders are represented in Fig. 3.63; Table 3.7 shows the corresponding 
data. 
12. Euler's Theorem on Polyeders Ife is the number of vertices, f is the number of faces, and k 
is the number of edges of a convex polyhedron then 

e-k+f=2. (3.136) 
Examples are given in Table 3.7. 
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Table 3.7 Regular polyeders with edge length a 














Number and Number of Total area Volume 

Name forni ottas Edges | Corners S/a? V/a 

; V2 
Tetrahedron 4 triangles 6 4 V3 = 1.7321 T8 ^ 0.1179 
Cube 6 squares 12 8 6 — 6.0 1-210 

v2 
Octahedron 8 triangles 12 6 2/3 = 3.4641 — = 0.4714 

TES: 
Dodecahedron | 12 pentagons | 30 20 3/5(5 + 2V5) = 20.6457 EIU 1.6631 
ab. [4 

Icosahedron | 20 triangles | 30 12 5/3 = 8.6603 δν) ο αγγ 





























3.3.4 Solids Bounded by Curved Surfaces 
In this paragraph the following notations are used: V volume, S total surface, M lateral surface, h 
altitude, Ag base. 


1. Cylindrical Surface is a curved surface which can be got by parallel translation of a line, the 
generating line or generator along a curve, the so-called directing curve (Fig. 3.64). 

2. Cylinder is a solid bounded by a cylindrical surface with a closed directing curve, and by two 
parallel bases cut out from two parallel planes by the cylindrical surface. For every arbitrary cylinder 
(Fig. 3.65) with the base perimeter p, with the perimeter s of the cut perpendicular to the apothem, 
whose area is Q, and with the length / of the apothem the following is valid: 


V — Ach — Ql, (3.137) M -ph- sl. (3.138) 


3. Right Circular Cylinder has a circle as base, and its apothems are perpendicular to the plane 
of the circle (Fig. 3.66). With a base radius R 


V — s Rh, (3.139) M=2rRh, (3.140) S=2nR(R+h). (8.141) 
4. Obliquely Truncated Cylinder (Fig. 3.67) 


V ="R? B (3.142) M =r R(h hs), (3.143) 


hg —1 Ἴ 
S-—-mRih thy; Rc n+ (25%) | (3.144) 


5. Ungula of the Cylinder With the notation of Fig. 3.68 and with a = y/2 in radians 








3 an3 

V= - [a(3 R? — à?) - 3 R*(b — Τα] = D (sn α-- E * — arcos a) ; (3.145) 
2RI 

M= z * ((b— R)a - a], (3.146) 


where the formulas are valid even in the case b > R, p >r. 
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generating line 


directing curve 





Figure 3.64 Figure 3.65 Figure 3.66 Figure 3.67 


6. Hollow Cylinder With the notation R for the outside radius and r for the inside one, ô = R — r ns 
R+r 
2 


V2nh(R?—1?) 2 s hó(2 R— 0) = m hô(2r + ô) =2rhdo. (3.147) 


for the mean radius (Fig. 3.69) 





for the difference of the radii, and o — 





Figure 3.68 Figure 3.69 
7. Conical Surface arises by moving a line, the generating line, along a curve, the direction curve 
so that it always goes through a fixed point, the vertex (Fig. 3.70). 


8. Cone (Fig. 3.71) is bounded by a conical surface with a closed direction curve and a base cut 
out from a plane by the surface. For an arbitrary cone 








hA 
V= ο (3.148) 
directing curve 
δρ 
1 
vertex 
Figure 3.70 Figure 3.71 Figure 3.72 


9. Right Circular Cone has a circle as base and its vertex is right above the centre of the circle 
(Fig. 3.72). With / as the length of the apothem and R as the radius of the base 


1 
V= gr RPh, (3.149) M=n7Rl=n7RVR?2+h?2, (3.150) S=aR(R+1). (5.151) 


10. Frustum of Right Cone or Truncated Cone (Fig. 3.73) 
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Cr; » 
] T 
Figure 3.73 Figure 3.74 
l=y\h?+(R-r}, (3.152) M — zl r), (3.153) 
πο, 2 2 z ο. hr 
V= (R tr Βτ), (3.154) H=h+p— (3.155) 


11. Conic Sections see 3.5.2.11, p. 206. 


12. Sphere (Fig. 3.74) with radius R and diameter D = 2R. Every plane section of it is a circle. 
A plane section through the center results in a great circle (see 3.4.1.1, p. 160) with radius R. Only 
one great circle can be fitted through two surface points of the sphere if they are not the endpoints of 
the same diameter. The shortest connecting surface curve between two surface points is the arc of the 
great circle between them (see 3.4.1.1, p. 160). 

Formulas for the surface and for the volume of the sphere: 








S = 4r R? z: 12.57 R?, (3.156a) S = r D? ~ 3.142 D?, (3.156b) 
4 
S = V36n V2 z 4.836 VV?,  (3.156c) V= gr Re e 4.189 R?, (3.1572) 
D3 3 
v=" w0.5236 D3, (3.157b) y- ; 5” 0.09403 V53, (3.1570) 
π 
1 / 
R= ig = 0.2821 V'S, (3.158a) π- ᾗ E &:0.6204 VV. — (3.158b) 
13. Spherical Sector (Fig. 3.75) 
2 
S=nR(2h+a), (3.159) v= UN n (3.160 
14. Spherical Cap (Fig. 3.76) 
1 1 
a? — h(2.R — h), (3.161) V= gre (3 a? + n?) = 37 GR — h), (3.162 
M=2rRh=r(@ +h), (3.163) S=n(2Rh+a?)=n(h? +207). (3.164 
15. Spherical Layer (Fig. 3.77) 
2 2 2\ 2 
2_ 2 af — bt- h _1 ee 9 
R?-a «( am ) , (8.165) V — gh (3a +30? +h?), (3.166 
M —-2z Rh, (3.167) S=n(2Rh+a? +b’). (3.168 








If Vj is the volume of a truncated cone written in a spherical layer (Fig. 3.78) and / is the length of its 
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apothem, then 
1 
V-V = gr^ (3.169) 


YI RT ασ, 


Figure 3.75 Figure 3.76 Figure 3.77 


16. Torus (Fig. 3.79) is the solid which can be generated by rotating a circle around an axis which 
is in the plane of the circle but does not intersect it. 





S = 47° Rr = 39.48 Rr, (3.170a) S =7° Dd ~ 9.870 Dd, (3.170b) 
V =2r° Rr?’ ~ 19.74Rr°, — (8.171a) V= 1D d?zz2.467 Dd?.  (3.171b) 
d 
E 
το 
Figure 3.78 Figure 3.79 Figure 3.80 


17. Barrel (Fig. 3.80) arises by rotation of a generating curve; a circular barrel by rotation of 
a circular segment, a parabolic barrel by rotation of a parabolic segment. For the circular barrel the 
following approximation formulas hold, 


V ez 0.262 A (2 D?4 d?) (3.172a) or V & 0.0873 h(2 D + d}, (3.172b) 


and for the parabolic barrel 


pert s 20^ «past 3p J ~ 0.05236h (8 D? +4 Dd +34?) (3.173) 
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3.4 Spherical Trigonometry 


For geodesic measures which extend over great distances the spherical shape of the Earth is taken into 
consideration. Therefore the spherical geometry is necessary. In particular one needs formulas for 
spherical triangles, i.e., triangles lying on a sphere. This was also realized by the ancient Greeks, so 
besides the trigonometry of the plane the trigonometry of the sphere has been developed, and nowadays 
Hipparchus (around 150 BC) is considered as the founder of spherical geometry. 


3.4.1 Basic Concepts of Geometry on the Sphere 
3.4.1.1 Curve, Arc, and Angle on the Sphere 
1. Spherical Curves, Great Circle and Small Circle 


Curves on the surface of a sphere are called spherical curves. Important spherical curves are great 
circles and small circles. They are intersection circles of a plane passing through the sphere, the so- 
called intersecting plane (Fig. 3.81): 
If a sphere of radius R is intersected by a plane K at distance h from the center O of the sphere, then 
for radius r of the intersection circle holds 

r=vR-h (O<h< R). (3.174) 
For h = 0 the intersecting plane goes through the center of the sphere, and r takes the greatest possible 
value. In this case the intersection circle g in the planeT is called a great circle. Every other intersection 
circle, with 0 < h < R, is called a small circle, for instance the circle k in Fig. 3.81. For h = R the 
plane K has only one common point with the sphere. Then it is called a tangent plane. 





Figure 3.81 Figure 3.82 Figure 3.83 


HM On the Earth the equator and the meridians with their countermeridians — which are their reflections 
with respect to the Earth’s axis — represent great circles. The parallels of latitude are small circles (see 
also 3.4.1.2, p. 162). 

2. Spherical Distance 

Through two points A and B of the surface of the sphere, which are not opposite points, i.e., they are 
not the endpoints of the same diameter, infinitely many small circles can be drawn, but only one great 
circle (with the plane of the great circle g). Consider two small circles kı, ko through A and B and 
turn them into the plane of the great circle passing through A and B (Fig. 3.82). The great circle has 
the greatest radius and so the smallest curvature. So the shorter arc of the great circle is the shortest 
connection between A and B. It is the shortest connection between A and B on the surface of the 
sphere, and it is called the spherical distance. 


3. Geodesic Lines 


Geodesic lines are the curves on a surface which are the shortest connections between two points of the 
surface (see 3.6.3.6, p. 268). 
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E In the plane the straight lines, on the sphere the great circles, are the geodesic lines (see also 3.4.1.2, 
p. 162). 

4. Measurement of the Spherical Distance 

The spherical distance of two points can be expressed as a measure of length or as a measure of angle 
(Fig. 3.83). 

1. Spherical Distance as a Measure of Angle is the angle between the radii 0A and 0B measured 
at the center 0. This angle determines the spherical distance uniquely, and hereafter it is denoted by a 
lowercase Latin letter. The notation can be given at the center or on the great circle arc. 

2. Spherical Distance as a Measure of Length is the length of the great circle arc between A 


and B. It is denoted by AB (arc AB). 
3. Conversions from Measure of Angle into Measure of Length and conversely can be done 
by the formulas 


a è Ἔκ 
AB= Rarce = R-, (3.175a) e —AB E (3.175b) 
ϱ 
Here e denotes the angle given in degrees and arc e denotes the angle in radian (see radian measure 
3.1.1.5, p. 131). The conversion factor o is equal to 


ο 


1 
o= lrad ES 57.2958? — 3438' — 206265". (3.175c) 
T 








The determinations of the distance as a measure of length or angle are equivalent but in spherical 
trigonometry the spherical distances are given mostly as a measure of angle. 


W A: For spherical calculations on the Earth's surface usually a sphere is considered with the same 
volume as the biaxial reference ellipsoid of Krassowski. This radius of the Earth is R = 6371.110 km, 


and consequently holds 1° ^ 111.2 km, 1' ^ 1853.3m = 10ldseamile. Today 1 seamile — 1852 m. 


E B: The spherical distance between Dresden and St. Petersburg is AB — 1433 km or 
57.3? = 12.89° = 12°53’. 


1433 km 
6371 km 


e= 





Figure 3.84 


5. Intersection Angle, Course Angle, Azimuth 

The intersection angle between spherical curves is the angle between their tangent lines at the intersec- 
tion point Pj. If one of them is a meridian, the intersection angle with the curve segment to the north 
from P, is called the course angle o in navigation. To distinguish the inclination of the curve to the east 
or to the west, a sign is to be assigned to the course angle according to Fig. 3.84a,b restricting it to 
the interval —90° < a < 90°. The course angle is an oriented angle, i.e., it has a sign. It is independent 
of the orientation of the curve — of its sense. 

The orientation of the curve from P; to Py, as in Fig. 3.84c, can be described by the azimuth ô: It is 
the intersection angle between the northern part of the meridian passing through P and the curve arc 
from P to Pz. The azimuth is restricted to the interval 0° < 6 < 360°. 
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Remark: In navigation the position coordinates are usually given in sexagesimal degrees; the spherical 
distances and also the course angles and the azimuth are given in decimal degrees. 


3.4.1.2 Special Coordinate Systems 
1. Geographical Coordinates 


To determine points P on the Earth's surface geographical coordinates are in use (Fig. 3.85), i.e., 
spherical coordinates with the radius of the Earth, the geographical longitude A and the geographical 
latitude q. 

To determine the degree of longitude the surface of the Earth is subdivided by half great circles from the 
north pole to the south pole, by so-called meridians. The zero meridian goes through the observatory 
of Greenwich. From here one counts the east longitude with the help of 180 meridians and the west 
longitude with 180 meridians. At the equator they are at a distance of 111km of each other. East 
longitudes are given as positive, west longitudes are given as negative values. So —180° < A < 180°. 





—X 





Figure 3.85 Figure 3.86 


To determine the degree of the latitude the Earth's surface is divided by small circles parallel to the 
equator. Starting from the equator 90 degrees of latitude are to be counted to the north, the northings, 
and 90 southern latitudes. Northings are positive, southern latitudes are negative. So —90° < y < 90°. 
2. Soldner Coordinates 

The right-angled Soldner coordinates and Gauss- Krüger coordinates are important in wide surface sur- 
veys. To map parts of the curved Earth's surface onto a right-angled coordinate system in a plane, 
distance preserving in the ordinate direction, according to Soldner the z-axis is to be placed on a merid- 
ian (it is called a central meridian), and the origin at a well-measured center point (Fig. 3.86a). The 
ordinate y of a point P is the segment between P and the foot of the spherical orthogonal (great circle) 
on the central meridian. The abscissa x of the point P is the segment of a circle between P and the 
main parallel passing through the center, where the circle is in a parallel plane to the central meridian 
(Fig. 3.86b). 

Transferring the spherical abscissae and ordinates into the plane coordinate system the segment Ax is 
stretched and the directions are distorted. The coefficient of elongation a in the direction of the abscissa 
is Δα 1 y? 
As -— F 2R2’ 
To moderate the stretching, the system may not be extended more than 64km on both sides of the 
central meridian. A segment of 1 km length has an elongation of 0.05 m at y = 64km. 

3. Gauss-Krüger Coordinates 

In order to map parts of the curved Earth's surface onto the plane with an angle-preserving (conformal) 
mapping, at Gauss-Kriiger system first a partition into meridian zones is prepared. For Germany these 
mid-meridians are at 6°, 9°, 12°, and 15° east longitude (Fig. 3.87a). The origin of every meridian 
zone is at the intersection point of the mid-meridian and the equator. In the north-south direction the 
total range is to be considered, in the east-west direction a 1°40! wide strip on both sides. In Germany 
it amounts ca. +100 km. The overlap is 20’, which is here nearly 20 km. 


a 





R = 6371 km. (3.176) 
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rig t value! Rh gh áliie y-elongation The poeticis ed elongation ü 
x in the abscissa direction 
x - elon- (Fig. 3.87b) is the same as in 





[ gation [Ἢρ Soldner system (3.176). To 
keep the mapping angle-preser- 





Ενας | ving, the quantity b must be 
a) |-- 500 km 4 b) added to om 
b-——-. (3.177) 
Figure 3.87 6n 


3.4.1.3 Spherical Lune or Biangle 


Suppose there are two planes I'4 and La passing through the endpoints A and B of a diameter of the 
sphere and enclosing an angle o (Fig. 3.88) and so defining two great circles g; and go. The part of 
the surface of the sphere bounded by the halves of the great circles is called a spherical lune or biangle 
or spherical digon. 'T'he sides of the spherical biangle are defined by the spherical distances between A 
and B on the great circles. Both are 180°. 

As the angles of the biangle one defines the angles between the tangents of the great circles g; and gz at 
the points A and B. They are the same as the so-called dihedral angle a between the planes [; and Ty. 
If C and D are the bisecting points of both great circle arcs A and B , the angle o can be expressed as 
the spherical distance of C and D. The area Ap of the spherical biangle is proportional to the surface 
area of the sphere just as the angle a to 360°. Therefore the area is 


Απ [ρα 2 2R?a 
360° ο 








=2Rarca_ with the conversion factor g as in (3.175c). (3.178) 





Figure 3.88 Figure 3.89 Figure 3.90 


3.4.1.4 Spherical Triangle 


Consider three points A, B, and C on the surface of a sphere, not on the same great circle. Connecting 
every two of them by a great circle yields a spherical triangle ABC (Fig. 3.89). 

The sides of the triangle are defined as the spherical distances of the points, i.e., they represent the 
angles at the center between the radii 0A, 0B, and 0C. They are denoted by a, b, and c, and hereafter 
they are given in angle measure, independently of whether they are denoted at the center as angles or 
on the surface as great circle arcs. The angles of the spherical triangle are the angles between every two 
planes of the great circles. They are denoted by a, 8, and y. 

The order of the notation of the points, sides, and angles of the spherical triangle follows the same 
scheme as for triangles of the plane. A spherical triangle is called a right-sided triangle if at least one 
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side is equal to 90°. There is an analogy with the right-angled triangles of the plane. 
3.4.1.5 Polar Triangle 


1. Poles and Polar The endpoints of a diameter P, and P, are called poles, and the great circle g 
being perpendicular to this diameter is called polar (Fig. 3.90). The spherical distance between a pole 
and any point of the great circle g is 90°. The orientation of the polar is defined arbitrarily: Traversing 
the polar along the chosen direction there is a left pole to the left and a right pole to the right. 

2. Polar Triangle A'B'C' of a given spherical triangle ABC is a spherical triangle such that the 
vertices of the original triangle are poles for its sides (Fig. 3.91). For every spherical triangle ABC 
there exists one polar triangle A'D'C". If the triangle A'B'C" is the polar triangle of the spherical 
triangle ABC, then the triangle ABC is the polar triangle of the triangle A'B'C". 'The angles of a 
spherical triangle and the corresponding sides of its polar triangle are supplementary angles, and the 
sides of the spherical triangle and the corresponding angles of its polar triangle are supplementary 
angles: 


a’ = 180° —a, bv = 180° — β, c = 180° — 4, (3.179a) 


a’ = 180° —a, B’ = 180° —b, y —180? — c. (3.179b) 





Figure 3.91 Figure 3.92 


3.4.1.6 Euler Triangles and Non-Euler Triangles 


The vertices A, B,C of a spherical triangle divide every great circle into two, usually different parts. 
Consequently there are several different triangles with the same vertices, e.g., also the triangle with 
sides a’, b,c and the shadowed surface in Fig. 3.92a. According to the definition of Euler one should 
always choose the arc which is smaller than 180° as a side of the spherical triangle. This corresponds 
to the definition of the sides as spherical distances between the vertices. Considering this, all spherical 
triangles of whose sides and angles are less than 180° are called Euler triangles, otherwise they are called 
non-Euler triangles. In Fig. 3.92b there is an Euler triangle and a non-Euler triangle. 


3.4.1.7 Trihedral Angle 


This is a three-sided solid formed by three edges Sq, sj, s. starting at a vertex O (Fig. 3.93a). The 
angles a, b, c are defined as sides of the trihedral angle, every of them is enclosed by two edges. The 
regions between two edges are called the faces of the trihedral angle. The angles of the trihedral angle 
are a, B, and 7, the angles between the faces. If the vertex of a trihedral angle is at the center O of 
a sphere, it cuts out a spherical triangle of the surface (Fig. 3.93b). The sides and the angles of the 
spherical triangle and the corresponding trihedral angle are coincident, so every theorem derived for a 
trihedral angle is valid for the corresponding spherical triangle, and conversely. 
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Figure 3.93 


3.4.2 Basic Properties of Spherical Triangles 
3.4.2.1 General Statements 


For an Euler triangle with sides a, b, c, whose opposite angles are a, 3,7, the following statements are 
valid: 
1. Sumofthe Sides The sum of the sides is between 0? and 360°: 








0° <at+b+c< 360°. (3.180 
2. Sum of two Sides The sum of two sides is greater than the third one, e.g., 

a+b>c. (3.181 
3. Difference of Two Sides The absolute value of the difference of two sides is smaller than the 
third one, e.g., 

ja — b| < c. (3.182 
4. Sum ofthe Angles The sum of the angles is between 180° and 540°: 

180° < a + B +y < 540°. (3.183 
5. Spherical Excess The difference 

€ — a 4 f 4 y — 180? (3.184 
is called the spherical excess. 
6. Sum of Two Angles The sum of two angles is less than the third one increased by 180°, e.g., 

a+ p < y+ 180°. (3.185 


7. Opposite Sides and Angles Opposite to a greater side there is a greater angle, and conversely. 
8. Area The area Ar of a spherical triangle can be expressed by the spherical excess € and by the 
radius of the sphere R with the formula 
T Re 
Απ εἰ”. = — = R’ arce. 3.186a 
2 180° ο ( 


Here ϱ is the conversion factor (3.175c). From the theorem of Girard, with Ag as the surface area of the 
sphere, holds 


Ag 
---ε. 18601] 
7209 € (3.186b) 


If the sides are known and not the excess, then e can be calculated by the formula of L'Huilier (3.201). 





T= 


3.4.2.2 Fundamental Formulas and Applications 


The notation for the quantities of this paragraph corresponds to those of Fig. 3.89. 
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1. Sine Law 


sina sina (3.1872) sinb = Sap. (3.187b) one = any (3.187c) 


sinc siny sina sina’ 








sinb sing’ 
The equations from (3.1872) to (3.187c) can also be written as proportions, i.e., in a spherical triangle 
the sines of the sides are related as the sines of the opposite angles: 


sina sinb sinc 











= = ] 3.187d 
sina sinf siny ( ) 
The sine law of spherical trigonometry corresponds to the sine law of plane trigonometry. 
2. Cosine Law, or Cosine Law for Sides 
cosa = cosb cosc + sin bsin ccosa, (3.188a) cosb = cos ccosa + sin csina cos B, (3.188b) 
cos c = cos a cos b + sin a sin b cos 7. (3.188c) 


The cosine law for sides in spherical trigonometry corresponds to the cosine law of plane trigonometry. 
From the notation one can see that the cosine law contains the three sides of the spherical triangle. 
3. Sine-Cosine Law 
sina cos f — cos bsinc — sinbcosccosa, (3.189a) 
sin a cos y = cos csin b — sin ccos bcos a. (3.189b) 
One can get four more equations by cyclic change of the quantities (Fig. 3.34). 
The sine-cosine law corresponds to the projection rule of plane trigonometry. Because it contains five 
quantities of the spherical triangle it is not used directly for solving problems of spherical triangles, but 
it is used for the derivation of further equations. 


4. Cosine Law for Angles of a Spherical Triangle 


cosa = — cos b cos y + sin £ sin y cosa, (3.190a) 
cos B — — cos y cos a + sin y sin a cos b, (3.190b) 
cos y = — cos a cos 8 + sin a sin 8 cos c. (3.190c) 


This cosine rule contains the three angles of the spherical triangle and one of the sides. With this 
law one can easily express an angle by the opposite side with the angles on it, or a side by the angles; 
consequently every side can be expressed by the angles. Contrary to this, for plane triangles the third 
angle is calculated from the sum of 180°. 
Remark: It is not possible to determine any side of a plane triangle from the angles, because there are 
infinitely many similar triangles. 
5. Polar Sine-Cosine Law 
sin a cos b = cos f sin y + sin £ cos y cosa, (3.191a) 
sin a cos c — cos sin f 4- sin ^ cos f cos a. (3.191b) 
Four more equations can be get by cyclic change of the quantities (Fig. 3.34). 
Just as for the cosine law for angles, also the polar sine-cosine law is not usually used for direct calcu- 
lations for spherical triangles, but to derive further formulas. 
6. Half-Angle Formulas 
To determine an angle of a spherical triangle from the sides one can use the cosine law for sides. The 


half-angle formulas allow us to calculate the angles by their tangents, similarly to the half-angle for- 
mulas of plane trigonometry: 





Q 


sin(s — b) sin 
tan = = ( ) ( 





sin(s — c) sin(s — a) (3.192b) 
------------: (3492b 


sin s sin(s — a) sin s sin(s — b) 


3.4 Spherical Trigonometry 167 








sin(s — a) sin(s — b) a+b+c 


.192 5 = 
, (3.192c) 8 " 


(3.1924) 


sin s sin(s — c) 


If from the three sides of a spherical triangle all the three angles should be determined, then the follow- 
ing calculations are useful: 
a k B k 
tanc = ————_., «1958 tan = = — 193: 
al 5 me (3.193a) an 5 du c (3.193b) 


tan? = ME M with (3.193c) 
2.  sin(s—c) 


a+b+c 
, (3.193d) s=, (3.193e) 


sin(s — a)sin(s — b) sin(s — c) 





sins 


7. Half-Side Formulas 
With the half-side formulas it is possible to determine one side or all three sides of a spherical triangle 
from its three angles: 














s(c — s(c — b s(c — s(c — 
gop =| Oe = P) odio Y) t3 iga) ο ος ος ως 
2 — cosa cos(a — a) 2 — cosa cos(a — f) 
σος τς Costa =a) coo p); (3.194c) o= ΜΕ (3.194d 
2 — cosa cos(a — y) 2 
or 
a k' b k' 
οςς--------- 1959 ἲπ-----π: 195: 
B cos(c — a)' poH) na cos(c — B)' pM 
c " . 
cot a= ib with (3.195c 
i. cos(c — a) cos(c — B) cos(o — D (3.1954) o- 2t B+ n (3.195e 
— cosa 2 
Since for the sum of the angles of a spherical triangle according to (3.183): 
180° < 20 < 540° or 90° < ø< 270° (3.196 





holds, cos ø < 0 must always be valid. Because of the requirements for Euler triangles all the roots are 
real. 

8. Applications of the Fundamental Formulas of Spherical Geometry 

With the help of the given fundamental formulas, for instance distances, the azimuth, and course angles 
on the Earth can be determined. 

W A: Determine the shortest distance between Dresden (A; = 13°46', yı = 51°16) and Alma Ata 
(Az = 76°55’, p2 = 43°18’). 

Solution: The geographical coordinates (A1, v1), (A2, 92) and the north pole N (Fig. 3.94) result in 
two sides of the triangle P| PN a — 90? — ye and b = 90° — ¢ lying on meridians, and also the angle 
between them y = Ap — Ay. For c = e it follows from the cosine law (3.188a) 


cosc = cosa cosb + sin a sin b cos y, 
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Figure 3.94 Figure 3.95 


cose — cos(90? — q1) cos(90* — p2) + sin(90° — y1) sin(90° — ρϱ) σοβ(λα -- λι) 
= sin y1 SİN Y2 + cos q cos qo cos(Ao — A1), (3.197) 


i.e., cose = 0.53498 4- 0.20567 — 0.74065,e — 42.213. The great circle segment P, P) has length 
4694 km using (3.1752). 

W B: Calculate the course angles 01 and 69 at departure and at arrival, and also the distance in sea miles 
of a voyage from Bombay (A, = 72°48’, , = 19°00’) to Dar es Saalam (Ag = 39?28', 4 — —6?49") 
along a great, circle. 

Solution: The calculation of the two sides a = 90° — yi = 71°00',b = 90° — yo = 96°49’ and the 
enclosed angle y = Ay — Az = 33°20’ in the spherical triangle P, P,N with the help of the geographical 
coordinates (A1, 91), (A2, 2) (Fig. 3.95) and the cosine law (3.188c) cose = cose = cosacosb + 


sina sin b cos yields Pj P5— e = 41.777°, and because 1’ + 1 sm follows Pj P52: 2507 sm. 
With the cosine law for sides (3.188a) 
cosa — cosbcosc cosb — cosa cosc 
a = arccos 552 — 557.074 — 51.248". and f — arccos “= 125.018". 
sinbsinc sinasinc 

"Therefore, the results are 

61 = 360? — 8 = 234.982? and 62 = 180° + a = 231.248". 
Remark: It makes sense to use the sine law to determine sides and angles only if it is already obvious 
from the problem that the angles are acute or obtuse. 


3.4.2.3 Further Formulas 


1. Delambre Equations 
Analogously to the Mollweide formulas of plane trigonometry the corresponding formulas of Delambre 
are valid for spherical triangles: 


























α-β . a+b .a-f . a—b 
COS sin sin sin 
— $— —- — 32, (3.1982) 2—- —2-, (3.198b) 
sin 9 sin 9 COS 9 sin 2 
αβ a+b ο αξβ a—b 
COS COS sin COS 
--ᾱ-----ᾱ-, (311980) —— (3.198d) 
sin 9 COS 2 COS 9 COS 9 


Since for every equation two more equations exist by cyclic changes, altogether there are 12 Delambre 
equations. 
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2. Neper Equations and Tangent Law 
































. a—b cos 2 b 
α-β πο Y . ac m 2 y 
tan 2 7E cot 5 =, (9.199) tan GU SIE cot 5) (3.199b) 
sin cos —— 
2 
.α-ῇβ cos 8 B 
α-ὐ ΠΩ a+b So ο 
tan g 8 tan 5 =, (3.199c) tan αν πε τρ tan 5 (3.199d) 
sin — cos — 


These equations are also called Neper analogies. From these one can derive formulas analogous to the 
tangent law of plane trigonometry: 






































= —g b—c = 
tan E tan 2 2 n tan E tan B 2 7 
——— TM€———À y 3.200a) E a (3.200b) 
b Y , ( a , 
tan EY tan ΚΕ tan tan βΕΎ 
2 2 
i c—a t α 
an jan 
2 2 s 
ota FFU (3.200c) 
tan tan 
2 2 


3. L'Huilier Equations 

The area of a spherical triangle can be calculated with the help of the excess e , either from the known 
angles a, 3, y according to (3.184), or if the three sides a, b, c are known, with the formulas about the 
angles from (3.193a) to (3.193e). The L'Huilier equation makes possible the direct calculation of c from 
the sides: 







s—b 6-6 
7 5 (3.201) 


This equation corresponds to the Heron formula of plane trigonometry. 


an ben? ton ἃ 
an— = 4/tan — tan 
4 2 2 


9.4.5 Calculation of Spherical Triangles 


3.4.3.1 Basic Problems, Accuracy Observations 

The different cases occurring most often in calculations of spherical triangles are arranged into so-called 
basic problems. For every basic problem for acute-angled spherical triangles there are several ways to 
solve it, and it depends on whether the calculations are based only on the formulas from (3.187a) to 
(3.191b) or also on the formulas from (3.192a) to (3.201), and also whether one is looking for only one 
or more quantities of the triangle. 

Formulas containing the tangent function yield numerically more accurate results, especially in com- 
parison to the determination of defining quantities with the sine function if they are close to 90°, and 
with the cosine function if their value is close to 0° or 180°. For Euler triangles the quantities calculated 
with the sine law are bi-valued, since the sine function is positive in both of the first quadrants, while 
the results obtained from other functions are unique. 


3.4.3.2 Right-Angled Spherical Triangles 


1. Special Formulas 

In a right-angled spherical triangle at least one of the angles is 90°. The sides and angles are denoted 
analogously to the plane right-angled triangle. If as in Fig. 3.96 y is a right angle, the side c is called 
the hypotenuse, a and b the legs and a and £ are the leg angles. From the equations from (3.187d) to 
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(3.191b) it follows for y = 90°: 


sina = sin a sin c, (3.202a) sin b — sin f sin c, (3.202b) 
cos € = cos a cos b, (3.202c) cosc = cot a cot b, (3.202d) 
tana = cos B tan c, (3.2020) tanb = cosa tan c, (3.202f) 
tanb — sinatan f, (3.202g) tana = sin btana, (3.202h) 
cosa = sin P cosa, (3.2021) cos f — sina cos b. (3.2027) 


TT UN 
909-b 90a 


- 


Figure 3.96 Figure 3.97 


If in certain problems other sides or angles are given, for instance instead of a, 3, y the quantities b, 7, a, 
the necessary equations can be get by cyclic change of these quantities. For calculations in a right- 
angled spherical triangle one usually starts with three given quantities, the angle y = 90° and two 
other quantities. There are six basic problems, which are represented in Table 3.8. 





Table 3.8 Defining quantities of a spherical right-angled triangle 





























Basic Given defining Number of the formula to 
problem | quantities determine other quantities 

| 1. Hypotenuse and a leg c, a a (3.202a), 8 (3.202€), b (3.202c) 

| 23 Two legs a,b a (3.202h), 8 (3.202g), c (3.202c) 

| 3. Hypotenuse and an angle c, a a (3.202a), b (3.202f), 6 (3.202d) 

| 4. Leg and the angles on it a, 8 c (3.2026), b (3.202j), o (3.2021) 

| 5. Leg and the angle opposite to it a, a | b (3.202h), c (3.202a), 8 (3.2021) 

| 6. Two angles a, £8 a (3.2021), b (3.202j), c (3.202d) 





2. Neper Rule 
The Neper rule summarizes the equations (3.202a) to (3.202)). If the five determining quantities of a 
right-angled spherical triangle, not considering the right angle, are arranged along a circle in the same 
order as they are in the triangle, and if the legs are replaced by their complementary angles 90° — a, 
90° — b, (Fig. 3.97), then the following is valid: 
1. The cosine of every defining quantity is equal to the product of the cotangent values of its neighboring 
quantities. 





2. The cosine of every defining quantity is equal to the product of the sine values of the non-neighboring 
quantities. 


HB A: cosa = cot(90° — b) cotc = 


tan 





i (see (3.202f) . 
anc 


W B: cos(90? — a) — sincsino — sina (see (3.202a)). 


E C: Map the sphere given by a grid, onto a cylinder which contacts the sphere along a meridian, the 
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so-called central meridian. This meridian and the equator form the axis of the Gauss-Krüger system 
(Fig. 3.98a,b). 





Figure 3.98 Figure 3.99 


Solution: A point P of the surface of the sphere will have the corresponding point P’ on the plane. 
The great circle g passing through the point P perpendicular to the central meridian is mapped into 
a line g’ perpendicular to the x-axis, and the small circle k passing through P parallel to the given 
meridian becomes a line k’ parallel to the x-axis (grid meridian). The image of the meridian m through 
P is not line but a curve m’ (true meridian). The upward direction of the tangent of m’ at P' gives the 
geographical north, the upward direction of k' gives the grid north direction. The angle y between the 
two north directions is called the meridian convergence. 

In a right-angled spherical triangle QPN with c = 90° — ọ, and b = ņ one gets y from a = 90? — η. 
tanb πο...” tann 
tanc tan(90° — y) 
y and η are mostly small, one can consider siny ~% y, tann ~% 1; consequently y = ntan y is valid. 
The length deviation y of this cylinder sketch is pretty small for small distances 7, and so one can 
substitute 7 = y/R, where y is the ordinate of P. This yields y = (y/R)tany. The conversion of 7 
from radian into degree measure yields a meridian convergence of y — 0.018706 or 4 — 1*04'19" at 
p = 50°, y = 100 km. 


3.4.3.3 Spherical Triangles with Oblique Angles 


For three given quantities, there is to be distinguished between six basic problems, just as it was done 
for right-angled spherical triangles. The notation for the angles is a, 8, y and a,b, c for the opposite 
sides (Fig. 3.99). 

Tables 3.9, 3.10, 3.11, and 3.12 summarize, which formulas should be used for which defining quan- 
tities in the case of the six basic problems. Problems 3, 4, 5, and 6 can also be solved by decomposing 
the general triangle into two right-angled triangles. To do this for problems 3 and 4 (Fig. 3.100, 
Fig. 3.101) one can use the spherical perpendicular from B to AC to the point D, and for problems 5 
and 6 (Fig. 3.102) from C to AB to the point D. 





The Neper rule yields cosa = ,siny = tangtanq«. Because 





Figure 3.100 Figure 3.101 Figure 3.102 
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In the headings of Tables 3.9, 3.10, 3.11, and 3.12 the given sides and angles are denoted by S and 
W respectively. This means for instance SWS: Two sides and the enclosed angle are given. 


Table 3.9 First and second basic problems for spherical oblique triangles 






























































First basic problem Second basic problem 
Given: 3 sides a, b, c SSS Given: 3 angles a, 3, 7 WWW 
Conditions: Conditions: 
0° <a+b+c < 360°, 180° < æa + 8 + y < 540°, 
a+b>c,a+c>b, b+c>a. o 4 B « 180? t- y, o o y « 180? 4 B, 
B vy « 180? +a. 
Solution 1: Required a . Solution 1: Required a . 
cosa — cosbcosc cos a: 4- cos f) cos ^ 
COS Q — ———— ——— — or COS q — ——— ——— — — 
sinbsinc sin f sin y 
tan 2 = sin(» - 5) sin(s — c) cot = , | COST = B) cos(o — 7) 
2 sin s sin(s — a) 2 — cosa cos(o — a) 
a+b+c a+8+y 
νε... Ὁ ι΄ 
Solution 2: Required a, 8, 7. Solution 2: Required a, b, c. 
pa sin(s — a)sin(s — b) sin(s — c) Ee cos(c — a) cos(c — B) cos(e — y) 
E sin s , — cos o ` 
tan € k i B k κα k! k' 
an , tan= = — , | cot= = , cot = ; 
2  si(s—a)' 2  sin(s—b) 2  cos(c — o) 2  cos(c — B) 
tan 2 ες a 
a> = —— . cot = = : 
2  sin(s—c) 2  cos(c — y) 
Checking: (s—a)-(s—6)--(s—c) ^ s, | Checking; (c —o)- (c — 8)--(c—»y) —oc, 
tan S tan 5 tan ; sins = k. cot 2 cot 2 cot (= cosa) =k’. 














WŒ ^A Tetrahedron: A tetrahedron has base ABC and vertex S 
(Fig. 3.103). The faces ABS and BCS intersect each other at an angle 
B = 74°18’, BCS and CAS at an angle y = 63°40’, and CAS and ABS 
at a = 80°00’. How big are the angles between every two of the edges 
AS, BS, and CS? 

Solution: From a spherical surface around the vertex S of the pyramid, 
the trihedral angle (Fig. 3.103) cuts out a spherical triangle with sides 
a, b, c. 

'The angles between the faces are the angles of the spherical triangle, 
the angles between the edges, which are searched for, are the sides. The 
determination of the angles a, b, c corresponds to the second problem. 
Solution 2 in Table 3.9 yields: 

c — 108?59', o — a = 28°59, o — B = 34°41’, o — y — 45.19’, κ’ 
1.246983, cot(a/2) = 1.425514, cot(b/2) = 1.516440, cot(c/2) 
Figure 3.103 1.773328. 





B D Radio Bearing: In the case of a radio bearing two fixed stations Pj(À1, 1) and P2(A2, 3) 
receive the azimuths ô, and 69 via the radio waves emitted by a ship (Fig. 3.104). The task is to 
determine the geographical coordinates of the position Pp of the ship. The problem, known by marines 
as the shore-to-ship bearing, is an intersection problem on the sphere, and it will be solved similar to the 
intersection problem in the plane (see 3.2.2.3, p. 148). 

1. Calculation in triangle Pj; PN: In triangle P, P5N the sides Pj,N — 90? — (1, P4N = 90° — » and 
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the angle X PN P) — à — A1 — AA are given. The calculations of the angle X: £1, £2 and the segment 
P, P, = e correspond to the third basic problem. 


Table 3.10 Third basic problem for spherical oblique triangles 
















































































Third basic problem SWS 
Given: 2 sides and the enclosed angle, e.g., a, b, ^ 
Condition: none 
. a=b 
. . α- ϐ Sin—z— 
Solution 1: Required c, or canda. | tan Ὁ 2. cot T 
cosc = cosa cosb + sin asin bcos y, 2 sin d +6 2 
: sinasin y 
sina — —————. α-β 
in d Te (—90° < < 90°) 
a can be in quadrant I. or I. , A D ο 
We apply the theorem: a+8 πο 3 , p= a j β α 5 B 
Larger angle Is opposite Solution 4: Required a, 8, c. 
to the larger side or db y 
checking calculation: , an +B cos —,— Cos 5 UE 
CMM a in q. T: an aab 2 =N? 
cosa i cos S i = a in q. TT. dos sin 
Solution 2: Required a, or a and c. 2 
tanu = vana tos amp sin cos 7 z 
tan ysin u tan = A =— 
tana = —~——_ 2 d LO N 
sin(b — u) sin sin j 
tan(b — u a+B à 
tanc — ΤΗΞΗΣ (—90° < ^ « 907) 
Cos a 
Solution 3: Required o and (or) 2. at+B qe B j +o Ὡ-β 
a—b ees ο η m 
at cos . ; 
tan f 2L lexl. 60g = E Snc M . 
oth 3 2. atp? 2 .a-B' 
os 2 sin sin — 
Checking: Double calculation of c. ~ 














2. Calculation in the triangle P; PP: Because £1 — 
61 — £1, £9 — 360? — (05 4- £3) , the side e and the angles 
lying on it £4 and £9 are known in P, P5 P». Calculations 
of the sides e and eg correspond to the fourth basic prob- 
lem, third solution. The coordinates of the point Pù can 
be calculated from the azimuth and the distance from P, 
or P. 

3. Calculation in the triangle N P,P: In the triangle 
N P,P there are given two sides N P, — 90?—1, Pi Po = 
e; and theincluded angle à. Theside N Pj — 90?—4 and 
the angle AA, are calculated according to the third basic 
problem, first solution. For checking one can calculate in 
the triangle N Po P2 for asecond time N Py = 90° — o, and 
also AAg. Consequently, the longitude Ag = Ay + AA; = 
Figure 3.104 À» — A2 and the latitude qo of the point Py are known. 
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3.4.3.4 Spherical Curves 
Spherical trigonometry has a very important application in navigation. One basic problem is to deter- 
mine the course angle, which gives the optimal route. Other fields of application are geodesic surveys 
and also robot-movement planning. 

Table 3.11 Fourth basic problem for spherical oblique triangles 













































































Fourth basic problem WSW 
Given: One side and two adjacent angles, e.g., a, B, € 
Condition: none 
Solution 1: Required 7, or y and a. 
cos y = — cos a cos f) 4- sina sin B cos c, 
. : a—b 
2 sin csin a (—90° < < 90°), 
sina — ——— —. 
Sm y a+b α-ὖ a+b a—b 
a can be in quadrant I or II. We ap- a= H , b= 9 : 
ply the theorem: The larger side is in | Solution 4: Required a, b, y. ^ 
opposite to the larger angle or checking α-β. ο 
calculation: a+b «08 sm> 7 
ain q. I. tan x: 2 2 ar? 
cosa + cos B cosy Z0 — win q. II. 2 qu ETE VUE N 
Solution 2: Required a, or a and y. 3 2 B 2 
tan c cos S sin — 
cot = tanacose, tana = STE I iud po BE Wy CAN | A 
cos(ĝ — u) ` | tan —;— = — a+B c N 
cot(8 — u sin s= 
ΗΝ ΠΟΘΕΝ ἃ a? 
cosa o o 
Solution 3: Required a and (or) b. (90° < < 90°), 
a α-β gb wb b a+b a-—b 
S 2 α-- , = 2 
tan T a2 tan i ` 2 2 2 2 
a+b 5 oy Z γ σι 
cos sin — = ———__, cos— = ——_... 
2 . a+b 2 . a—b 
Q0 — B sin sin 
m T d - Checking: Double calculation of 
tan Ξ —aB tans necking: Double calculation of y. 
sin — 














1. Orthodrome 
1. Notion The geodesic lines of the surface of the sphere — which are curves, connecting two points 
A and B by the shortest path — are called orthodromes or great circles (see 3.4.1.1, 3., p. 160). 
2. Equation ofthe Orthodrome Moving on an orthodrome— except for meridians and the equator 
— needs a continuous change of course angle. These orthodromes with position-dependent course angles 
a can be given uniquely by their point closest to the north pole Py(An, Yn), where pyn > 0° holds. The 
orthodrome has the course angle ay = 90° at the point closest to the north pole. The equation of the 
orthodrome through Py and the running point Q(A, y), whose relative position to Py is arbitrary, can 
be given by the Neper rule (3.4.3.2,2., p. 170) according to Fig. 3.105 as: 

tan yy cos(A — Ay) = tan y. (3.203) 
Point Closest to the North Pole: The coordinates of the point closest to the north pole PN(Aw, ὧν) 
of an orthodrome with a course angle a4 (a4 4 0°) at the point A(A4, Ya) (ya 4 90°) can be calcu- 
lated by the Neper rule (3.4.3.2,2., p. 170) considering the relative position of Py and the sign of a4 
according to Fig. 3.106 as: 


ta 
Qw — arccos(sin|a4|cosq4) (3.204a) and Ay = A4+sign(a4) |arccos τη . (3.204b) 
an pn 
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Remark: 


If a calculated geographical distance A is not in the domain —180° < \ < 180°, then for 


A #+k-180° (k € IN) the reduced geographical distance Aye is: 











λ 
Area = 2 arctan (ων 3) : (3.205) 
This is called the reduction of the angle in the domain. 
Table 3.12 Fifth and sixth basic problems for a spherical oblique triangle 
Fifth basic problem SSW Sixth basic problem WWS 











Given: 2 sides and an angle opposite 
to one of them, e.g., a, b, a 











Given: 2 angles and a side opposite 
to one of them, e.g., a, a, 3 





Conditions: See distinction of cases. 


Conditions: See distinction of cases. 





Solution: Required is any missing quantity. 
. ,  sinbsina 
sinp = == 
sina is 
Let βι be acute and 5 — 180* — £1 obtuse. 
Distinction of cases: 
1 sinbsina 


- >1 0solution. 
sina 
sin bsi 
SinoSIDO 1 4 solution f = 90°. 
sina 
sinbsina 
de ——— —— «14 
sina 
3.1. sina-sinb: 
3.1.1. b «90? Isolution βι. 
3.1.2. b 90? Isolution f. 
3.2. sina <sinb: 
334. * < 90° ,a < 90° } 2 solutions 
UU" aq »90?,o > 90° P3, B2- 
a < 90° ,a > 90° "m 
3.2.2. a > 90° a < 90° ) 0 solution. 


Further calculations with one angle 
or with two angles f: 





2 values 3, 32 are possible. 





Solution: Required is any missing quantity. 
sinasin fj 


sinb = 2 values bi, bg are possible. 


sina 

Let b be acute and by = 180° — 6, obtuse. 
Distinction of cases: 

1 sina sin f 


21 Osolution. 


sina 
NU ὦ 
sinasin 8 


- —] 1 solution b = 90°. 
sina 
sinasin 8 

g =— s 

sina . 
3.1. sino sin. 
3.1.1. 6 <90° Isolution by. 
3.1.2. B > 90° 1 solution bo. 
32. sinacsinf: 
334. * < 90° ,a < 90° } 2 solutions 

“a> 90°,a > 90° by, ba. 

399 a < 90? ,o » 90? 


a 7 909 a < 90° } 0 solution. 


Further calculations with one side 
or with two sides b: 
























































Method 1: Method 2: 

+ ,— b 
tanu = tanbcosa, č MN ος y a + BOS 
tanv — tanacos f, tan- — tan 2 a-p’ tan 2^ cot 3 . a+b’ 
e =u+v, cos 9 sin 
cot p = cosbtana, sin? +8 sinf b 
cot yY = cosa tan b, pe CE 2. Log EE 2 
Y -etu. 2 E 2 ja 

2 2 

Checking: Double calculation of 5 and 2. 








Intersection Points with the Equator: The intersection points Pg, (Ag, , 0?) and Pg, (Ag,, 0?) of the 
orthodrome with the equator can be calculated from (3.203) because tan ew cos(Ag, — An) = 0 (v = 
1,2) must hold: 


λε, = AN = 90° (v = 1,2). 





(3.206) 
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Figure 3.105 Figure 3.106 


Remark: In certain cases an angle reduction is needed according to (3.205). 
3. Arclength If the orthodrome goes through the points A(A4, c4) and B(Ap, vg), the cosine law 





for sides yields for the spherical distance d or for the arclength between the two points: 
d — arccos|[sin ρα sin yg + cos y4 cos ep cos(Ag — A4)]. (3.2072) 
This central angle can be converted into a length considering the radius of the Earth R: 
: ' R 
d — arccos[sin ρα sin qp + cos Ya Cos op cos(Ag — A4)]- vane (3.207b) 


4. Course Angle Using the sine and cosine laws for sides to calculate sin a4 and cos a4, gives the 
final result for the course angle a4 after division: 


COS QA cos yp sin(Ag — Aa 
Qa = arctan cos p4 cos ps sin(Ag — àa) 


3.208 
sin yg — sin y4 cos d ( ) 


Remark: With the formulas (3.207a), (3.208), (3.204a) and (3.204b), the coordinates of the point Py 
closest to the north pole can be calculated for the orthodrome given by the two points A and B. 

5. Intersection Point with a Parallel Circle For the intersection points X4(Ax,, px) and X2(Ax,, 
yx) of an orthodrome with the parallel circle y = yx one gets from (3.203): 

tan px 
tan Yn 





Àx, = An F arccos 





v=1,2). (3.209) 


From the Neper rule (3.4.3.2,2., p. 170) used for the intersection angles ax, and ax,, by which an 
orthodrome with a point Py(An, Yn) closest to the north pole intersects the parallel circle p = px: 


lax,| = arcsin = PN (v — 1,2). (3.210) 





ΦΧ 
The argument in the arc sine function must be extremal with respect to the variable x for the minimal 
course angle |Qmin|. This gives: singx — 0 — x — 0, ie., the absolute value of the course angle is 
minimal at the intersection points of the equator: 

[ox] — 90? — py. (3.211) 
Remark 1: Solutions of (3.209) exist only for |gx| € ew. 
Remark 2: In certain cases a reduction of the angle is needed according to (3.205). 
6. Intersection Point with a Meridian For the intersection point Y (Ay, yy) of an orthodrome 
with the meridian \ = Ay according to (3.203) is given by 

yy = arctan[tan ew cos(Ay — Ax)]. (3.212) 
2. Small Circle 
1. Notion The definition of a small circle on the surface of the sphere must be discussed here in more 
detail than in 3.4.1.1, p. 160: The small circle is the locus of the points of the surface (of the sphere) 
at a spherical distance r (r « 90?) from a point fixed on the surface M(Aw, pm) (Fig. 3.107). The 
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spherical center is denoted by M; r is called the spherical radius of the small circle. 

The plane of the small circle is the base of a spherical cap with an altitude h (see 3.3.4, p. 158). The 
spherical center M is above the center of the small circle in the plane of the small circle. In the plane 
the circle has the planar radius of the small circle ro (Fig. 3.108). Hence, parallel circles are special 
small circles with yyy = +90°. 

E For r — 90? the small circle tends to an orthodrome. 









Sp 


small circle plane 





Figure 3.107 Figure 3.108 


2. Equations of Small Circles As defining parameters either M and r can be used or the small 
circle point Py(An, Yn) nearest the north pole and r. If the running point on the small circle is Q(A, v), 
one gets the equation of the small circle with the cosine law for sides corresponding to Fig. 3.107: 
cosr = sin ysin Ym + cos y cos yyy cos(A — Ay). (3.213a) 
From here, because of py = Yn — r and Ay = An, 
cosr — sing sin(gw — r) ἠ- cos o cos(gw — r) cos(A — Ax). (3.213b) 
E A: For ym = 90° the parallel circles are obtained from (3.213a), since cosr = siny = sin(90° — 
r) =siny =>  — const. 
W B: Forr — 90° follow orthodromes from (3.213b). 
3. Arclength The arclength s between two points A(A4, pa) and B(Ag, op) on a small circle k can 
2πτρ 
360°’ 


cosd = cos? r + sin? r cos o 





be calculated corresponding to Fig. 3.108 from the equalities f= 
σ 
and rg — Rsinr: 
I cosd—cos?r mR 
s = sin r arccos ——_,——_- ) 
sin“ r 180° 

W For r — 90? the small circle becomes an orthodrome, and from (3.214) and (3.207b) it follows that 
s — d. 
4. Course Angle According to Fig. 3.109 the orthodrome passing through A(A4, o4) and M (Ay, 
pm) intersects perpendicularly the small circle with radius r. For the course angle orth of the ortho- 
drome after (3.208) 


(3.214) 





cos p4 cos Ym Sin(Àm — À 
Qorth = arctan z FA SPM gi M 4) (3.215a) 
sin yu — sin ya cosr 
is valid. So, one gets for the required course angle o4 of the small circle at the point A: 
o4 — ([eosa| — 90) sign(ao). (3.215b) 
5. Intersection Points with a Parallel Circle For the geographical longitude of the intersection 
points X1(AÀx,, ex) and Xo(Ax,, ex) of the small circle with the parallel circle o = yx follows from 
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Figure 3.109 Figure 3.110 
(3.213a): 


Ax, Ay arccos EG =. (3.216) 
COS (x COS QM 





Remark: In some cases an angle reduction is needed according to (3.205). 


6. Tangential Point The small circle is touched by two meridians, the tangential meridians, at the 
tangential points T\(Ar,, pr) and To(Anm, pr) (Fig. 3.110). Because for them the argument of the arc 
cosine in (3.216) must be extremal with respect to the variable yx, holds: 
sin y cosr — sin Yy sin 
Qr — arcsin 22 TM. (35179) An, = Am F arccos —— —STYXEMPM (y 1,2). (3.217b) 
cosr COS Yy COS PM 








Remark: In certain cases an angle reduction is needed according to (3.205). 


7. Intersection Points with a Meridian The calculation of the geographical latitudes of the in- 
tersection points Yı (Ay, py, ) and Y2(Ay, yy,) of the small circle with meridian À — Ay can be done 
according to (3.213a) with the equations 


C+ BV A? + B2 — C2 








py, = arcsin — ALB (v — 1,2), (3.2182) 
where the following notations are used: 
A — sin gy, B — cos ey cos(Ay — àm), C —-—oosr. (3.218b) 


For A? + B? > C?, in general, there are two different solutions, from which one is missing if a pole is 
on the small circle. 

If A? 4- B? — C? holds and there is no pole on the small circle, then the meridian touches the small 
circle at a tangential point with geographical latitude yy, = yy, = vr. 


3. Loxodrome 

1. Notion A spherical curve, intersecting all meridians with the 
same course angle, is called a loxodrome or spherical helix. So, par- 
allels (a = 90°) and meridians (a — 0?) are special loxodromes. 

2. Equation of the Loxodrome Fig. 3.111 shows a loxodrome 
Q(A,@) \ with course angle a through the running point Q(A, y) and the in- 
finitesimally close point P(A+dA, p+dy). The right-angled spher- 
ical triangle QC P can be considered as a plane triangle because of 
Figure 3.111 its small size. Then: 






— 


A*dA,q*dqo) 
go 


cos dA ta 1 
tana = a => d\= κ 


3.219a 
Rdy COS (p (3-2198) 
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Considering that the loxodrome must go through the point A(A4, ya), therefore, the equation of the 
loxodrome follows by integration: 


sop E 
tan (45 + £) 180° 
tan (45° + 24) © 


In particular if A is the intersection point Pg(Ag, 0?) of the loxodrome with the equator, then: 





A— A, = tana ln (a £ 90°). (3.219b) 


à — àg = tana ln tan (45° + 2) ul (a # 90°). (3.219c) 
T 


Remark: The calculation of Ag can be done with (3.224). 
3. Arclength From Fig. 3.111 one can find the differential relation 


d d 
cosa = zm => ds= a i (3.2202) 
ds cos a 





Integration with respect to y results in the arclength s of the arc segment with the endpoints A(A4, Ya) 
and B(Ag, op): 


<a een val TR 

cosa 180° 

If A is the starting point and B is the endpoint, then from the given values A, a and s can be determined 
step-by-step first yg from (3.220b), then Ag from (3.219b). 

Approximation Formulas: According to Fig. 3.111, with Q — A and P = PB one can get an 


approximation for the arclength / with the arithmetical mean of the geographical latitudes with (3.221a) 
and (3.221b): 





(a £ 90°). (3.220b) 





i = P Qa - M) TR (201 fee =- As — Aa) TE. B21 
cin 1 “180° 199519) E sina (As = àa): 180° ` po 


4. Course Angle For the course angle a of the loxodrome through the points A(A4, 4) and 
B(An.n), or through A(A4, v4) and its equator intersection point Pg(Ag, 0?) according to (3.219b) 
and (3.219c) the following holds: 


(AB — àa) . ( 
tan (45° + £2) 180° 


In —— A 
tan (45° + τα 


a = arctan 3.222a) 
(Aa = Ag) T 


eo ens (8-222) 
Intan (45° + 2a) 180 


a = arctan 


5. Intersection Point with a Parallel Circle Suppose a loxodrome passes through the point 
Α(λα, ρα) with a course angle a. The intersection point X(Ax, yx) of the loxodrome with a paral- 
lel circle y = yx is calculated from (3.219b): 


ο ΦΧ 
tan (45 + £x) 180? 
tan (ο: + ga) m 
2 
With (3.223) the intersection point with the equator Pg(Ag, 0°) is calculated as 


fa) E 
2 π 


Ax =Aat+ tana - In (a 90"). (3.223) 


Ag — A4 — tana - Intan e t (a £ 90°). (9.224) 
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Remark: In certain cases an angle reduction is needed according to 
(3.205). 

6. Intersection Point with a Meridian Loxodromes - except par- 
allel circles and meridians — wind in a spiral form around the pole 
(Fig. 3.112). The infinitely many intersection points Y,(Ay, Yy, ) 
(v € Z) of the loxodrome passing through A(A4, ya) with course angle 
a with the meridian À = Ay can be calculated from (3.219b): 


Ay àa + ν- 9605 OT 
tana 180° 





Qy, — 2arctan {ow | 





dn (45° + £4)\ -90° (vez). (3.225) 


Figure 3.112 


If A is the equator intersection point Pg(Ag, 0?) of the loxodrome, then simply holds 
ày — Ag r-:360 m 
tana 180° 





py, = 2 arctan exp | | — 90° (v € 2). (3.226) 
4. Intersection Points of Spherical Curves 


1. Intersection Points of Two Orthodromes Suppose the considered orthodromes have points 
Py, (An,, 9n,) and Py, (Anz, ον.) Closest to the north pole, where Py, # Pw, holds. Substituting the 
intersection point S(As, ys) in both orthodrome equations gives the system of equations: 


tan yy, cos(Ag — An,) = tans, (3.227a) tan yy, cos(Ag — An,) — tans. (3.227b) 


Elimination of yg and using the addition law for the cosine function yields: 


tan (yn, cos An, — tan ~n, COS Ày. 
tan As _ Ni 1 VN» 2 





' 3.228 
tan yy, sin Ay, — tan yy, sin An, ( ) 
The equation (3.228) has two solutions Ag, and Ag, in the domain —180° < A < 180° of the geographical 
longitude. The corresponding geographical latitudes can be got from (3.227a): 

ps, = arctan[tan yy, cos(As, — An, )] (v — 1,2). (3.229) 
The intersection points Sı and 5S3 are antipodal points, i.e., they are the mirror images of each other 
with respect to the centre of the sphere. 
2. Intersection Points of Two Loxodromes Suppose the considered loxodromes have equator in- 
tersection points Pg, (Ag, , 0?) and Pg, (Ag,, 0°) and the course angles a; und a2 (a; ¥ ag). Substituting 
the intersection point S(Ag, ys) in both loxodrome equations gives the system of equations: 


As — Ap, = tana, - ln tan (a + 2s) QI (αι # 90°), (3.230a) 
180° 
As — An, = tanag- In tan (ο: + #2) GE (ao 6 907). (3.230b) 
2 π 


Elimination of As and expressing yg gives an equation with infinitely many solutions: 
AE, — Àm +v 360° π 


tana, — tana, : a] SR ΝΕΤ (3-231) 


Ps, = 2 arctan exp | 


The corresponding geographical longitudes As, can be found by substituting qs, in (3.2302): 
es.) 130° 
2 π 


Remark: In certain cases an angle reduction is needed according to (3.205). 


Às, = Àp, + tan a; ln tan (45° + (a, £90°), (VEZ). (3.232) 
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3.5 Vector Algebra and Analytical Geometry 


3.5.1 Vector Algebra 
3.5.1.1 Definition of Vectors 


1. Scalars and Vectors 

Quantities whose values are real numbers are called scalars. Examples are mass, temperature, energy, 
and work (for scalar invariant see 3.5.1.5, p. 185, 1., p. 214 and 4.3.5.2, p. 288). 

Quantities which can be completely described by a magnitude and by a direction in space are called 
vectors. Examples are power, velocity, acceleration, angular velocity, angular acceleration, and electri- 
cal and magnetic force. We represent vectors by directed line segments in space. 

In this book the vectors of three-dimensional Euclidean space are denoted by a, and in matrix theory 
by a (see also 4.1.3, p. 271). 

2. Polar and Axial Vectors 

Polar vectors represent quantities with magnitude and direction in space, such as speed and acceler- 
ation; azial vectors represent quantities with magnitude, direction in space, and direction of rotation, 
such as angular velocity and angular acceleration. In notation they are distinguished by a polar or by 
an axial arrow (Fig. 3.113). In mathematical discussion they are treated in the same way. 

3. Magnitude or Absolute Value and Direction in Space 

For the quantitative description of vectors a or a, as line segments between the initial and endpoint A 
and B resp., are used the magnitude, i.e., the absolute value |a|, the length of the line segment, and the 
direction in space, which is given by a set of angles. 





4. Equality of Vectors 
Two vectors & and b are equal if their magnitudes are the same, and they have the same direction, i.e., 
if they are parallel and oriented identically. 
Opposite and equal vectors are of the same magnitude, but oppositely directed: 
— — — — 
AB=a, ΒΑΞ -ᾱ but |AB| = |BA|. (3.233) 


Axial vectors have opposite and equal directions of rotation in this case. 





B B Z^ Pxyz) C 
€ 
a b d 
B 
a)'A b)/A 
Figure 3.113 Figure 3.114 Figure 3.115 


5. Free Vectors, Bound or Fixed Vectors, Sliding Vectors 

A free vector is considered to be the same, i.e. its properties do not change, if it is translated parallel to 
itself, so its initial point can be an arbitrary point of the space. If the properties of a vector belong to 
a certain initial point, it is called a bound or fixed vector. A sliding vector can be translated only along 
the line it is already in. 

6. Special Vectors 

a) Unit Vector a? — & isa vector with length or absolute value equal to 1. With it the vector & can 
be expressed as a product of the magnitude and of a unit vector having the same direction as a: 


a- ela]. (3.234) 


The unit vectors i, j, k or €;, &;, €; (Fig. 3.114) are often used to denote the three coordinate axes in 
the direction of increasing coordinate values. 
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In Fig. 3.114 the directions given by the three unit vectors form an orthogonal triple. These unit 
vectors define an orthogonal coordinate system because for their scalar products 


ejéj — ejex — eje = 0 (3.235) 
is valid. Furthermore also 
eje; — ejej — exei — 1, (3.236) 


holds, i.e. it is an orthonormal coordinate system. (For more about scalar product see (3.248).) 


b) Null Vector 0 or zero vector is the vector whose magnitude is equal to 0, i.e., its initial and endpoint 
coincide, and its direction is not defined. 


c) Radius Vector r or position vector of a point P is the vector oP with the initial point at the origin 
and endpoint at P (Fig. 3.114). In this case the origin is also called a pole or polar point. The point 
P is defined uniquely by its radius vector. 

d) Collinear Vectors are parallel to the same line. 

e) Coplanar Vectors are parallel to the same plane. They satisfy the equality (3.260). 


3.5.1.2 Calculation Rules for Vectors 


1. Sum of Vectors 
— — = 
a) The Sum of Two Vectors AB = a and AD = b can be represented also as the diagonal of the 


— 
parallelogram ABC D, as the vector AC — c in Fig. 3.115b. The most important properties of the 
sum of two vectors are the commutative law and the triangle inequality: 


a+b=b+a, |a+b|<|al|+|b]. (3.237a) 


> sy — 
b) The Sum of Several Vectors a, b, C,...,e€ is the vector f = AF’, which closes the broken line 
composed of the vectors from à to € as in Fig. 3.115a. For n vectors a; (i — 1,2,...,n) holds: 


Ya-f. (3.237b) 
i=1 


Important properties of the sum of several vectors are the commutative law and the associative law of 
addition. For three vectors holds: 

a+b+¢=G+b+a, (€+b)+e=a+(b+9e). (3.237¢) 
c) The Difference of Two Vectors a — b can be considered as the sum of the vectors & und —b, ie., 


a—b=a+(-b)=d (3.237) 
which is the other diagonal of the parallelogram (Fig. 3.115b). The most important properties of the 
difference of two vectors are: 


a—a=0 (mulvector), |a—b|>||/a|—|b]]. (3.237e) 
w v $ p B 
a pv 
u A 
a) b) ϱ) d) od 


Figure 3.116 


2. Multiplication of a Vector by a Scalar, Linear Combination 
The products aa and ga are equal to each other and they are parallel (collinear) to a. The length 
(absolute value) of the product vector is equal to |a||a|. For a > 0 the product vector has the same 
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direction as a; for a < 0 it has the opposite one. The most important properties of the product of 
vectors by scalars are: 


αἄ--ἄα, αβᾶ--βαᾶ, (αΓβ)ᾶ-- αᾶ -βᾶ, α(ἄ-ΓΡ)-- αἅ -αβ. (3.2382) 
The linear combination of the vectors a, b, πα d with the scalars a, B,...,Óóis the vector 
k—a&- Bb4--- d. (3.238b) 


3. Decomposition of Vectors 
In three-dimensional space every vector à can be decomposed uniquely into a sum of three vectors, 
which are parallel to the three given non-coplanar vectors d, V, w (Fig. 3.116a,b): 
ü-—aüdvdyw. (3.2392) 
'The summands au, £y and ^w are called the components of the decomposition, the scalar factors a, 
B and ^ are the coefficients. When all the vectors are parallel to a plane one can write 
ᾱ- αἄ Γβν (3.239b) 
with two non-collinear vectors ii and v being parallel to the same plane (Fig. 3.116c,d). 


3.5.1.3 Coordinates of a Vector 


X 
1. Cartesian Coordinates According to (3.239a) every vector AB — 
a can be decomposed uniquely into a sum of vectors parallel to the basis 





vectors of the coordinate system i, j, k or €;, €;, y: 


a= yi + ayj + ak = Az; + dy€; + a,€g, (3.240a) 
where the scalars az, ay and a, are the Cartesian coordinates of the vector 
a in the system with the unit vectors é;, €; and Εμ. Also is written 

a= {az,dy,az} or (azr, ay, az). (3.240b) 
The three directions defined by the unit vectors form an orthogonal direc- 
tion triple. The components of a vector are the projections of this vector 
Figure 3.117 on the coordinate axes (Fig. 3.117). 





The coordinates of a linear combination of several vectors are the same linear combination of the co- 
ordinates of these vectors, so the vector equation (3.238b) corresponds to the following coordinate 
equations: 

















ky = (Ay + Bb, d bó ds, 
ky = aay + Bby t ò dy, (3.241) 
kz = aaz + bbz +- +ôdz. 
For the coordinates of the sum and of the difference of two vectors 
é=atb (3.242a) 
the equalities 
G =t; E by G= ay E by C—O, 4 0, (3.242b) 











are valid. The radius vector T of the point P(x,y, z) has the Cartesian coordinates of this point: 








Tz =%, Ty—y, T,—z2; ř=ri+yj+zk. (3.243) 
2. Affine Coordinates are a generalization of Cartesian coordinates with respect to a system of 
linearly independent but not necessarily orthogonal vectors, i.e., to three non-coplanar basis vectors 
&1, €2,63. The coefficients are a!,a?, aë, where the upper indices are not exponents. Similarly to 


(3.240a,b) for a holds 
ἃ-- αἲ 6ι -- αἱ ὄ, -- αἳ δι (3.244a) or a= fa, a?, a’ or a (a, a’, a?) f (3.244b) 


This notation is especially suitable as the scalars at, a?, a? are the contravariant coordinates of a vector 
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(see 3.5.1.8, p. 188). For &j — f, & — j, & — k the formulas (3.244a,b) become (3.240a,c). For the 
linear combination of vectors (3.238b) just as for the sum and difference of two vectors (3.242a,b) in 
analogy to (3.241) the same coordinate equations are valid: 








k! — aal 4- BU. 4. E ód!, 
P=aad+6b?+---+6a, (3.245) 
k3 = aa + 8B ++- -+8 d; 
αἳ -- αἲ tb, =r tb, C=0 EU (3.246) 














3.5.1.4 Directional Coefficient 


The directional coefficient of a vector a along a vector b is the scalar product 


aj — &b? — |&| cos p, (3.247) 
» b 3 = 
where b® = Bl is the unit vector in the direction of b and e is the angle between a and b. 


The directional coefficient represents the projection of a on b. 
E Inthe Cartesian coordinate system the directional coefficients of the vector a along the x, y, z axes are 
the coordinates az, dy, az. This statement is usually not true in a non-orthonormal coordinate system. 


3.5.1.5 Scalar Product and Vector Product 
1. Scalar product 


The scalar product or dot product of two vectorsaand , 
b is defined by the equation b ce b 
a-b = ab = (ab) = |a| |b | cosy, (3.248) — 
d a 


where y is the angle between a and b considering them 

with a common initial point (Fig. 3.118). The value 

of a scalar product is a scalar. Figure 3.118 Figure 3.119 
2. Vector Product 
or cross product of the two vectors a and b is a vector € such that it is perpendicular to the vectors a and 
b, and in the order a, b, and € the vectors form a right-hand system (Fig. 3.119): If the vectors have 
the same initial point, then looking at the plane of a and b from the endpoint of C, the shortest rotation 
of & in the direction of b is counterclockwise. The vectors &, b, and € have the same arrangement as the 
thumb, the forefinger, and the middle finger of the right hand. Therefore this is called the right-hand 
rule. The vector product (3.249a) has the magnitude (3.249b) 


ax b= [ab] =é, (3.249a) |é| = |a| |b] sing, (3.249b) 


where c is the angle between a and b.N umerically the length of c is equal to the area of the parallelo- 
gram defined by the vectors à and b. 
3. Properties of the Products of Vectors 
a) The Scalar Product is commutative: 

ab=ba. (3.250) 
b) The Vector Product is anti commutative (changes its sign by interchanging the factors): 

ax b=—(b x a). (3.251) 
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c) Multiplication by a Scalar Scalars can be factored out: 


o(&b) — (a&) b, (3.2522) a(a x b) = (aa) x b. (3.252b 
d) Associativity The scalar and vector products are not associative: 
&(bc) 4 (ab)é, (3.253a) ax (bx @) 4 (ax b) xe (3.253b 
e) Distributivity The scalar and vector products are distributive over addition: 
a(b+é) =ab+aé, (3.254a 
ax (b+é@)=axb+axé and (b+é)xa=bxa+exa (3.254b 


f) Orthogonality of Two Vectors Two vectors are perpendicular to each other (a L b) if the equality 


&b —0 holds, and neither & nor b are null vectors. (3.255 
g) Collinearity of Two Vectors Two vectors are collinear (& || b) if the equality 

&xb-—Ü holds, and neither & nor b are null vectors. (3.256 
h) Multiplication of the same vectors: 


aa=2=c?, Axa=0. (3.257 





i) Linear Combinations of Vectors can be multiplied in the same way as scalar polynomials (be- 
cause of the distributive property), only one must be careful with the vector product. If interchanging 
the factors, also the signs are to be changed. 


BA: ( 3a «5b — 2c) (à — 2b — 4c) — 3a? 4- 5bá — 2cà — 68b — 10b? 4- Acb — 12ác — 20be 4- 8c? 
= 3a? — 10b? 4- 8c? — ἄθ -- 1486 -- 1606. 

B B: ( 3& «5b — 2€) x (à—2b — 46) 2 3& x à-- 5b x & 26 x & — 68 x b — 10b x b 
+ 4E xb -— 128 x € — 20b x €+ 8€ x €= 0 — 58 x b + 28 x €— 68 x b--0— 4b x € 
— 128 x€ — 20b x € + 0 = —118 x b — 108 x € — 24b x € = 11b x 8 + 107 x & 4- 24 x b. 


j) Scalar Invariant is a scalar quantity if it does not change its value under a translation or a rotation 
of the coordinate system. The scalar product of two vectors is a scalar invariant. 





W A: The coordinates of a vector a = {a1, 2, a3} are not scalar invariants, because in different coor- 
dinate systems they can have different values. 

E B: The length of a vector ais a scalar invariant, because it has the same value in different coordinate 
systems. 

W C: Since the scalar product of two vectors is a scalar invariant, the scalar product of a vector by 
itself is also a scalar invariant, i.e., 2a = |a|? cosy = |al?, because y = 0. 


3.5.1.6 Combination of Vector Products 


1. Double Vector Product 
The double vector product a x (b x c) results in a vector coplanar to b and c: 

&x (bx &) = b (ac) — &(&b). (3.258) 
2. Mixed Product 
'The mixed product (à x b) c, which is also called the triple product, results in a scalar whose absolute 
value is numerically equal to the volume of the parallelepipedon defined by the three vectors; the result 


is positive if à, b, and c form a right-hand system, negative otherwise. Parentheses and crosses can be 
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omitted: 

(xb)é—a(bxc)-—abc-bca-dcab- —acb — —bac— —cbà. (3.259) 
The interchange of any two terms results in a change of sign; the cyclic permutation of all three terms 
does not affect the result. 4 
For coplanar vectors, i.e., if a is parallel to the plane defined by b and C, holds: 

















a(b x é) =0. (3.260) 
3. Formulas for Multiple Products 
a) Lagrange Identity: (a x b)(é x d) — (àc) (bd) — (bc) (&d), (3.261) 
ac af ag 
b) abé-éfg=|be bf bg. (3.262) 
ce cf cg 
4. Formulas for Products in Cartesian Coordinates 
If the vectors a, b, € are given by Cartesian coordinates as 
&-ía,aya,), b-(b bb), €= {Crs Cys cz}, (3.263) 
the calculation of the products can be made by the following formulas: 
1. Scalar Product: ab= Azby + Ayby + azbz. (3.264) 
2. Vector Product: x b = (ayb; — a4by) i-- (a4b, — a40;) j -- (asby — dybr) k 
ijk 
= | ay Qy az |- (3.265) 
by by bz 
= Gz Ay Az 
3. Mixed Product: abé=| bz by bz |. (3.266) 
Gy Cy Ce 








5. Formulas for Products in Affine Coordinates 
1. Metric Coefficients and Reciprocal System of Vectors If the affine coordinates of two vec- 


tors à and b in the system of €i, &», € are given, i.e., 

ü—alé -Γ αἲ δα δι, Ὁ -- ὐἱ δι 6D 6, (3.267) 
and there is to calculate the scalar product 

ab =a'b! & & + a? b? Ez E + a? b? € &3 


+ (a'b? +a? b) ë & + (a? b + a? b?) E283 + (a? b + a'b) 838: (3.268) 





or the vector product 

a x b = (a? b — a? b?) & x ës + (a? b! — a WP) & x & + (a'b? — a? b) &1 x E, (3.2692) 
with the equalities 

δι χ δι -- ὄ, x & = & x & =O, (3.269b) 


then the pairwise products of the coordinate vectors must be known. For the scalar product these are 
the six metric coefficients (numbers) 


gii — €1€1, $22 — 6262, 33 — €3 €3; 
gi2 — 61625 €2€;, 923 = 6963 — 6162, 931 = €3 €i — € C3 (3.270) 
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and for the vector product the three vectors 


E! = N (Ez X E3), E? = N (E X E1), EF =N(E, x &), (3.271a) 
which are the three reciprocal vectors with respect to €, €», &3, where the coefficient 
1 
(3.271b) 


N=- 
616263 

is the reciprocal value of the mixed product of the coordinate vectors. This notation serves only as 
a shorter way of writing in the following discussion. Calculations with the coefficients will be easy to 
perform with the help of the multiplication Tables 3.13 and 3.14 for the basis vectors . 





















































Table 3.13 Scalar product Table 3.14 Vector product 
of basis vectors of basis vectors 
Multipliers 
| & | & | δι | | δι | & | & 
3 : a3 a2 
| €1 | 911 | 912 | 913 | $!é& 0 Er η 
= a 2 2 
2 | 921 | 922 | 923 | A FE él 
= Β δὲ --- 0 — 
€3 | 931 | 932 | 933 3 f R 
5 32 z1 
E xs e e 
(Iki = gik) | ὃν Ἢ -y Ὁ 
Table 3.15 Scalar product Table 3.16 Vector product 
of reciprocal basis vectors of reciprocal basis vectors 
Multipliers 
























































| iljlk τ ΤΙ κ 
ΠΕΚΚ tli olk 
3 $ - = 
E ofifo] 4 j|-k|o|i 
συ 1| 3| kE j| o0 
A 








2. Application to Cartesian Coordinates The Cartesian coordinates are a special case of affine 
coordinates. From Tables 3.15 and 3.16 for the basis vectors holds 








&=i, &=j, &=k (3.272a) 
with the metric coefficients 

gu = 92= 933=1, g12— 237-9 — 0, Q πε 1, (3.272b) 
and the reciprocal basis vectors 

Sai @ =j, @=k. (3.272c) 


So the basis vectors coincide with the reciprocal basis vectors of the coordinate system, or, in other 
words, in the Cartesian coordinate system the basis vector system is its own reciprocal system. 
3. Scalar Product of Vectors Given by Coordinates 
3 83 
ab = X` Y gmna" b" = gaga? UP. (3.273) 
m=1 


n=1 
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For Cartesian coordinates (3.273) coincides with (3.264). 

After the second equality in (3.273) a shorter notation for the sum was applied which is often used in 
tensor calculations (see 4.3.1, 2., p. 280): instead of the complete sum one writes only a characteristic 
term so that the sum should be calculated for repeated indices, i.e., for the indices appearing once down 
and once up. Sometimes the summation indices are denoted by Greek letters; here they have the values 
from 1 until 3. Consequently holds 





Japa? b? = gua’ bt + giza” b? + giza” b? + gab + ga2a? b? + goza? b? 


+ g31a”b! + g320? b? + g33a? b?. (3.274) 
4. Vector Product of Vectors Given by Coordinates In accordance with (3.269a) 
4 elg? g? 
ax b = 6, @6@3| a! a? αὖ 
p gp 
- & éé [(a* ? — a9 b)é* + (a3 bt — ab) 6? + (a τὸ -- ὦ e?] (3.275) 


is valid. For Cartesian coordinates (3.275) coincides with (3.265). 


5. Mixed Product of Vectors Given by Coordinates In accordance with (3.269a) holds 
ADE = & Ges |b! b? b|. (3.276) 


For Cartesian coordinates (3.276) coincides with (3.266). 
3.5.1.7 Vector Equations 


Table 3.17 contains a summary of the simplest vector equations. In this table a, b, c are given vectors, 
X is the unknown vector, a, (, y are given scalars, and zx, y, z are the unknown scalars to be calculated. 


3.5.1.8 Covariant and Contravariant Coordinates of a Vector 


1. Definitions The affine coordinates a!, a?, a? of a vector a in a system with basis vectors €1, &5, 


€, defined by the formula 

ī = a' ë +a? E «- αἳ δι -- αἲ δα (3.277) 
are also called contravariant coordinates of this vector. The covariant coordinates are the coefficients 
in the decomposition with the basis vectors é!, é?, &?, i.e., with the reciprocal basis vectors of ej, e, 
é3. With the covariant coordinates a1, a2, a3 of the vector à 

a= a, 6! + αι δ” + a3 6? = aye. (3.278) 
In the Cartesian coordinate system the covariant and contravariant coordinates of a vector coincide. 
2. Representation of Coordinates with Scalar Product 
The covariant coordinates of a vector a are equal to the scalar product of this vector with the corre- 
sponding basis vectors of the coordinate system: 

Ωι ae, 5 — ae», ü3 — a 63. (3.279) 
The contravariant coordinates of a vector à are equal to the scalar product of this vector with the 
corresponding basis vectors: 

al—adl α =88?, sg —ae? (3.280) 
In Cartesian coordinates (3.279) and (3.280) are coincident: 


dy =ai, ay=aj, a, =ak. (3.281) 
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Table 3.17 Vector equations 


X unknown vector; à, b, C, d given vectors; zr, y, z unknown scalars; a, 8 , y given scalars 








Equation Solution 
1. X4á-b x-b-à 
A zy 4, a 
2. ax —a κ---- 
α 
3. Xa—a Indeterminate equation; considering all vectors X satisfying 


the equation, with the same initial point, then the endpoints 
form a plane perpendicular to the vector à. Equation 3. is 
called the vector equation of this plane. 

4. Xxa-—b (b lä) Indeterminate equation; considering all vectors X satisfying 
the equation, with the same initial point, then the endpoints 
form a line parallel to a. Equation 4. is called the vector 
equation of this line. 






































5 χπ-α , aa+axb -- 
'"lxxa-b (bia |*^ z ec dap 
Xa=a T "m MEE: 
> b b 5 = 5 
6. 1 Xb—f X Bib Seas. ) αᾶ 4- Bb 1- ης, 
Xé-5 | abc 
where ᾱ, b, € are the reciprocal vectors of a, b, € (see 3.5.1.6, 
1 : p. 186). 4 νο 
3 + {ρε adc abd 
7 d=xa+yb+z¢e C= 2 a. = 
z "M übc abe abe 
8. d=x(b x é) da db dé 
x s p | T=- =—, 2-Ξ-- 
+y (€ x a) + 2 (a xb) abe " abe abé 





3. Representation of the Scalar Product in Coordinates 
The determination of the scalar product of two vectors by their contravariant coordinates yields the 
formula (3.273). The corresponding formula for covariant coordinates is: 


ab = g”? aa bp, (3.282) 
where g”™” = €™ 6” are the metric coefficients in the system with the reciprocal vectors. Their relation 
with the coefficients gmn iS 
(— Ίγπηρη Amn 
Gu 912 913 


921 922 923 
931 932 933 


where A"" is the subdeterminant of the determinant in the denominator obtained by deleting the row 
and column of the element gmn- 


mn 


(3.283) 








If the vector a is given by covariant coordinates, and the vector b by contravariant coordinates, then 
their scalar product is 


ab=a! bı +a? bz + a? bz = a ba (3.284) 
and analogously holds 
Ab = aa b°. (3.285) 
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3.5.1.9 Geometric Applications of Vector Algebra 

Table 3.18 demonstrates some geometric applications of vector algebra. Other applications from ana- 
lytic geometry, such as vector equations of the plane and of the line, are demonstrated in 3.5.1.7, p. 189 
and 3.5.3.10, p. 218ff. and on the subsequent pages. 


Table 3.18 Geometric application of vector algebra 





Vector Formula with coordinates 
formula (in Cartesian coordinates) 


Length of the vector a a= vā? a — ja - a2 -- a2 


Area of the parallelo- 
gram determined by 


Determination 




















2 2 2 
the vectors à and b g= la x b| S= rg 5 + + i 2 
y z Uy 
Volume of the 
parallelepiped 
determined by the 3 Qr Ay Az 
vectors a, b.c V= lab z| V = | bz by bz 
ὃς ος, 








Angle between the ab Agby + dyby + azbz 


al 
vectors a and b cos p = cosy = 
V228? a2 -- a2 + a2 ,/b2 + b2 + b2 























3.5.2 Analytical Geometry ofthe Plane 
3.5.2.1 Basic Concepts, Coordinate Systems in the Plane 


The position of every point P of a plane can be given by an arbitrary coordinate system. The numbers 
determining the position of the point are called coordinates. Mostly Cartesian coordinates and polar 
coordinates are in use. 


1. Cartesian or Descartes Coordinates 

The Cartesian coordinates of a point P are the signed distances of this point, given in a certain measure, 
from two coordinate axes perpendicular to each other (Fig. 3. 120). The intersection point 0 of the 
coordinate axes is called the origin. The horizontal coordinate axis, usually the z-axis, is usually called 
the axis of abscissae, the vertical coordinate axis, usually the y-axis, is the azis of ordinates. 








I II II IV 





0 x 


Figure 3.120 Figure 3.121 
The positive direction is given on these axes: on the z-axis usually to the right, on the y-axis upwards. 
The coordinates of a point P are positive or negative according to which half-axis the projections of 
the point fall (Fig. 3.121). The coordinates x and y are called the abscissa and the ordinate of the 
point P, respectively. The point with abscissa a and ordinate b is denoted by P(a, b). The x, y plane is 
divided into four quadrants I, IT, ITI, and IV by the coordinate axes (Fig. 3.121,a). 
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2. Polar Coordinates 

The polar coordinates of a point P (Fig. 3.122) are the radius p, i.e., the distance of the point from a 
given point, the pole 0, and the polar angle y, i.e., the angle between the line 0P and a given oriented 
half-line passing through the pole, the polar a: The pole is also called the origin. The polar angle is 
positive if it is measured counterclockwise from the polar axis, otherwise it is negative. 





v=b, AM 
P(p,o) u=a, 
v=b, 
p 
v-b, η-ας 
0 
Figure 3.122 Figure 3.123 


3. Curvilinear Coordinate System 

This system consists of two one-parameter families of curves in the plane, the family of coordinate curves 
(Fig. 3.123). Exactly one curve of both families passes through every point of the plane. They intersect 
each other at this point. The parameters corresponding to this point are its curvilinear coordinates. In 
Fig. 3.123 the point P has curvilinear coordinates u — a; and v = 63. In the Cartesian coordinate 
system the coordinate curves are straight lines parallel to the coordinate axes; in the polar coordinate 
system the coordinate curves are concentric circles with the center at the pole, and half-lines starting 
at the pole. 








Figure 3.124 Figure 3.125 


3.5.2.2 Coordinate Transformations 

Under transformation of a Cartesian coordinate system into another one, the coordinates change ac- 
cording to certain rules. 

1. Parallel Translation of Coordinate Axes 

The axis of the abscissae is shifted by a, and the axis of the ordinates by b (Fig. 3.124). Suppose a 
point P has coordinates x,y before the translation, and it has the coordinates x’, y’ after it. The old 
coordinates of the new origin 0! are a,b. The relations between the old and the new coordinates are the 
following: 


c=a'+a, y=y' +), (3.286a) z'—r—a y-—y-b. (3.286b) 


2. Rotation of Coordinate Axes 
Rotation by an angle (Fig. 3.125) yields the following changes in the coordinates: 


a'—rcosQ -cysing, (3.2872) r —z'cosQ —y' sing, (3.287b) 
y — —rsing +y cosg, y =a'sing +y' cosy. 








192 3. Geometry 


The coefficient matrix belonging to (3.287a) 


τ ok Jl n . T 
ο ο ο... 
—siny cosy y y y y 


is called the rotation matrix. 

In general, the transformation of a cartesian coordinate system into another can be performed in two 
steps, a translation and a rotation of the coordinate axes. 

Remark: With the so-called coordinate transformation considered here the coordinate system is trans- 
formed, but the represented object rests in its position. In contrast to this with a so-called geometric 
transformation the object is transformed, but the coordinate system remains unchanged in its position. 
In 3.5.4, p. 229 the rotation of an object is described by 


E -n(7). (3.288) 
Up UP 


where R is the rotation matrix. Between D and R there exists the relation 


R-Di!. (3.289) 






P6; ,y;) Ρ 


1 
d / d 
βι/ Ρ, 


Pi6G γι) 


/Qy-7 0) P2 
0 


Figure 3.126 Figure 3.127 Figure 3.128 





X 


3. Transforming Cartesian Coordinates into Polar Coordinates and Conversely 
Supposing the origin coincides with the pole and the axis of abscissae coincides with the polar axis 
(Fig. 3.126), then 

z= p(p)cosy y= ply)siny (-r<p<rm, p20); (3.290a) 


arctan P. +r forz <0, 
q 


arctan P forx > 0, 
p—wyaz* y, (3.290b) p= ; for x — 0 and y 7 0, (3.290c) 
T 


2 for x — 0 and y « 0, 


undefined for x = y = 0. 


3.5.2.3 Special Notations and Points in the Plane 


1. Distance Between Two Points 
If the two points given in Cartesian coordinates as P, (£1, y1) and P, (£2, y2) (Fig. 3.127), then their 
distance is 

d — y (za — 21)? + (yo — y1}. (3.291) 
If they are given in polar coordinates as Pi (p1, v1) and P5 (ps, 3) (Fig. 3.128), their distance is 


d= V + p3 — 2pxpa cos (a — v1). (3.292) 
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2. Coordinates of Center of Mass 
The coordinates (x,y) of the center of mass of a system of material points M; (v;,y;) with masses 








m; (t= 1,2,...,n) are calculated by the following formula: 
YS myx; X miyi 
c= ; = Z., 3.293 
T Emi y Emi ( ) 
3. Division of a Line Segment 
PP m 
1. Division in a Given Ratio The coordinates of the point P with division ratio PE =T) 
n 


(Fig. 3.129a) of the line segment P, P, are calculated by the formulas 

















nzi +MI2 T1 + AÀT2 ny My2 yı + Ay 
qnos ΕΞ ἘΠ ΠΕ 9945 y= SS 2941 
΄ ncm τα, (3-294a) y n+m 1+A (5:2940) 
For the midpoint M of the segment P; P2, because of A = 1 
es Ta (3.294c) y=% 3 2 (3.294d) 


holds. The sign of the segments P,P and PP, can be defined. Their signs are positive or negative 
depending on whether their directions are coincident with P,P, or not. Then formulas (3.294a,b,c,d) 








result in a point outside of the segment P, P» in the case À « 0. This is called an external division. 





If P is inside the segment P, Pz, it 

is called an internal division. One 

defines: 

a)à=0if P= RP, 

b) à= œ if P = P, and 

c) à = —1 if P is an infinite or im- 

proper point of the line g, i.e., if 

P is infinitely far from P, P; on g. 
The shape of A is shown in Fig. 


P566 ,y3) € 








3.129b. 
E For a point P, for which P, is 
b) the midpoint of the segment P P, 
P,P 
Figure 3.129 A- PE — —2 holds. 


2. Harmonic Division Ifthe internal and external division of a line segment have the same absolute 
value |À], it is called harmonic division. Denote by P; and P; the points of the internal and external 
division respectively, and by A; and A, the internal and external rates. Then 


HB γε Fife 5, (3.295a) ^ yt =o (3.295b) 
P,P, PP, 











If M denotes the midpoint of the segment P, P, at a distance b from Pj (Fig. 3.130), and the distances 
of P; and P, from M are denoted by x; and £a, then 

box to Zi b i 
= or —=—, ie. 
b-aj πι-ὐ b oc 


, Lifa =b. (3.296) 








The name harmonic divisionis in connection with the harmonic mean (see 1.2.5.3, p. 20). In Fig. 3.131 
the harmonic division is represented for À = 5 : 1, analogously to Fig. 3.14. The harmonic mean r of 
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the segments P, P; = p and Pj P, — q according to (3.295a) equals in accordance with (1.67b), p. 20, to 


2pq 


r— —— — —2b (see Fig. 3.132). (3.297) 


| pq 





Figure 3.132 


3. Golden Section ofa segment a is its division into two parts x and a — x such that the part x 


and the whole segment a have 


a) 


4— x —>jea-x 


Figure 3.133 


3.5.2.4 Areas 


A 


NID 


ο 





he same ratio as the parts a — x and z: 


PIE (3.2982) 


T 
a T 





In this case xv is the geometric mean of a and a — x (see also golden 
section in 1.1.1.2, p. 2), and it holds: 


x= /a(a— x),  (3.298b) x = Εν 5” 0.018 -α.  (3.298c) 


The part x of the segment can be geometrically constructed as shown 
in Fig. 3.133a. 

Remark 1: The line segment v is also the length of the side of a 
regular decagon with circum radius a (see also 3.1.5.3, p. 139). 
Remark 2: The following geometrical problem produces also the 
equation of the golden section: Given a rectangle with the constant 
side length a and a variable side a — x. To find is the value x so, that 
the area a(a — x) of the rectangle is equal to the area x? of the square 
(see Fig.3.133b). 


1. Areaofa Convex Polygon 


If the vertices are given by P (z1, y1), P» (12, ya), ..., Pn (En, Yn), then the area is 
1 
S — 5 {σι = z2) (y1 + y2) + (£2 = z3) (y2 + ys) +++: + (En = 2) Qs t 9] (3.299) 


The formulas (3.299) and (3.300) result in a positive area if the vertices are enumerated counterclock- 
wise, otherwise the area is negative. 


2. Area ofa Triangle 


If the vertices are given by P; (£1, y1), P2 (£2, Y2), and P; (£3, y3) (Fig. 3.134), then the area can be 


calculated by the formula 
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y 
P3(X3 γα) 
ῃ|2 νι 1] 1 
S = 3 |72 Y2 1| = 5 [71 (y2 = ys) + z2 (ys — y1) + z3 (yı — y2)] 
T3 y l 
1 
P EL [αντ 22) (y1 + y2) + (2 — £3) (y2 + y3) + (£3 — z1) (y3 + yı)] - (3-300) 
10% /Y1) Pa% Vo) 
ryt 
0 X Three points P1, P», P5 are on the same line if. | x» y» 1| — 0 holds. (3.301) 
T3 ya l 








Figure 3.134 


3.5.2.5 Equation of a Curve 


An equation F(x,y) — 0 for the coordinates x and y often corresponds to a curve, which has the prop- 
erty that every of its points P satisfies the equation, and conversely, every point whose coordinates 
satisfy the equation is on the curve. The set of these points is also called the geometric locus or simply 
locus. If there is no real point in the plane satisfying the equation F(x,y) = 0, then there is no real 
curve, and one talks about an imaginary curve: 

MA: 2? +y?+1=0, 

WB B: y—1n (1 —a?- cosh z) =0. 

The curve corresponding to the equality F(x,y) = 0 is called an algebraic curve if F(x,y) is a poly- 
nomial, and the degree of the polynomial is the order or degree of the curve (see 2.3.4, p. 65). If the 
equation of the curve cannot be transformed into the form F(x, y) — 0 with a polynomial expression 
F(x,y), then the curve is called a transcendental curve. 

The equation of a curve can be defined in the same way in any coordinate system. But from now on, in 
this book only the Cartesian coordinate system is used, except when stated otherwise. 


3.5.2.6 Line 


1. Equation of the Line 


Every equation that is linear in the coordinates is the equation of a line, and conversely, the equation 
of every line is a linear equation of the coordinates. 


1. General Equation of Line 

Ax4- By-4- C —0 (A,B,C const). (3.302) 
For A = 0 (Fig. 3.135) the line is parallel to the z-axis, for B = 0 it is parallel to the y-axis, for C = 0 
it passes through the origin. 
2. Equation of the Line with Slope (or Angular Coefficient) Every line that is not parallel to 
the y-axis can be represented by an equation written in the form 

y — kv b. (k,b const). (3.303) 


The quantity k is called the angular coefficient or slope of the line; it is equal to the tangent of the angle 
between the line and the positive direction of the x-axis (Fig. 3.136). The line cuts out the segment b 
from the y-axis. Both the tangent and the value of b can be negative, depending on the position of the 
line. 

3. Equation ofa Line Passing Through a Given Point The equation of a line which goes through 
a given point Pj (z1, y1) in a given direction (Fig. 3.137) is 


y—y —k(r—z,), with k-—tanó. (3.304) 
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y 
PAG γι) 
0 0 x 
Figure 3.135 Figure 3.136 Figure 3.137 


4. Equation of a Line Passing Through Two Given Points If two points of the line Pj (z1, 1), 
P» (12, y2) are given (Fig. 3.138), then the equation of the line is 
-! --2 
yom _ 1 (3.305) 
Y2 — Yı T2 — Tı 
5. Intercept Equation of a Line If a line cuts out the segments a and b from the coordinate axes, 
considering them with sign, the equation of the line is (Fig. 3.139) 





T y 
Zatoa (3.306) 
y 
y 
Pix y) 
P32 Υ2) b 
Ριίχι γι) 6 
0 x 0 a x 
Figure 3.138 Figure 3.139 Figure 3.140 


6. Normal Form of the Equation of the Line (Hessian Normal Form) With p as the distance 
of the line from the origin, and with a as the angle between the x-axis and the normal of the line passing 
through the origin (Fig. 3.140), with p > 0, and 0 < a < 27, the Hessian normal form is 

xcosa + ysina —p=0. (3.307) 
The Hessian normal form can be got from the general equation if multiply (3.302) by the normalizing 
factor 

un" 1 

BETTE 

The sign of jj must be the opposite to that of C in (3.302). 


7. Equation ofa Line in Polar Coordinates (Fig. 3.141) With p as the distance of the line from 
the pole (normal segment from the pole to the line), and with a as the angle between the polar axis and 
the normal to the line passing through the pole, the equation of the line is 


-- p 
Em (3.309) 





(3.308) 


2. Distance of a Point from a Line 
The distance d of a point Pi (21, y1) from a line (Fig. 3.140) can be got by substituting the coordinates 
of the point into the left-hand side of the Hessian normal form (3.307): 

d — r4 cosa 4- yi sina — p. (3.310) 
If P, and the origin are on different sides of the line, yields d > 0, otherwise d < 0. 
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P(X yo) 





Figure 3.141 Figure 3.142 Figure 3.143 


3. Intersection Point of Lines 

1. Intersection Point of Two Lines In order to get the coordinates (xo, yo) of the intersection 
point of two lines the system of equations given by the equations is to be solved. If the lines are given 
by the equations 




















Az + Biy -C4—0, Aor 4 Boy - C9 —0 (3.311a) 
then the solution is 
Bı σι σι Αι 
Bz C2 Cy Ag 
Πρ ποτ, w=: 3.311b 
Αι δι ΓΒ; ( ) 
A» Bg Ag DB» 
B C. 
If i D — 0 holds, the lines are parallel. If n = E = a holds, the lines are coincident. 
2. Pencil of Lines Ifa third line with equation 
A3x + Bzy + C3 = 0 (3.312a) 
passes through the intersection point of the first two lines (Fig. 3.142), then the relation 
Ay By Cy 
Ag By Co} = 0 (3.312b) 
Aa B3 C3 








must be satisfied. 
The equation 

(Aix + Biy + C1) + A (Aga + Boy + C2) = 0 (—00 < A < +00) (3.312c) 
describes all the lines passing through the intersection point Po(xo, yo) of the two lines (3.311a). By 
(3.312c) a pencil of lines is defined with center Po(xo, yo). If the equations of the first two lines are given 
in normal form, then \ = +1 yields the equations of the bisectrices of the angles at the intersection 
point (Fig. 3.143). 





y y 








οἱ X ol 
a) b) 
Figure 3.144 Figure 3.145 


4. Angle Between Two Lines 
In Fig. 3.144 there are two intersecting lines. If their equations are given in the general form 


Air 4 Bıy+Cı=0 and Ar + Bzy + C2 = 0, (3.313a) 
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then for the angle y holds 


A1B3 = A,B, 
tang — —— ———— 9191 
ang Z,A 41 B,B,' (3.313b) 
4143 + BıB2 Aı Bə — ABı 
COS(Q) — ———————————-,. (3.3130) Sin — ————L——— ——-.  (3.313d) 
\/ At + B?y/ A3 + BB y A1 - B24/ A2 - B2 
With the slopes kı and kə of the intersecting lines holds 
O k-k l 
tany = LF hkz’ (3.313e) 
1+ kik: kə — k 
aes A (3.313f) ding = 99 AM (3.313g) 


Vir BV B VA KRÁT ADS 


Here the angle o is to be considered into the counterclockwise direction from the first line to the second 
one. 

A B 

x = zB or ky = kg are valid. 

For perpendicular (orthogonal) lines (Fig. 3.145b) holds Ay A2 + By By = 0 or ky = —1/hky. 


For parallel lines (Fig. 3.145a) the equalities 








Figure 3.146 Figure 3.147 


3.5.2.7 Circle 


1. Definition of the Circle The locus of points at the same given distance from a given point is 
called a circle. The given distance is called the radius and the given point is called the center of the 
circle. 

2. Equation of the Circle in Cartesian Coordinates The equation of the circle in Cartesian 
coordinates when its center is at the origin (Fig. 3.146a) is 


+y = R. (3.314a) 
If the center is at the point C'(zo, yo) (Fig. 3.146b), then the equation is 

(a — 20)” + (y— yo)” = R?. (3.314b) 
The general equation of second degree 

aa? -- 2bxy + cy” + 2dr + 2ey + f =0 (3.315a) 


is the equation of a circle only if b = 0 and a = c. In this case the equation can always be transformed 
into the form 

x? 4 y? - 2mx + 2ny+q=0. (3.315b) 
For the radius and the coordinates of the center of the circle the following equalities hold 
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Πες Υπ πο -- ᾳ, (3.316a) t=—m, y=-n. (3.316b) 


If q > Mm? + n? holds, the equation defines an imaginary curve, if q = m? + n? the curve has one single 
point P(xo, yo). 
3. Parametric Representation of the Circle 

x = xo + Rcost, y= y + Rsint, (3.317) 
where t is the angle between the moving radius and the positive direction of the x-axis (Fig. 3.147). 





αν 
0 ΄ 0 
Figure 3.148 Figure 3.149 Figure 3.150 
4. Equation ofthe Circle in Polar Coordinates in the general case corresponding to Fig. 3.148: 
f? — 2ppo cos (i — qo) -- pd — R”. (3.3182) 


If the center is on the polar axis and the circle goes through the origin (Fig. 3.149) the equation has 
the form 


p — 2Rcos g. (3.318b) 
5. Tangent of a Circle The equation of the tangent of a circle, given by (3.314a) at the point 
P (xo, yo) (Fig. 3.150) has the form 

zzo + yyo = R2. (3.319) 
3.5.2.8 Ellipse 
1. Elements of the Ellipse In Fig. 3.151, AB — 2a is the major azis, CD = 2b is the minor 
azis, A, B, C, D are the vertices, F1, Fù are the foci at a distance c = Va? — b? on both sides from 
the midpoint, e = c/a < 1 is the numerical eccentricity, and p = b?/a is the semifocal chord, i.e., the 
half-length of the chord which is parallel to the minor axis and goes through a focus. 





directrice 
directrice 








d— —9— —— d 





Figure 3.151 Figure 3.152 
2. Equation ofthe Ellipse If the coordinate axes and the axes of the ellipse are coincident, the 
equation of the ellipse has the normal form. This equation and the equations in parametric form are 
a? y? 
gg 


For the equation of the ellipse in polar coordinates see 3.5.2.11, 6., p. 208. 


=1, (3.320a) y -acost, y-bsint. (3.320b) 
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3. Definition of the Ellipse, Focal Properties The ellipse is the locus of points for which the sum 
of the distances from two given points, the foci, is a constant, and equal to 2a. These distances, which 
are also called the focal radii of the points of the ellipse, can be expressed as a function of the coordinate 
x from the equalities 

m=hP=a-erz, ra= FP =a+er, τι-Ἔτος 2α. (3.321) 
Also here, and in the following formulas in Cartesian coordinates, it is supposed that the ellipse is given 
in normal form. 


4. Directrices of an Ellipse are lines parallel to the minor axis at distance d = a/e from it 
(Fig. 3.152). Every point P(x, y) of the ellipse satisfies the equalities 
TÉ Το 
——— 3.322 
moe (3.322) 


and this property can also be taken as a definition of the ellipse. 


5. Diameter of the Ellipse The chords passing through the midpoint of the ellipse are called 
diameters of the ellipse. The midpoint of the ellipse is also the midpoint of the diameter (Fig. 3.153). 
The locus of the midpoints of all chords parallel to the same diameter is also a diameter; it is called the 
conjugate diameter of the first one. For k and k’ as slopes of two conjugate diameters the equality 


--ῥὲ 
j= 
kk' — πο (3.323) 
holds. If 2a; and 2b; are the lengths of two conjugate diameters and a and f are the acute angles 
between the diameters and the major axis, where k — — tana and k’ = tan f hold, then the Apollonius 
theorem holds in the form 
a,b; sin (a + 8) = ab, +b =a Hb. (3.324) 








Figure 3.153 Figure 3.154 Figure 3.155 


6. Tangent of the Ellipse at the point P(xo, yo) is given by the equation 
Tro | YYo 
a dp 
The normal and tangent lines at a point P of the ellipse (Fig. 3.154) are bisectors of the interior and 
exterior angles of the radii connecting the point P with the foci. The line Ax + By +C = 0 is a tangent 
line of the ellipse if the equation 
Αα + Bb? — C0? =0 (3.326) 


is satisfied. 


=1. (3.325) 


7. Radius of Curvature of the Ellipse (Fig. 3.154) If u denotes the angle between the tangent 
line and the radius vector connecting the point of contact P(xo, yo) with a focus, then the radius of 
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curvature is 











3 3 
5 xy (nn? p 
242 To | Yo 173)2 — d 
Tes at b ab sin?u eee) 
b2 
At the vertices A and B (Fig. 3.151) and at C and D the radii are R4 — Rg — — — p and Re — 
a 
|a? 
D= >: 
8. Areas of the Ellipse (Fig. 3.155) 
a) Ellipse: 
S -—mab. (3.3282) 
b) Sector of the Ellipse BOP: c) Segment of the Ellipse PBN: 
y X 
Spop — E arccos ~. (3.328b) Sppn = abarccos πο (3.328c) 
a 


9. Arc and Perimeter of the Ellipse The arclength between two points A and B of the ellipse 
cannot be calculated in an elementary way as for the parabola, but with an incomplete elliptic integral 
of the second kind E(k, y) (see 8.2.2.2, 2., p. 502). 

The perimeter of the ellipse (see also 8.2.5, 1., 515) can be calculated by a complete elliptic integral of 


π π 
the second kind E(e) — E (e 5) with the numerical eccentricity e = Va? — b?/a and with y = e (for 
one quadrant of the perimeter), and it is 


71/2 


L = 4aE(k, 5) — 4a / Vl-K?sm?y with k—e- va?— l2/a. (3.3292) 
o 


The calculation of L = 4aE(E, 7/2) — 4aE(e) can be performed by the help of the following methods: 
a) Series expansion 














λος 1.3 2et 1.3.5266 
L = 4a E(e) = 2ra | 5 e (; - ;) 3 (5 a 2) p a (3.329b) 
X3 0M 008 28$ (a — b) 
D= + b) 14 Eee ith A= 3 92 
η | 4 * 64 * 256 16384 | nis (a+b) (3.3290) 
b) Approximate formulas 
64 — 3A* 
Lm [1,5(a-- b) — Vab 3.329d Edo σης, 3.3290 
m[1,5(a + 6) — Vab] (3.329d) or Le mat ba (3.329e) 


c) Using table 21.9, p. 1103 for the complete elliptic integral of the second kind. 
d) Methods of numeric integration to determine the integral in (3.329a). 


W Fora — 1.5, b — Lone gets the following approximate values for L: according to (3.329e) L = 7.9327, 
according to (3.329e) 7.9327, by the help of table 21.9, p. 1103 L & 7.94 (see 8.1.4.3, BE p. 491) and 
by numeric integration the more exact value L z 7.932 721 . 


3.5.2.9 Hyperbola 


1. Elements of the Hyperbola In Fig. 3.156 AB — 2a isthe real axis; A, B are the vertices; 0 the 
midpoint; F| and F3 are the foci at a distance c > a from the midpoint on the real axis on both sides; 


CD = 2b = 2V@ — a? is the imaginary axis; p = b?/a the semifocal chord of the hyperbola, i.c., the 
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half-length of the chord which is perpendicular to the real axis and goes through a focus; e = c/a > 1 
is the numerical eccentricity. 


2. Equation of the Hyperbola The equation of the hy- 
perbola in normal form, i.e., for coincident x and real axes, 
and the equations in parametric form are 








a? y 2b 
d gh (3.3302) 
x — kacosht, y — bsinht (Coo « t « oo), | (3.330b) 
α π π 
tat y —btant (-5 «e 3^ (3.330c) 


Figure 3.156 


In polar coordinates see 3.5.2.11, 6., p. 208. 


3. Definition of the Hyperbola, Focal Properties The hyperbola is the locus of points for which 
the difference of the distances from two given points, the foci, is a constant 2a. The points for which 
r1—r3 — 2a belong to one branch of the hyperbola (in Fig. 3.156 on the left), the others with ro— r4 — 
2a belong to the other branch (in Fig. 3.156 on the right). These distances, also called the focal radii, 
can be calculated from the formulas 








Το -- Τι -- 





rı = +(ex—a); r= +(ex +a); -2a, (3.331) 
where the upper sign is valid for the right branch, the lower one for the left branch. Here and in the 
following formulas for hyperbolas in Cartesian coordinates it is supposed that the hyperbola is given 


in normal form. 


T 
Ἂς 
P(xyy,) 
r 
τι Ν 
E/A ΒΑΕ, x 








Figure 3.157 


Figure 3.158 


4. Directrices of the Hyperbola are the lines perpendicular to the real axis at a distance d = a/c 


from the midpoint (Fig. 3.157). | 





Every point of the hyperbola P(x,y) satisfies the equalities 


Τι T2 

ως ερ. 992 

à 4 € (3.332) 
5. Tangent of the Hyperbola at the point P(xo, yo) is given by the equation 

TTo — UUyo 

-------ι. à 

= he (3.333) 


The normal and tangent lines of the hyperbola at the point P (Fig. 3.158) are bisectors of the interior 

and exterior angles between the radii connecting the point P with the foci. The line Ax + By + C =0 

is a tangent line if the equation 
Au? — BY? — C? =0 

is satisfied. 

6. Asymptotes of the Hyperbola are the lines (Fig. 3.159) approached infinitely closely by the 

branches of the hyperbola for « — oo. (For the definition of asymptotes see 3.6.1.4, p. 252.) The slopes 


(3.334) 
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Figure 3.159 Figure 3.160 














of the asymptotes are k — + tand = +b/a. The equations of the asymptotes are 


y= (2) 2. (3.335 








a 
A tangent is intersected by the asymptotes, and they form a segment of the tangent of the hyperbola, 
i.e., the segment TT, (Fig. 3.159). The midpoint of the segment of the tangent is the point of contac 
P,so TP =7,P holds. The area of the triangle TOT; between the tangent and the asymptotes for any 
point of contact P is the same, and is 

Sror, = ab. (3.336 


The area of the parallelogram 0F' PG, determined by the asymptotes and two lines parallel to the 
asymptotes and passing through the point P, is for any point of contact P 








b 
Sorpa — 7. (3.337 
7. Conjugate Hyperbolas (Fig. 3.160) have the equations 
z y yY r 
a p 1 and pe i (3.338 


where the second is represented in Fig. 3.160 by the dotted line. They have the same asymptotes, 
hence the real axis of one of them is the imaginary axis of the other one and conversely. 





Figure 3.161 Figure 3.162 


8. Diameters of the Hyperbola (Fig. 3.161) are the chords between the two branches of the 
hyperbola passing through the midpoint, which is their midpoint too. Two diameters with slopes k and 
k' are called conjugate if one of them belongs to a hyperbola and the other one belongs to its conjugate, 


and kk’ = b? /a? holds. The midpoints of the chords parallel to a diameter are on its conjugate diameter 


(Fig. 3.161). From two conjugate diameters the one with |k| < b/a intersects the hyperbola. If the 
lengths of two conjugate diameters are 2a, and 2b;, and the acute angles between the diameters and 
the real axis are a and B « a, then the equalities 


at — be =a? —B?, ab= ab sin(a -— 8) (3.339) 
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are valid. 
9. Radius of Curvature of the Hyperbola At the point P(xo, yo) the radius of curvature of the 
hyperbola is 


Q2. ,2N 3/2 s a 3/2 ] 
R ev(s i i) ue e (3.3402) 











at pt ab sin?u 


where u is the angle between the tangent and the radius vector connecting the point of contact with a 
focus. At the vertices A and B (Fig. 3.156) the radius of curvature is 


p 

Rí-HRa-poc. (3.340b) 
10. Areas in the Hyperbola (Fig. 3.162) 
a) Segment APN: 

SApN — xy — abln (Z + 2) = «xy — ab Arcosh z ] (3.341a) 

a a 
b) Area OAPG: 
ab ab, 2d 
Soapa = * + 5 In zd (3.341b) 


The line segment PG is parallel to the lower asymptote, c is the focal distance and d = 0G. 
11. Arcofthe Hyperbola The arclength between two points A and B of the hyperbola cannot be 
calculated in an elementary way like the parabola, but it can be calculated by an incomplete elliptic 
integral of the second kind E(k, y) (see 8.1.4.3,2., p. 502), analogously to the arclength of the ellipse 
(see 3.5.2.8, BE p. 201). 
12. Equilateral Hyperbola has axes with the same length a — b, so its equation is 

-y =a. (3.342a) 
The asymptotes of the equilateral hyperbola are the lines y = +a; they are perpendicular to each 
other. If the asymptotes coincide with the coordinate axes (Fig. 3.163), then the equation is 





αυ (3.342b) 


α 
p* 
y 





xy 





(Xo Yo) 


Figure 3.163 Figure 3.164 Figure 3.165 
3.5.2.10 Parabola 


1. Elements of the Parabola In Fig. 3.164 the x-axis coincides with the axis of the parabola, 
0 is the vertex of the parabola, F is the focus of the parabola which is on the x-axis at a distance p/2 
from the origin, where p is called the semifocal chord of the parabola. The directrix is denoted by NN’; 
it is the line perpendicular to the axis of the parabola and intersects the axis at a distance p/2 from 
the origin on the opposite side as the focus. So the semifocal chord is equal to half of the length of the 
chord which is perpendicular to the axis and passes through the focus. The numerical eccentricity of 
the parabola is equal to 1 (see 3.5.2.11, 4., p. 207). 
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2. Equation of the Parabola Ifthe origin is the vertex of the parabola and the x-axis is the axis o 
the parabola with the vertex on the left-hand side, then the normal form of the equation of the parabola 
is 

y -2pz. (3.343 
For the equation of the parabola in polar coordinates see 3.5.2.11, 6., p. 208. For a parabola with 
vertical axis (Fig. 3.165) the equation is 

y — aa? 4 bx 4 c. (3.344a. 

1 


ADT. «9441 
Fjal (3.344b 


The parameter of a parabola given in this formis p — 


If a > 0 holds, the parabola is open up, for a < 0 it is open down. The coordinates of the vertex are 
b _ 4ac— p? 
2a’ no Δα ` 


3. Properties of the Parabola (Definition of the Parabola) The parabola is the locus of points 
P(x,y) whose distance from a given point, the focus, is equal to its distance from a given line, the 
directrix (Fig. 3.164). Here and in the following formulas in Cartesian coordinates the normal form 
of the equation of the parabola is supposed. Then holds the equation 





(3.344c 


t= 


PF =PK=2+ 5 i (3.345) 


where PF is the radius vector whose initial point is at the focus and endpoint is a point of the parabola. 
4. Diameter of the Parabola is a line which is parallel to the axis of the parabola (Fig. 3.166). 
A diameter of the parabola halves the chords which are parallel to the tangent line belonging to the 
endpoint of the diameter (Fig. 3.166). With slope k of the chords the equation of the diameter is 


y= v (3.346) 











Figure 3.166 Figure 3.167 Figure 3.168 


5. Tangent of the Parabola (Fig.3.167) The equation of the tangent of the parabola at the point 
P(2x0, yo) is 

YYo = p (T + zo) . (3.347) 
Tangent and normal lines are bisectors of the angles between the radius starting at the focus and the 
diameter starting at the point of contact. The tangent at the vertex, i.e., the y-axis, halves the segment 
of the tangent line between the point of contact and its intersection point with the axis of the parabola, 
the x-axis: 

TS=SP, T0O=0M=20, TF=FP. (3.348) 
A line with equation y = kx + b is a tangent line of the parabola if 

p=2bk. (3.349) 
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6. Radius of Curvature of the Parabola at the point P(xo, yo) with /, as the length of the norma 
PN (Fig. 3.167) is 


(p+) p B 


R= VEO BI i p (3.350a 
and at the vertex 0 it is 
R=p. (3.350b 


7. Areas in the Parabola (Fig. 3.168) 
a) Parabolic Segment PON: 


2 
Sopn = 3 mone (MQNP is a parallelogram). (3.351a. 
b) Area 0PR (Area under the Parabola Curve): 
231 
Sopr = A : (3.351b 


8. Length of Parabolic Arc from the vertex 0 to the point P(x,y) 


| (m e(t) (3.3522. 
p p p p 
-|z (s + 5) ο εαν N (3.352b 
2 2 p 


x . 2 : 
For small values of — the following approximation can be used: 
y 


lop S y h + τς) = (a : (3.352c) 


3.5.2.11 Quadratic Curves (Curves of Second Order or Conic Sections) 


NIS 


lop 





1. General Equation of Quadratic Curves (Curves of Second Order or Degree) 
The ellipse, its special case, the circle, the hyperbola, the parabola or two lines as a singular conic 
section are defined by the general equation of a quadratic curve (curve of second order) 

axr?’ +2bry+cy +2dr+2ey+f=0. (3.353a) 
This equation can be reduced to the normal form with the help of the coordinate transformations given 
in Tables 3.19 and 3.20. 
Remark 1: The coefficients in (3.353a) are not the parameters of the special conic sections. 
Remark 2: If two coefficients (a and b or b and c) are equal to zero, the required coordinate transfor- 
mation is reduced to a translation of the coordinate axes. 
The equation cy? 4- 2dx -- 2ey 4- f — 0 can be written in the form (y — yo)? = 2p(x — c9); 
the equation az? + 2dx + 2ey + f = 0 can be written in the form (x — ao)? — 2p (y — vo). 
2. Invariants of Quadratic Curves 
are the three quantities 


abd DE 
A=|bcel, ὃ- λε; ὅπατο (3.353b) 
def 














They do not change during a rotation of the coordinate system, i.e., if after a coordinate transformation 
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Table 3.19 Equation of curves of second order. Central curves (0 z 0)! 














Quantities 6 and A Shape of the curve 
Ellipse a) for A- S < 0: real, 
AF0 
b) for A - S > 0: imaginary? 
Central curves 5>0 
"E A=0 A pair of imaginary? lines with real common point 





AF0 Hyperbola 
6<0 A=0 





A pair of intersecting lines 









































d : i Normal form of the equation 
Required coordinate transformations an 
after the transformation 
1. Translation of the origin to the a 
center of the curve, whose coordinates are ar c-cy + m 0 
be — cd bd — ae 
20 = | yo ; 
0 5 » Yo 5 
2. Rotation of the coordinate axes by 
2b +e — cy! 4. 4p? 
the angle a with tan2a — : ü c atc y(a-c) t 4P 
α-- ο 9 
The sign of sin 2a must coincide with 
the sign of 2b. Here the slope at+e—y(a— οἱ 4 4p? 
of the new z'-axis is d= REM 
2 ; 
ks c— a+ y(c— a) +4? (a’ and ο΄ are the roots of the quadratic 
~ 2b ; equation u?—Su+6=0). 





1 A.ó and S are numbers given in (3.353b). 
? 'The equation of the curve corresponds to an imaginary curve. 


the equation of the curve has the form 


a'z? + 2b'x'y' + dy? + 2d'x' + 2e'y’ + f'=0, (3.353c) 
then the calculation of these three quantities A, ó, and S with the new constants will yield the same 
values. 

3. Shape of the Quadratic Curves (Conic Sections) 

If a right double circular cone is intersected by a plane, the result is a conic section. If the plane does 
not pass through the vertex of the cone, the result is a hyperbola, a parabola, or an ellipse depending 
on whether the plane is parallel to two, one, or none of the generators of the cone. If the plane goes 
through the vertex, the result is a singular conic section with A — 0. As a conic section of a cylinder, 
i.e., a singular cone whose vertex is at infinity, yield parallel lines. The shape of a conic section can be 
determined with the help of Tables 3.19 and 3.20. 


4. General Properties of Curves of Second Degree 

The locus of every point P (Fig. 3.169) with constant ratio e of the distance to 
a fixed point F, the focus, and the distance from a given line, the directrix, is a 
curve of second order with numerical eccentricity e. For e < 1 it is an ellipse, for 
e = l it is a parabola, for e > 1 it is a hyperbola. 


directrix~ 





Figure 3.169 
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Table 3.20 Equations of curves of second order. Parabolic curves (0 — 0) 











Quantity ó and A Shape of the curve 
Parabolic curves!, 6 =0 ἘΠΕ | 
Two lines: Parallel lines for d? — af > 0, 
A=0 Double line for d? — af — 0, 








Imaginary? lines for d? — af « 0. 








Normal form of the equation 


Required coordinate transformation after thetransformnation 





A#0 


1. Translation of the origin to the vertex of the 
parabola whose coordinates x and yo are defined 











d+ be ? = pa! 

by the equations azo + byo + τ 0 and P m 
dc — be ae — bd QI ee 
d+ 5 zo d [e 3 yo-f-0. SVa +b? 


2. Rotation of the coordinate axes by the angle a 
' α 2 . : 
with tana = τὸ the sign of sina must differ 


from the sign of a. 





A= 
ad+be , 


Vere” 
formed into the form 
(ν΄ - yo) (y’ — 1) = 0. 


Sy? +2 + f = 0 can be trans- 


Rotation of the coordinate axes by the angle a 
: a ‘ : : 
with tana = s the sign of sina must differ 


from the sign of a. 














1 In the case of 6 = 0 it is supposed that none of the coefficients a, b, c is equal to zero. 
2 The equation of the curve corresponds to an imaginary curve. 


5. Determination of a Curve Through Five Points 
There is one and only one curve of second degree passing through five given points of a plane. If three 
of these points are on the same line, then it is a singular or degenerate conic section. 
6. Polar Equation of Curves of Second Degree 
All curves of second degree can be described by the polar equation 

ge (3.354) 

1+ecosyp 

where p is the semifocal chord and e is the eccentricity. Here the pole is at the focus, while the polar 
axis is directed from the focus to the closer vertex. For the hyperbola this equation defines only one 
branch. 
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3.5.3 Analytical Geometry of Space 
3.5.3.1 Basic Concepts 


1. Coordinates and Coordinate Systems 

Every point P in space can be determined by a coordinate system. The directions of the coordinate lines 
are given by the directions of the unit vectors. In Fig. 3.170a the relations of a Cartesian coordinate 
system are represented. There are to distinguish right-angled and oblique coordinate systems where 
the unit vectors are perpendicular or oblique to each other. Another important difference is whether it 
is a right-handed or a left-handed coordinate system. 

'The most common spatial coordinate systems are the Cartesian coordinate system, the spherical polar 
coordinate system, and the cylindrical polar coordinate system. 

2. Right- and Left-Handed Systems 

Depending on the successive order of the positive coordinate directions there are to distinguish right 
systems and left systems or right-handed and left-handed coordinate systems. A right system has for 
instance three non-coplanar unit vectors with indices in alphabetical order €;,€;,é€,. These vectors 
form a right-handed system when an observer, looking on the €;,é€; plane and at the same time into 
the direction of &j, can rotate e; into e; along the smallest angle, i.e., clockwise; looking from the cusp 
of the vector E€ into the direction of the €, €; plane he can rotate €; into €; counterclockwise. A left 
system consequently requires the opposite rotation in both cases. The alphabetical order of rotation is 
represented symbolically in Fig. 3.34, p. 143 where the notations a, b, c are substituted for the indices 
i j,k. 





Figure 3.170 


Right- and left-handed systems can be transformed into each other by interchanging two unit vectors. 
The interchange of two unit vectors changes its orientation: A right system becomes a left system, and 
conversely, a left system becomes a right system. 

A very important way to interchange vectors is the cyclic permutation, where the orientation remains 
unchanged. As in Fig. 3.34 the interchange the vectors of a right system by cyclic permutation yields 
a rotation in a counterclockwise direction, i.e., according to the scheme (i > j > k > i,j > k > 
i > j,k => i —> j > k). In a left system the interchange of the vectors by cyclic permutation follows a 
clockwise rotation, i.e., according to the scheme (i > k > j > ik > j> i> k,j>i>k> j). 





A right system is not superposable on a left system. 

The reflection of a right system with respect to the origin is a left system (see 4.3.5.1, p. 288). 

E A: The Cartesian coordinate system with coordinate axes x, y, z is a right system (Fig. 3.170a). 
E B: The Cartesian coordinate system with coordinate axes x, z, y is a left system (Fig. 3.170b). 
W C: From the right system &;, €j, €j one gets the left system &;, €x, €j by interchanging the vectors 
6; and δι S 

E D: By cyclic permutation follows from the right system 6;, 6;, €; the right system | 6j, 6, €; and 
from this one κ, €j, €j, a right system again. 
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Table 3.21 Coordinate signs in the octants 





Octant I II III IV V VI VII | VIII 
x + 
y + | 






































3. Cartesian Coordinates 
of a point P are its distances from three mutually orthog- 
onal planes in a certain measuring unit, with given signs. 
They represent the projections of the radius vector T of 
the point P (see 3.5.1.1, 6., p. 182) onto three mutually 
perpendicular coordinate axes (Fig. 3.170). The inter- 
section point of the planes O, which is the intersection 
point of the axes too, is called the origin. The coordinates 
x, y, and z are called the abscissa, ordinate, and appli- 
cate. The written form P(a, b, c) means that the point P 
has coordinates x = a, y = b, z = c. The signs of the co- 
ordinates are determined by the octant where the point 
P lies (Fig. 3.171, Table 3.21). 

In a right-handed Cartesian coordinate system (Fig. 
3.170a) for orthogonal unit vectors given in the order 
Figure 3.171 €;, 6j, € the equalities 




















&xé&jzeé, Bdjxé —e, Gd xd--eé, (3.355a) 
hold, i.e., the right-hand lawis valid (see 3.5.1.5, p. 184). The three formulas transform into each other 
under cyclic permutations of the unit vectors. 

In a left-handed Cartesian coordinate system (Fig. 3.170b) the equations 

ej x 6; = -δε, 6; x ei = -6;, ei x e; = -6; (3.355b) 
are valid. The negative sign of the vector product arises from the left-handed order of the unit vectors, 
see Fig. 3.170b, i.e., from their clockwise arrangement. 

Notice that in both cases the equations 

Ei x E = ëj X Ej = ëk X Er = Õ (3.355c) 
are valid. Usually one works with right-handed coordinate systems; the formulas do not depend on this 
choice. In geodesy usually left-handed coordinate systems are in use (see 3.2.2.1, p. 144). 

4. Coordinate Surfaces and Coordinate Curves 

Coordinate Surfaces have one constant coordinate. In a Cartesian coordinate system they are planes 
parallel to the plane of other two coordinate axes. By the three coordinate surfaces x = 0, y = 0, 
and z — 0 three-dimensional space is divided into eight octants (Fig. 3.171). Coordinate lines or 
coordinate curves are curves with one changing coordinate while the others are constants. In Cartesian 
systems they are lines parallel to the coordinate axes. The coordinate surfaces intersect each other in 
a coordinate line. 


3.5.3.2 Spatial Coordinate Systems 


1. Curvilinear Three-Dimensional Coordinate Systems 

arise if three families of surfaces are given such that for any point of the space there is exactly one 
surface from every system passing through it. The position of a point will be given by the parameter 
values of the surfaces passing through it. The most often used curvilinear coordinate systems are the 
cylindrical polar and the spherical polar coordinate systems. 
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2. Cylindrical Polar Coordinates (Fig. 3.172) 

consist of: 

e the polar coordinates p and yọ of the projection of the point P to the x, y plane and 
e the applicate z of the point P. 

The coordinate surfaces of a cylindrical polar coordinate system are: 

e The cylinder surfaces with radius o = const, and the z-axis as axis of the cylinder, 
ο the half-planes starting from the z-axis, ip = const and 

e the planes being perpendicular to the z-axis, z = const. 

The intersection curves of these coordinate surfaces are the coordinate curves. 


The transformation formulas between the Cartesian coordinate system and the cylindrical polar coor- 
dinate system are (see also Table 3.22): [ 


xz = ocos, y= posin, z= z; (3.356a) 
y -. 8 
PEN pe 2 — aret: = arcs κά { ο ή 1 R I 
ο ντ, p=are A = ο or r0 (3.356b) 


For the required distinction of cases with respect to q see (3.290c), p. 192. 





Figure 3.172 Figure 3.173 
3. Spherical Coordinates or Spherical Polar Coordinates (3.173) 


contain: 

e The length r of the radius vector r of the point P, 

e the angle V between the z-axis and the radius vector r and 

e the angle o between the x-axis and the projection of r on the x, y plane. 

The positive directions (Fig. 3.173) here are for r from the origin to the point P, for ) from the z-axis 
toT, and for o from the x-axis to the projection of F to the x, y plane. With the values 0 < r < 00,0 < 
Ü «X m, and —7 < y < m every point of space can be described. 

Coordinate surfaces are: 

e Spheres with the origin 0 as center and with radius r — const, 

e circular cones with V = const, with vertex at the origin, and the z-axis as the axis and 

e closed half-planes starting at the z-axis with y = const. 

The intersection curves of these surfaces are the coordinate curves. 

4. Relations between Cartesian, Cylindrical and Spherical Polar Coordinates 

W Tho transformation formulas between Cartesian coordinates and spherical polar coordinates (see 
also Table 3.22) are: 





r-—rsinÜcosp, y=rsinvsiny, z=rcosv, (3.357a) 
r— a? a?-z?, 0 — arctan Py ; € — arctan x (3.357b) 
z x 


For the required distinction of cases with respect to y see (3.290c), p. 192. 
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Table 3.22 Relations between Cartesian, cylindrical, and spherical polar coordinates 





Cartesian coordinates | Cylindrical polar coordinates | Spherical polar coordinates 








[= = o cos p = r sin V cos y 
y= =osiny =rsinvsiny 
= =g =rcosv 

Va? 4 y? =0 —rsinü 
arctan 2 = = 

τα li Ξ =rcosv 





VFF - ez =r 


0 
arctan ————— — arctan — =v 
2 Ζ 


p = 








y 
arctan — 
x 








5. Direction in Space 
A direction in the space can be determined by a unit vector το (see 3.5.1.1, 6., p. 181) whose coordinates 
are the direction cosines, i.e., the cosines of the angles ag, Go, Yo between the vector and the positive 
coordinate axes (Fig. 3.174) 





l=cosag, m=cosfy, n=cosyp, P+m?+4+n?=1. (3.358a) 
The angle y between two directions given by their direction cosines 11, m1, m1 and lz, m2, ng can be 
calculated by the formula 


coso = lila + Mmm + ny No. (3.358b) 
Two directions are perpendicular to each other if 
lila + mim + n313 — 0. (3.358c) 





Figure 3.174 Figure 3.175 

3.5.3.3 Transformation of Orthogonal Coordinates 
1. Parallel Translation 
If the original coordinates are x, y, z, and a, b, c denote the coordinates of the origin of the new coor- 
dinate system in the old system (Fig. 3.175), then the new coordinates 2’, y’, 2’ satisfy the relations 

rata y-—yc-b z-—z-cco az'—-z—a, y-—-y-b z-—z-c. (3.359) 
2. Rotation of the Coordinate System 
The relation between the original coordinates z, y, z and the new coordinates z^, y’, 2’ after rotation is 
given by 


αν x 
ο = Dx. (3.360a) 
2 2 
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D is called the rotation matriz of the coordinate system. The special rotation matrix 


1 0 0 

D, (a) = 0 cosa sina ) (3.360b) 
0 — sina cosa 

describes rotations of the x, y, z system around the x-axis by the angel a. In analogy rotations of the 

x,y, z system around the y-axis by the angel 6 or around the z-axis by the angel y describe the following 

rotation matrices: 


cos B 0 — sin f cosy siny 0 
D,(8) = | 0 1 0 ) ] (3.360c) D.(7) = (^m 605 o) ‘ (3.360d) 
sing 0 cos 0 0 1 


Remark:The rotation ofthe x, y, z system around an arbitrary axis through the origin can be described 
with the help of direction cosines (see 3.5.3.4), Cardan angels (3.5.3.5) or Euler angels (3.5.3.6). 

3. Rotation of an Object 

In geometry two types of transformations are distinguished (see also 3.5.4, p. 229): 

a) coordinate transformations (the coordinate system is transformed), and 

b) geometric transformations (the position of a geometric object is changed in a fixed coordinate sys- 
tem). 

Consequently, at the rotation around an arbitrary axis through the origin the following rotations are 
distinguished 

a) rotation of the coordinate system (see (3.360a-d)) and 

b) rotation of an object in a fixed coordinate system. In this case: 


x'p = Rxp (3.361a) R-D!-DT. (3.361b) 


Here the formulas (3.361a,b) describe the relation between the coordinates £p, yp, zp of the initial po- 
sition of the object and its coordinates xp, Yp, Zp after the rotation. R is called the rotation matrix of 
the object. 

Remarks: 

1. Rotations around an arbitrary axis through the origin will have a suitable description with quater- 
nions (see 4.4.2.5, p. 297). 

2. Rotations around an arbitrary axis not running through the origin, will be discussed in the example 
on p. 235. 


3.5.3.4 Rotations with Direction Cosines 


1. Rotation ofthe Coordinate Axis 

If the direction cosines of the new axes are given as in Table 3.23, see also (Fig. 3.176), then for the 
new and old coordinates 

— hz! 4 loy  laz , 

— maa! -- may 4- maz! , (3.362b) 
= mt’ + nay’ + naz' 


z = he + my + nz, 
y' = lx + my + nəz, (3.362a) 


we oH 


z' — lav 4 may 4 naz ; 





holds. The coefficient matrix of the system (3.362a), which is called the rotation matrix D, and the 
determinant of the transformation A are 


L k k L b Ὦ 
D={m m mj], (3.362c) det D = A = mi ma ma|. (3.362d) 
n n Ng ny n3 ma 
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The following relations are valid: 


(4) -»(*) or ϱ) τε"). (3.3620) 











Table 3.23 Notation for the direction cosines under coordinate transformation 
With respect to the Direction cosine 
the old axes of the new axes 
/ 
a! y z' 
T [ῃ lə la 
y my mo ma 
zZ nı ng n3 

















2. Properties of the Transformation Determinant 

a) A = +1, with a positive sign if it remains left- or right-handed, as it was, and with negative sign if 
it changes its orientation. 

b) The sum of the squares of the elements of a row or column is always equal to one. 

c) The sum of the products of the corresponding elements of two different rows or columns is equal to 
zero (see 4.1.4, 9., p. 275). 
d) Every element can be written as the product of A = +1 and its adjoint (see 4.2.1, p. 278). 

3. Scalar Invariant 

This is a scalar which keeps its value during translation and rotation. The scalar product of two vectors 
is a scalar invariant (see 3.5.1.5, 3., p. 185). 








E A: The components of a vector a = (a4, a2, a3] are not scalar invariants, because they change their 
values during translation and rotation. 


: The length of a vector à — {a1, a2, a3}, i.e., the quantit a a ag, is a scalar invariant. 
WB B: The length of a vector à 1, a2, a3}, i.e., the quantity \/a? + a3 + a3, is a scalar invariant 


W C: The scalar product of a vector with itself is a scalar invariant: 
aa = a? = |a|? cos p = |a?|, because y = 0. 


3.5.3.5 Cardan Angles 
1. Definition of the Cardan angles 


Every rotation of a coordinate system around an arbitrary axis through the origin can be described by 
three successive rotations around the coordinate axis. The angles of rotation o; f), ^; are called the 
CARDAN-angles, if the rotations are performed in the following or- 
der (see schematically Fig.3.176): 

1. The first rotation is around the z-axis by an angle y, and results 
in the a), y™, 2 coordinate system with οἱ1) = 2. 

2. The second rotation is made around the image of the y-axis by 
the first rotation, i.e. around the y™) axis by an angle 8, and results 
the «@), y®), z@) coordinate system with y® = y™, 

3. The third rotation is made around the image of the a-axis by the 
second rotation, i.e. around the c-axis by an angle a. It results 
the finally required z/, y, z’ coordinate system, x’ = z99 — z(2, 
yf =y®, 2 = 2). 

Remark: There are different definitions of the Cardan-angles given 
Figure 3.176 in the literature. 
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2. Calculation of the rotation matrix Dc 
It follows from the given order of the rotations (3.360a-d) that the rotation matrix D =Dc is 

De = Dz(a)Dy(8)D.(1)- (3.363a) 
According to the Falk’s schema (Fig. 4.1, 4.2, p. 273) holds 


D;(y) 
D,(5)D 






,ie. (3.363b) 


cosy siny 0 
— sin y cos y 0 








0 0 1 
cos B 0 — sin B | cos B cos^ cos D sin — sin 8 
0. 1. 4 — sin cos y 0 
sing 0 cos |cosysinĝ sin siny cos 8 
1 0 0 cos D cos ¥ cos f sin — sin B 


0 cosa sina|—cosasiny+sinacosysinfB cosacosy+sinasinfsiny sina cos 8 
0 —sina cosa] sinasin y + cosa cos ysin 8 cos a sin f sin^j — sina cos^ cosa cos f 








cos f) cos ^ cos f sin y — sin f 
Dc = | — cosasin y + sina cosysin f cosacosy+sinasinfsiny sinacosp |. (3.363c) 
sinasin y + cosacos ysin cos asin f sin y — sina cosy cosa cos 8 


3. Direction Cosines as Function of the Cardan angles 
Since the rotation matrices D (in (3.362c), p. 213) and Dc (in (3.363c)) coincide, the direction cosines 
can be expressed as functions of the Cardan-angles: 


li = C263, Mı = C253, nı = 89; 
lə = —c153 + 51C352, M2 = Cq€3 4-818253, N2 = S102; (3.364a) 
13 = 8183 + €1C382, M3 = C1S283 — S1C3, N3 = C1C3 


C1 = CO0OS@, c9-—c0osf, ca-—cosy, 
s5;— Sina, s,=sinf, s3;=siny. 


(3.364b) 


3.5.3.6 Euler's angles 


1. Definition of the Euler angles 
The position of the new coordinate system with respect to the old one can be uniquely determined by 
three angles, introduced by Euler (Fig. 3.177). 


a) The nutation angle 9 is the angle between the positive halves 
of the z- and z’-axis there is 0 < J < 7. 

b) The precession angle w is the angle between the positive di- 
rection of the x-axis and the intersection line K ofthe x, y plane and 
the z', y' plane. The positive direction of K is chosen depending on 
whether the z-axis, the z'-axis and K form a direction triplet with 
the same orientation as the coordinate axes (see 3.5.1.3, 2., p. 183). 
The angle v is measured from the x-axis to the direction of the y- 
axis; there is 0 < Y < m. 

c) The rotation angle ¢ is the angle between the positive halves 
of the z'-axis and the intersection line K; thereis 0 €  « 27. 
Remark: In the literature there are also other definitions in use for 
Figure 3.177 the Euler angles. 
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2. Calculation of the rotation matrix Dg 

The transition from coordinate system x, y, z into the coordinate system 2’, y’, 2’ (Abb.3.177) can be 
given by three rotations considering (3.360a-d)) as follows: 

1. The first rotation is around the z-axis by angle v, and results the coordinate system a9, (9, (D, 
where 2 = z: 


av cosy sinw 0\ /ax 
y® | = | -siny cosy 0 y |, ie x? — D,(U)x. (3.365a) 
(1) 0 0 1 


< 


The axis « coincides with the intersection line K. 


2. The second rotation is around the z®-axis by angle 9, and results the coordinate system z”, y®, 2, 
where z®) = ¢@: 


p 1.0 ον (τὸ 
ya |= (° cos an yO |, ie. x? 2 D,(9)xO . (3.365b) 
40) 0 — sin Ü cos 9 2) 


3. The third rotation is around the z-axis by angle v, and results the final coordinate system z^, y, 2’, 
where 2! = z@ 





a cosp sing 0X f zO 
y' | = | -siny cosy 0 | | y2 |, ie. x’ =D,.(y)x®. (3.365c) 
2 0 0 1 {9 
All together for D = Dg in (3.360a) holds: 
Dg — D(o)D,(0)D(v). (3.3662) 
Analogous to the calculation of the rotation matrix Dc using Falk's schema, here in the form 
D.(W) 
Ῥ.(9) D,(/)Di(v) one gets (3.366b) 
D.(¢ D.(y)D.(¥)D.(¥) = Dg 





cos ọ cos Y% — sin q cos Ü sin t cos ysin Y + sin p cos cosy singsind 
Deg = | — sin y cosy — cos y cos Ù siny  —sinqsin v 4- cospcosü cosy cosysind | (3.366c) 
sin Ü sin i — sin Ü cos v σος Ὁ 


3. Direction Cosines as Function of the Euler angles 
Because of the identity of the rotation matrices D (see (3.362c), p. 213) and Dz (see (3.366c) the 
following formulas for the direction cosines as functions of the Euler angles are valid: 


lj — C3€3 — €1 8253, Mı = 8203 + C1C283 , ny = 8183; 
lg = —C983 — C1 89C3, Mg = —SQ83+C1C29C3, N2 = $1C3; (3.367a) 
l3 = $182, m3 = —S1C2, ng = Cy 
sp Cy =cosv, ce=cosy, C3 = COS 
with © | B P, (3.367b) 


$,=sinv’, sg=sinw, 53 =sing. 
3.5.3.7 Special Quantities in Space 
1. Coordinates of the Center of Mass 
The coordinates of the center of mass P(Z,¥,Z) (often called incorrectly the center of gravity) of a 


system of n material points P;(z;, yi, z;) with mass m; are calculated by the following formulas, where 
the sum index i changes from 1 to n: 


D miti j D miyi 2 E muzi 
Em ? Em Ems 








(3.368) 


3.5 Vector Algebra and Analytical Geometry 217 


2. Division of a Segment 
The coordinates of the point P(x, y, z) dividing a segment between the points P; (x1, y1, 21) and P3 (x2, 
Y2, z2) in a given ratio 



















P,P m 
— 3.369a 
PP, n ( a) 
are given by the formulas 
nj mro 2 +Are nyc mys Wi Aye 
z = —— ——— -—— ——, (3.369b ------Ἕεεξ-----, 3.369c 
E n+m 1+A | ( ) E ndm 1+A ( o) 
natma zy + Az (3.3694) 
n+m 1-A ^ ` 
The midpoint of the segment is given by 
Ὧι + T2 Yı + Y2 Z tz 4 
Lm = > Ym = 1 m= ] 3.369e 
7 2 v 2 2 (3-3098) 
Ζ P 3. Distance Between T'wo Points 
24322 22) i ] : : 
The distance between the points P, (21,41, 21) and P (x2, y2,22) in 
δ Fig. 3.178 is 
P(x,y,z) = 11)” + (Yo — yi)” + (22 — 21)”. (3.370a) 
P(X 7! ,Ζι) The direction cosines of the segment between the points can be calculated by 






the formulas 
cosa — 22 1 , cos B= a , cosy= ay. (3.370b) 
Figure 3.178 d d d 


4. System of Four Points 

Four points P(x, y, z) , Pi (1, y1, 21) , Pe (2, yo, 22) and P3 (3, y3, 23) can form 
a tetrahedron (Fig. 3.179) or they are ina plane. The volume of a tetrahedron 
can be calculated by the formula 





T 1 x Tı Y Yı z 
limuzl 1 τι EM — 
Pss al —-65-929—922—72, (3.371) 
23 y3 23 1 T — T3 Y — Y3 Z — 38 


X ux y 
Figure 3.179 


— — — 
where it has a positive value V > 0 if the orientation of the three vectors PP; , PP2, PP3 is the same 
as the coordinate axes (see 3.5.1.3, 2., p. 183). Otherwise it is negative. 

The four points are in the same plane if and only if 

gy el 

Titi y holds, (3.372) 

T2 Y2 Z2 1 

T3 Y3 Z3 1 
3.5.3.8 Equation of a Surface 
Often an equation 

F(z,y,z) 20 (3.373) 
corresponds to a surface with the property that the coordinates of every of its points P satisfy this 
equation. Conversely, every point whose coordinates satisfy the equation is a point of this surface. The 
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equation (3.373) is called the equation of this surface. If there is no real point in the space satisfying 
equation (3.373), then there is no real surface. 

1. The Equation of a Cylindrical Surface (see 3.3.4, p. 156) whose generating lines are parallel 
to the x-axis contains no z coordinate: F(y, z) — 0. Similarly, the equations of the cylindrical surfaces 
with generating lines parallel to the y- or to the z-axes contain no y or z coordinates: F(r,z) — 0 or 
F(v,y) — 0 resp. The equation F(x, y) = 0 describes the intersection curve between the cylinder and 
the x,y plane. If the direction cosines, or the proportional quantities |, m,n of the generating line of a 
cylinder are given, then the equation has the form 


F(na — lz,ny — mz) =0. (3.374) 
2. The Equation ofa Rotationally Symmetric Surface, i.e., asurface 


which is created by the rotation of a curve z = f(x) given in the x, z plane 
around the z-axis (Fig. 3.180), will have the form 


κα (νεα) ὶ (3.375) 


The equations of rotationally symmetric surfaces can be obtained similarly 
in the case of other variables as well. 





W The equation of a conical surface, whose vertex is at the origin (see 3.3.4, 
p. 157), has the form F(r, y, z) — 0, where F is a homogeneous function of 


the coordinates (see 2.18.2.5, 4., p. 122), e.g. F(zx,y,z) 2 z— Va? cy? —0 Figure 3.180 
with the degree 1 of homogeneity, i.e. F(Ax, Ay, Az) = AF (2, y, z). 
3.5.3.9 Equation of a Space Curve 
A space curve can be defined by three parametric equations 
r=yilt), y= alt), z= p(t). (3.376) 
To every value of the parameter t, which does not necessarily have a geometrical meaning, there corre- 


sponds a point of the curve. 
Another method to define a space curve is the determination by two equations 


Fy(x,y,z)=0, F(a, y,z) =0. (3.377) 
Both define a surface. The space curve contains all points whose coordinates satisfy both equations, 
i.e., the space curve is the intersection curve of the given surfaces. In general, every equation in the 
form 

ΕΙ ΓλΙΟΞ 0 (3.378) 
for arbitrary A defines a surface which goes through the considered curve, so it can substitute any of 
the equations (3.377). 


3.5.3.10 Line and Plane in Space 
1. Equations of the Plane 


Every equation linear in the coordinates defines a plane, and conversely every plane has an equation of 
first degree. 

1. General Equation of the Plane 

a) with coordinates: Ax + By+Cz+D=0, (3.379a) 


b) in vector form: FN + D=0," (3.379b) 





*For the scalar product of two vectors see 3.5.1.5, p. 184 and in affine coordinates see 3.5.1.6, 5., p. 186; for the 
vector equation of the plane see 3.5.1.7, p. 189. 
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where the vector N(A, B,C) is perpendicular to the plane. In (Fig. 3.181) the intercepts a, b, and c 


are shown. The vector N is called the normal vector of the plane. Its direction cosines are 














A B σ . 
cosa = a 008 B = ATETOS y= Tara (3.379c) 


If D — 0 holds, the plane goes through the origin; for A — 0, or B — 0, or C — 0 the plane is parallel 
to the x-axis, the y-axis, or the z-axis, respectively. If A = B = 0, or A = C = 0, or B = C = 0, then 
the plane is parallel to the x, y plane, the x, z plane or the y, z plane, respectively. 

2. Hessian Normal Form of the Equation of the Plane 

a) with coordinates: x cosa + y cos 8 + z cosy -— p = 0, (3.380a) 


b) in vector form: EN°— p=0, (3.380b) 








where N° is the unit normal vector of the plane and p is the distance of the plane from the origin. The 
Hessian normal form arises from the general equation (3.379a) by multiplying by the normalizing factor 
1 1 

oe A? + B2 + C2 
Here the sign of jj must be chosen opposite to that of D. 

3. Intercept Form of the Equation of the Plane With the segments a, b, c, considering them 
with signs depending on where the plane intersects the coordinate axes (Fig. 3.181) holds: 


Aeque (3.381) 


a b c 





with N = |Ñ]. (3.380c) 


4. Equation of the Plane Through Three Points If the points are P; (x1, y1, 21) , Po (2, ye, 22) , 
P3 (x3, y3, 23) , then it holds: 








L-— fy Yryl £—£«1 

a) with coordinates: | 22 — 2) Y2 — Yı 22— 43 | — 0, (3.382a) 
ποτ T1 Y — Y 23 ~ 71 

b) in vector form: (r£ — £i) (F — τῳ) (F — r3) — 0f. (3.382b) 


5. Equation of a Plane Through Two Points and Parallel to a Line 
The equation of the plane passing through the two points Pj (σι, y1, 21); P» (12, ys; 22) and being parallel 


to the line with direction vector R(, m, n) is the following 








Z-% Y-Y 2-4 
a) with coordinates: |z2— x192— 9i 22 — 21| = 0, (3.3832) 
l m n 
b) in vector form: (£ — £i) (E — i5) R. — 0f. (3.383b) 


6. Equation of a Plane Through a Point and Parallel to Two Lines 
If the direction vectors of the lines are Ri (l, mı, nı) and Rə (l2, M2, n2), then it holds: 








T— Ti Y— Yı Z- 21 
a) with coordinates: [ῃ mi ni ΙΞ0, (3.3842) 
lj ma ng 
b) in vector form: (5-- τι) R.Ra — 0f. (3.384b) 


7. Equation of a Plane Through a Point and Perpendicular to a Line 
If the point is P, (z1, y1, z1), and the direction vector of the line is N(A, B,C), then it holds: 
a) with coordinates: A(x -— x1) + B (y — yı) +C (z — z1) =0, (3.385a) 





For the mixed product of three vectors see 3.5.1.6, 2., p. 185 





220 3. Geometry 





b) in vector form: (r—r)N-0.* (3.385b) 


8. Distance of a Point from a Plane Substituting the coordinates of the point P(a, b, c) in the 
Hessian normal form of the equation of the plane (3.3802) 


xcosa + y cos B -- z cosy — p — 0, (3.3862) 
results in the distance with sign 
ô = acosa + bcos 8 + ccosô — p, (3.386b) 


where 6 > 0, if P and the origin are on different sides of the plane; in the opposite case ὃ « 0 holds. 

9. Equation of a Plane Through the Intersection Line of Two Planes The equation of a plane 
which goes through the intersection line of the planes given by the equations A42 4- B1y 4- Cz 4- Dı = 0 
and Aga + Boy + Coz + Do = 0 is 


a) with coordinates: Ax + Biy + Ciz + Dı + A (Agr + Boy + C22 + D2) = 0. (3.387a) 


b) in vector form: FÑ; + Dı + ACN, t D3) — 0. (3.387b) 


Here A is a real parameter, so (3.387a) and (3.387b) define a pencil of planes. Fig. 3.182 shows the 
case with three planes. If A takes all the values between —oo and +00 in (3.387a) and (3.387b), so this 
yields all the planes from the pencil. For A = +1 it results in the equations of the planes bisecting the 
angle between the given planes if their equations are in normal form. 





2. Two and More Planes in Space 


1. Angle between Two Planes, General Case: The angle y between two planes given by the 
equations Aya + Biy + Ciz + Dı = 0 and Agr + Boy + Coz + D2 = 0 can be calculated by the formula 


Aj Ag + By By + C1C2 

















cos à (3.3882) 
V/A - Bt - CT) (43 + BF + C3) 
If the planes are given by vector equations r Νι + D, =Oandr No + D = 0, then 
ΝΙΝ 3 3 
cosy = == with M = |N;] and M: = [Ns]. (3.388b) 
ΝΙΝ, 
Υ 





Figure 3.181 Figure 3.182 Figure 3.183 


2. Intersection Point of Three Planes: The coordinates of the intersection point of three planes 
given by the three equations Aix 4- B1y 4- C12 4- Di — 0, Agr + Boy + Coz + Dy = 0, and Aga + B3y+ 
C3z + D3 = 0, are calculated by the formulas 

—Amq. —Ay —A 


Lo 2 * j 
cA 2 YO 5 A with (3.3892) 


r= 





*For the scalar product of two vectors see 3.5.1.5, p. 184 and in affine coordinates see 3.5.1.6, p. 186; for the equation 
of the plane in vector form see 3.5.1.6, p. 189. 
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Αι Bı σι Dı Bı σι Αι Dı Ci Αι Bı Dı 
Δ- A» Bo C3 > Ag = Dz Bə Co 3 Ay = EU Dz C3 ; Az = A» Bo Də ὃ (3.389b) 
Aa B3 C3 Da B3 C3 A3 Da C3 A3 B3 D3 























Three planes intersect each other at one point if A Æ 0 holds. If A = 0 holds and at least one subde- 
terminant of second order is non-zero, then the planes are parallel to a line; if every subdeterminant of 
second order is zero, then the planes have a common line. 

3. Conditions for Parallelism and Orthogonality of Planes: 

a) Conditions for Parallelism: Two planes are parallel if 


Αι Bı Cı 


ε-π-ο τ Ni x No =0_ holds. (3.390) 
b) Conditions for Orthogonality: Two planes are perpendicular to each other if 
Ay Ay + ByB2+CiC2=0 or ΝΙΝ» --0 holds. (3.391) 


4. Intersection Point of Four Planes: Four planes given by the equations Ayw+Byy+C1z+D, = 
0, Aga + Bzy + C2z + D2 = 0, Asx + Bay 4- Caz 4- Da — 0, and A4z 4- Byy + Cyz+ Dy = 0 havea 
common point only if for the determinant 

Αι Βι Οι Dı 
A2 B2 C2 D2 
As Bs C3 Da 
Ag By Cy Da 
holds. In this case the common point is determined from three equations, the fourth equation is super- 
fluous; it is a consequence of the others. 

5. Distance Between Two Parallel Planes: If two planes are parallel, and they are given by the 
equations 


= =0 (3.392) 


Ax 4- By - Cz Di —0 and Ar + By- Cz- D3 — 0, (3.393) 
then their distance is 
[Di — Dj 
d — —————————. 3.394 
‘A? + B? +C? ( ) 





3.5.3.11 Lines in Space 


1. Equations of a Line 

1. Equation of a Line in Space, General Case Because a line in space can be defined as the 
intersection of two planes, it can be represented by a system of two linear equations. 

a) In component form: 


Air d Biy Ciz Di —0, Aor + Boy + Coz + D2 = 0. (3.395a) 
b) In vector form: 

FÑ, +D =0, FÑ, +D: =0. (3.395b) 
2. Equation of a Line in Two Projecting Planes 
The two equations y=ka+a, z=ha+b (3.396) 


define a plane each, and these planes go through the line and are perpendicular to the x, y and the z, z 
planes resp. (Fig. 3.183). They are called projecting planes. This representation cannot be used for 
lines parallel to the y, z plane, so in this case other projections to other coordinate planes are considered. 
3. Equation of a Line Through a Point Parallel to a Direction Vector 

The equation (or the system of equations) of a line passing through a point P, (1, y1, 21) parallel toa 
direction vector R(1,m,n) (Fig. 3.184) has the form 

a) in component representation and in vector form: 


222 3. Geometry 












Pies uy iz 
Pie Yq 21) 


i 
i 
1 
i 
y 





Figure 3.184 Figure 3.185 Figure 3.186 
αν BEIM (3.3972) (F-rj)xR-0, (3397) 
1 m n 


b) in parametric form and vector form: 
αξσπιεἰί, y-—yupdmt, z—z-cnti; (33976) rF=r,+Rt, (3.397d) 


where the numbers 2, y1, z1 are chosen such that the equations in (3.395a) are satisfied. The repre- 
sentation (3.397a) follows from (3.395a) with 











-[Βιςι JCA] | Ay By . 
"mel πο Αι πας Bal (3.3982) 
or in vector form R= N, x Np. (3.398b) 


4. Equation ofa Line Through Two Points The equation ofa line through two points Pi (21, y1, z1) 
and P» (r5, ys, 22) (Fig. 3.185) is 
in component form and in vector form: 


r—i y— yı z= zı T 
a) -----------πι 3.399a b) (r-r)x(r-r)c-o0:. 3.399b 

) T2 — τι Y2 — Yı Z2 — 21 ( ) ( 1) ( 2) ( ) 
y-u *-—4A 
Y2 — Yı Z2 — 21 
and yı = y2 are both valid, the equations in component form are x = £2, Y = Y2- 


If for instance xı = z2, the equations in component form are £ = ο, . Ifvy = x 








5. Equation of a Line Through a Point and Perpendicular to a Plane The equation of a 
line passing through the point Pi(x1, y1, zi) and being perpendicular to a plane given by the equation 
Ax + By 4- Cz 4 D —0 orby £N 4- D — 0 (Fig. 3.186) is 
in component form and in vector form: 
T — Ti y — yı Z= i E b" = ~ 
B = 3.4002 — =0. 4 
a) A B a? ( a) b) (f-7r1)xN=0 (3.400b) 


If for instance A = 0 holds, the equations in component form have a similar form as in the previous 
case. 











2. Distance of a Point from a Line Given in Component Form 
For the distance d of the point M (a, b, c) from a line given in the form (3.397a) holds: 
[(a — z1) m — (b — yi) I 4- (b —4) n — (e — z) mP — (e — 2)1— (a — zi) nP 


= 
P +4+m?+4n? 





(3.401) 





*For the product of vectors see 3.5.1.5, p. 184 
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3. Smallest Distance Between T'wo Lines Given in Component Form 
If the lines are given in the form (3.3972), their distance is 











Tı — T2 Yı — Y2 21 — 72 
Æ l my ny 
ly m» n3 
d- ‘ (3.402) 
2 5 7 
hm; mı nı ny ly 
ln Ma m3 n3 ng ly 




















If the determinant in the numerator is equal to zero, the lines intersect each other. 
3.5.3.12 Intersection Points and Angles of Lines and Planes in Space 


1. Intersection Points of Lines and Planes 
1. Equation of the Line in Component Form The intersection point of a plane given by the 
equation 

= Ly, y—u 2 AA 


Ax + By + Cz + D = 0, and a line given by Lom = has the coordinates 
m n 

















---αι-ἰρ, j—y-mp, F=2-—Np ~~ with (3.403a) 
B Ax; + By, +Cz,+D 
PAI+ BmtOn C 
If Al+ Bm+Cn = 0 holds, then the line is parallel to the plane. If Av; + By, + Cz, + D = 0 is also 
valid, then the line lies in the plane. 
2. Equation of the Line in Two Projecting Planes The intersection point of a plane given by 
the equation Ax 4- By -- Cz 4- D — 0, and a line given by y = ka +a, and z = ha +b has the coordinates 
Ba+Cb+D 
A+Bk+Ch’ 
ΙΑ B k -- C h — 0 holds, then the line is parallel to the plane. If Ba -- Cb 4- D — 0 is also valid, then 
the line lies in the plane. 


3. Intersection Point of Two Lines If the lines are given by y = kız + a, z = hx + bı and 
y = kax + ag, z — hax 4 b, then the coordinates of the intersection point, if any exists, are: 


(3.403b) 





y-—kz-a, z-—hz-b. (3.404) 


i- 


_ ag — ay, ba = by = kya = koay = 10ο E hob, 
t= = . Y= z= 
kı = Κο hy = hy d ky as ky ] hy -- ha 
The intersection point exists only if 


(a1 — ag)(hy — he) — (by — b3) (ky — ka). (3.405b) 


Otherwise the lines do not intersect each other. 





(3.4052) 


2. Angles between Planes and Lines 
1. Angle between Two Lines 
ατα y—1 ᾱ-- 9 dod 
a) General Case: If the lines are given by the equations LoL M 1 and 22 
h mi ny lə 
=í z-z > > = = 
992 27 7? orin vector form by (f — rj) x R4 — 0 and (r — r9) x Ra — 0, then for the angle of 
meg ng 
intersection between them 


Ll 
cosy = ο hlatmima t mna gn (3.4062) 


(B +m? +n?) (B+ m+ n?) 





2 


R m, With Bi = [R;| and Ro = [Rs]. (3.406b) 





cosy = 
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b) Conditions of Parallelism: Two lines are parallel if 


Π m m 


or R,xR2=0. (3.407) 


lj; ma m 
c) Conditions of Orthogonality: Two lines are perpendicular to each other if 


lilo + mim + nın = 0 or RıRz = 0. (3.408) 


2. Angle Between a Line and a Plane 
a) If the line and the plane are given by the equations Ž a H UTU E dA + By+Cz+ 
m n 


D — 0 or in vector form by (f — 14) x Ἡ -- ð and FÑ + D = 0, one gets the angle y by the formulas 
Al+ Bm 1 
sin y = E NERMAN d or (3.4092) 
(A? + B? + C?) (P +m? +n?) 














sing = ay with R = |R] and N = |Ñ]. (3.409b) 


b) Conditions of Parallelism: A line and a plane are parallel if 


Al+Bm+Cn=0 oœ  RN-0. (3.410) 
c) Conditions of Orthogonality: A line and a plane are orthogonal if 

A B CX ECT 

— = = g or RxN=0. (3.411) 


l m n 


3.5.3.13 Surfaces of Second Order, Equations in Normal Form 


1. Central Surfaces 


The following equations, which are also called the normal form of the equations of surfaces of second 
order, can be derived from the general equations of surfaces of second order (see 3.5.3.14, 1., p. 228) by 
putting the center at the origin. Here the center is the midpoint of the chords passing through it. The 
coordinate axes are the symmetry axes of the surfaces, so the coordinate planes are also the planes of 
symmetry. 
2. Ellipsoid 
With the semi-axes a, b, c (Fig. 3.187) the equation of an ellipsoid is 
2 2 2 

È y z | 

m + p + 37 1; (3.412) 
The following special cases are to be distinguished: 
a) Compressed Ellipsoid of Revolution (Lens Form): a = b > c (Fig. 3.188). 
b) Stretched Ellipsoid of Revolution (Cigar Form): a = b < c (Fig. 3.189). 
c) Sphere: a = b = c so that £? + y? + 2? = a? is valid. 
The two forms of the ellipsoid of revolution arise by rotating an ellipse in the x, z plane with axes a and 
c around the z-axis, and one gets a sphere if rotating a circle around any axis. If a plane goes through an 
ellipsoid, the intersection figure is an ellipse; in a special case it is a circle. The volume of the ellipsoid 
is 

4rabc 


= : Al 
va) (3.413) 
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Figure 3.187 Figure 3.188 


3. Hyperboloid 


a) Hyperboloid of One Sheet (Fig. 3.190): With a and b as real and c as imaginary semi-axes 
the equation is 

Soy og 

d + yo a 1 (for generator lines see p. 226). (3.414) 
b) Hyperboloid of Two Sheets (Fig. 3.191): With cas real and a, b as imaginary semi-axes the 
equation is 

2 2 2 

z^ y^ z I 

en (3.415) 
Intersecting it by a plane parallel to the z-axis results in a hyperbola in the case of both types of hy- 
perboloids. In the case of a hyperboloid of one sheet the intersection can also be two lines intersecting 
each other. The intersection figures parallel to the x, y plane are ellipses in both cases. 
For a = b the hyperboloid can be represented by rotation of a hyperbola with semi-axes a and c around 
the axis 2c. This is imaginary in the case of a hyperboloid of one sheet, and real in the case of that of 
two sheets. 

















Figure 3.189 Figure 3.190 Figure 3.191 


4. Cone (Fig. 3.192) 
If the vertex is at the origin, the equation is 
2 2 2 

x y z 

Meer f dy. 3.416 

a2 B d ( ) 
As a direction curve can be considerd an ellipse with semi-axes a and b, whose plane is perpendicular 
to the z-axis at a distance c from the origin. The cone in this representation can be considered as the 
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asymptotic cone of the surfaces 


2 2 2 
Ete had (3.417) 


a2 b2 ea D 





whose generator lines approach infinitely closely both hyperboloids at infinity (Fig. 3.193). Fora — b 
there is a right circular cone (see 3.3.4, 9., p. 157). 





Figure 3.192 Figure 3.193 
5. Paraboloid 


Since a paraboloid has no center, in the following section it is supposed that the vertex is at the origin, 
the z-axis is its symmetry axis, and the x, z plane and the y, z plane are symmetry planes. 
a) Elliptic Paraboloid (Fig. 3.194): 
2 2 
goy ; 

= + z (3.418) 
The plane sections parallel to the z-axis result in parabolas as intersection figures; those parallel to the 
x,y plane result in ellipses. 
The volume of a paraboloid which is cut by a plane perpendicular to the z-axis at a distance h from the 
origin is given as (see also Fig.3.194) 


d = 
V= jTabh, (3.419) 


The parameters @ = aVh and 6 — bv/h are the half axis of the intersecting cllipse at height h. So, 
(3.419) yields the half of the volume of an elliptic cylinder with the same upper surface and altitude 
(compare with (3.328a), p. 201). 

b) Paraboloid of Revolution: For a — b one gets a paraboloid of revolution. It can be generated by 
rotating the z — z?/a? parabola of the z, z plane around the z-axis. 


c) Hyperbolic Paraboloid (Fig. 3.195): 
2 2 
Xx y 4 

t= gb (3.420) 
The intersection figures parallel to the y, z plane or to the x, z plane are parabolas; parallel to the x, y 
plane they are hyperbolas or two intersecting lines. 
6. Rectilinear Generators of a Ruled Surface 
These are straight lines lying completely in this surface. Examples are the generators of the surfaces of 
the cone and cylinder. 
a) Hyperboloid of One Sheet (Fig. 3.196): 

2 p 2 


=1. (3.421) 
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Figure 3.194 Figure 3.195 
The hyperboloid of one sheet has two families of rectilinear generators with equations 
= ee u(i 3 «(s =) 24-2, (3.4222) 
a c b a c b 
Es cu (1 N, Je 3 2142, (3.422b) 
a c b a c b 


where u and v are arbitrary quantities. 
b) Hyperbolic Paraboloid (Fig. 3.197): 
2 2 


x y | 
t= ae (3.423) 
The hyperbolic paraboloid also has two families of rectilinear generators with equations 
ae νε u, u 6 - η) —z; (3.424a) m dec U, U (Z + η) =z. (3.424b) 
a b a b a b a b 


The quantities u and v are again arbitrary values. In both cases, there are two straight lines passing 
through every point of the surface, one from each family. Only one family of straight lines is denoted 
in Fig. 3.196 and Fig. 3.197. 





Figure 3.196 Figure 3.197 

7. Cylinder 
ας. η 

a) Elliptic Cylinder (Fig. 3.198): + " =1. (3.425) 
a 

: : : Doy 
b) Hyperbolic Cylinder (Fig. 3.199): — — BC i (3.426) 
a 


c) Parabolic Cylinder (Fig. 3.200): — y? — 2px. (3.427) 
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Figure 3.199 


Figure 3.200 


Figure 3.198 


3.5.3.14 Surfaces of Second Order or Quadratic Surfaces, General Theory 


1. General Equation of a Surface of Second Order 


aur? + a224? + a332? + Qayory 4- 2a23yz + 2a31 24 + 2a44T + 2az4y + 2ag4z + asa = 0. (3.428) 


2. Telling the Type of Second-Order Surface from its Equation 

The type of a second-order surface can be determined from its equation by the signs of its invariants 

A, 6, S, and T from Tables 3.24 and 3.25. Here one can find the names with the normal form of 

the equation of the surfaces, and every equation can be transformed into a normal form. From the 

equation of the so-called imaginary surfaces it is impossible to determine the coordinates of any real 

point, except the vertex of the imaginary cone, and the intersection line of two imaginary planes. 
Table 3.24 Type of surfaces of second order with 6 4 0 (central surfaces) 

















S-d>0, T>0 S-ó and T not both > 0 | 
Δ«0 Ellipsoid Hyperboloid of two sheets 
2 PM "EM MEC 
2 P Pd E og 
a PP ὦ a wv ὦ 
A0 Imaginary ellipsoid Hyperboloid of one sheet 
zy Z2 12 y g 
S+5+5=- — 45-2 21 
a 2 ὦ a? d ὦ 
A=0]| Imaginary cone (with real vertex) Cone 
2 y 2 2 p 2 
= +5 +50 >++5-3=0 
a? PP ὦ a? P d 











3. Invariants of a Surface of Second Order 
Substituting aik = api, then holds: 


Q11 012 013 Q14 aia dig 








a21 A22 A23 Q : 
νο ων. (3.4292) ô = | a21 a22 a23 |, (3.429b) 
a31 Q32 033 Q34 
Q31 032 033 
Q41 Q42 Q43 Q44 
2 2 2 
S = ayy + Aq + 033, (3.429c) T = a22033 + 433011 + 411422 — A33 — A31 — 412- (3.429d) 


During translation or rotation of the coordinate system these invariants do not change. 
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Table 3.25 Type of surfaces of second order with 5 = 0 (paraboloid, cylinder and two planes) 














A<0 (here T οὐ A0 (here T « 0) 
AZz0 Elliptic paraboloid Hyperbolic paraboloid 
z^ gy EE» ey? ae 
α d `” α 07 








A=0) Cylindrical surface with a second-order curve as a directrix whose type 
defines different cylinders: For T > 0 imaginary elliptic, for T < 0 
hyperbolic, and for T = 0 parabolic cylinder, if the surface does not 
split into two real, imaginary, or coincident planes. The condition for 
splitting is: 




















Q11 012 G14 Q11 413 G14 022 023 024 
a21 Q22 Q24 | + | a31 a33 a34 | + | a32 a33 a34 | = 0 
Q41 Q42 A44 Q41 143 Q44 G42 Q43 A44 














1 For the quantities A and T see p. 228. 


3.5.4 Geometric Transformations and 
Coordinate Transformations 


1. Transformations 

Transformations describe the changes of the position or of the form of objects in the plane or in the 
space. There are two types of consideration which are in close relationship with each other [3.22]. In 
the first case points or objects are transformed in a fixed coordinate system. It is called geometric 
transformation (3.5.4.1, p. 229, 3.5.4.5,1., p. 234). 

In the second case the object remains unchanged while the coordinate system is transformed with re- 
spect to the object. With this coordinate transformation (3.5.4.3, p. 231, 3.5.4.5,2., p. 234) not the 
object, but its coordinate representations are modified. At problem solving one of them can be more 
reasonable than the other. 

2. Application Fields 

e Building parts which are described in their own object coordinate-system. 

e Description of motions of parts connected to each other (e.g. robots). 

e Reproduction of two-dimensional projections of three-dimensional objects. 

e Description of motions and deformations in computer graphics and computer animation. 


3.5.4.1 Geometric 2D Transformations 


1. Translations 
Translation of a point P, given by Cartesian coordinates xp, yp, by t, in the direction of the positive 
x-axis, and by t, in the direction of the positive y-axis (Fig.3.201) results the new coordinates of the 
transformed point P’(2'p, yp) : 

Up=Uptte, Yp = yp tty. (3.430) 
If the coordinates are described as column vectors, then the formulas for the new coordinates of the 
transformation and the reversal are 


HEHE ως GD-GD- (e a 


2. Rotation Around the Origin 
At a rotation the object is rotated around the origin by an angle a. If a > 0, then the rotation is 
counterclockwise (Fig.3.202). The mapping of the coordinates of a point P(rp, yp) is described by 
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Figure 3.201 Figure 3.202 
| the relations 
Tp = Tp COSA — yp Sina , Yp = Up- sina + yp: cosa. (3.432) 
The matrix form of (3.432) is 
il MAE PIESE x 
(E) = Ma po A d (3.4332) 
Up sina cosa) \ yp 


The inverse of this transformation corresponds to a rotation by the angle —a: 


tp\ _ (cos(—a) —sin(—a)\ (ap) _ ( cosa sina \ (ap i 
ο τ Cu) cos(—a) J Wyp) ~~ \ sina cosa] \ yp ) (3.433b) 
3. Scaling with respect to the Origin 
At scaling transformations the coordinates are multiplied by s; and s, respectively (Fig.3.203). The 
transformation of a point P(xp, yp) is given by 
Up = Sy Up, Yp = Sy ` YP: (3.434) 
The matrix forms of this transformation and of its inverse are 


DGIE) ewa Ga) eem 


Scaling results in a change of the size of the transformed object. A positive multiplier s, « 1 results 
in contraction of the object in z-direction. Consequently a factor s, 7 1 results in an expansion. A 
negative factor s, « 0 results in a reflection with respect to the y-axis. The corresponding statements 
are valid for sy. 


Scaling transformations in special cases: 


e Reflection with respect to the z-axis: s; — 1, s, — —1. 
e Reflection with respect to the y-axis: s, — —1, sj — 1. 
e Reflection with respect to the origin: s; = Sy = —1. 


4. Shearing 
At shear transformation the value of each coordinate is changed proportionally to the other. The for- 
mulas of this transformation are: 

rp =«pt+dz- yp , Up — Up t ay: Xp. (3.436) 
The matrix form of this transformation (with notation m = 1 — a,ay,) is seen in the following: 


tp\ (1 az\ (xp ΜΕΤ 2ΡΝ. ( l/m —a,/m)\ (xp ; 
τ. νον τυ 


Fig.3.204 shows an example for shearing. 

5. Properties of Transformations 

The transformations introduced above are affine transformations, i.e. the (z', y) coordinates of a trans- 
formed point P' can be expressed by a system of linear equations of the original coordinates (x, y) of 
P. 


Remark: These transformations keep collinearity and parallelism, i.e. lines are transformed into lines, 
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Figure 3.203 Figure 3.204 


and the images of parallel lines are parallel lines. Furthermore translation, rotation and reflection are 
distance- and angle-preserving mappings. 

3.5.4.2. Homogeneous Coordinates, Matrix Representation 

While the changes of coordinates in the case of rotation, scaling and shearing can be described by 
multiplication by a 2 x 2 type matrix (3.433a),(3.435a) and (3.4372), translation does not have this 
type of representation. To be able to handle all of these transformations in the same way, homogeneous 
coordinates are introduced. Every point in the plane gets an additional coordinate w Z 0. Point P(x, y) 
will have coordinates (x^, y^, w), where 


s=, yak. (3.438) 


In the followings w is fixed as w = 1. So point P(x, y) has coordinates (c, y, 1). Now the basic trans- 
formations can be given by a 3 x 3 type matrix in the following form: 


Tp mıı M12 M13 Tp 
Yp | = | Mma Mm Mm yp |, ie, £p =Ma p holds. (3.439) 
1 0 0 1 1 
The matrices for translation, rotation, scaling and shearing are: 
10 f cosa —sina 0 
This tuy = 0 1 45 |., (3.440) R(a) =| sina cosa 0], (3.441) 
001 0 0 1 
Translation matrix Rotation matrix 
s, 00 1 a, 0 
S(sz, Sy) = | 05,0], (3.442) V(dz,dy) =| a 10). (3.443) 
001 001 
Scaling matrix Shearing matrix 


3.5.4.3 Coordinate Transformation 

With geometric transformations an object is transformed with respect to a fixed coordinate system. 
Whereas coordinate transformation gives the relation between the coordinate representations of a fixed 
object with respect to two different coordinate systems. 

The relation between the two types of transformations is represented in Fig.3.205. If the coordinate 
system is shifted by vector t, the coordinates of point P(ap,yp) become rp — zp — t4, yp — yp — ty. 
The translation of the coordinate system by the vector t results the same outcome as the translation 
of point P by —t. 

'The same consequences are valid for rotation and scaling. So, the transformation of the coordinate 
system is equivalent to the reverse transformation of the object. 
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Figure 3.205 


The coordinate transformations resulted by a translation, rotation or scaling can be given by 3 x 3 
transformation matrices T, R and S. Considering the matrices of geometric transformations (3.440)— 
(3.442) follows: 


T(te, ty) = T(—te, ty) = T (te, ty) (3.444) R(o) = R(—a) ^ R^!(a) (3.445 
S(s,, s) ^ S(1/s,, 1/sy) ^ S Τίς, Sy) (3.446 
In this way all basic transformations can be described by a 3 x 3 transformation matrix M: 
αρ Mi, M12 M413 Tp uH 
yp | = | Mar M2 M23 yp |, ic. p= Map. (3.447 
1 0 0 1 1 


3.5.4.4 Composition of Transformations 

Complex geometric transformations can be realized by combining different basic transformations. Le 
a sequence of transformations be given by matrices Mi, M»,...,M,. The consecutive execution o 
these transformations converts the point P(x,y) in n steps into P’. The transformation matrix M 
resulted by this sequence of mappings is the product of these matrices: 





M-2M, M,,:...:M$:. Mj. (3.448 
Similarly, for the reverse transformation 
Μ᾽! -Μτ'.Μ;'.....Μ;),:Μ;'. (3.449 


So, instead of performing n times basic transformations in a sequence on a point, it is possible to give 
the matrix of the composite transformation, and to apply it directly. 

Every affine transformation can be given as a chain (composition) of translation, rotation and scaling. 
Even shearing can be given as a rotation R(o), a scaling S(s;, sy) and a further rotation R(8) applied 
after each other. The parameters a, B, s,, sy can be determined so, that V (a, b) — R(8):S(s;, s,)-R(a) 
holds. 


HM Calculation of the transformation matrix of a rotation 
around an arbitrary point Q(r,, y,) by an angle a: The com- 
posite transformation is the result of composition of the follow- 
ing basic transformations: 

1. Shifting of Q into the origin: M, — T(—z,, —9;). 

2. Rotation around the origin: My = R(a). 

3. Translation back the origin into Q: Mg = My! = T (rq, q)- 
The sequence of the single steps of these transformations see 
Fig.3.206. Point P is transformed via P, and P; into P’. Figure 3.206 








M= M3 ; Μο ξ Μι ΞΞ Tita Ya) : R(a) x T(—2q, —Yq) 
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10 xq cosa —sina 0) 10-2, cosa —sina a_(1—cosa@) + yg sina 
—101g sina cosa 0} | 01—y | = | sina cosa yg(1—cosa) —z,sina | . 
001 0 0 1 00 1 0 0 1 
Reflection with respect to a line given by the equation y = mg + n: 
. Translation of the line to pass through the origin: M; = T(0, —n). 
. Rotation of the line clockwise until it coincides with the x-axis: Mz = R(—a), with tana = m. 
Reflection with respect to the x-axis: M3 — S(1, — 1). 
. Rotation back by a: My = M3! = R(a). 
. Translation of the line back into the original position: Ms = My! = T(0, n). 
M = T(0,n) - R(a) - S(1, -1) - R(—a) - T(0, —n) 


100 cosa —sina 0 1 oy cosa sina 0 10 0 
LIEGE! sina cosa 0 0—10 —sina cosa 0 01-n|. 
001 0 0 1 0 01 0 0 1 00 1 


Use of the well known trigonometric relations sin œ = m/v m? + 1 and cosa — 1//m? 4- 1 results the 
transformation matrix 


σι οοιρ-:ββ 

















1—m? 2m  —2mm 
τῃ3 4-1 τι31-1 m?-1 
M= 2m  m?—1 2n 
m?+1 m?-c-1 m?41 
0 
y a b 
W Complete centered transformation of a rectangle shape cut 
with side lengths a and b into a similar one in a window with width " 


c and height d (Fig.3.207). Sequence of the basic transforma- 
tions: 
1. Shifting of P(zp, yp) into the origin: M; — T(—rp, —yp). 
2. Clockwise rotation by an angle a: Mz = R(—a). 
3. Scaling by factor s = s, — s, — min(c/a, d/b): M3 — S(s, s) 
4. Shifting of the origin into the center of the window: M4 — 
T (c/2, d/2). 

M — T(c/2,d/2) - S(s, s) R(-o)- T(—xp, —yp) Figure 3.207 


10c/2Y /s00 cosa sina 0X /10 —zp 
01d/2]|050 —sinoa cosa 0 || 01 —yp 
00 1 001 0 0 1 00 1 


(^9 ssina η 





—ssina scosa d/2—s(ypcosa — xpsina) 


0 0 


3.5.4.5. 3D- Transformations 


'The mathematical description of geometric transformations and coordinate transformations in the 
three-dimensional space is based on the same ideas which have been discussed for the two-dimensional 
case in sections 3.5.4.1-3.5.4.4. Affine transformations of the three-dimensional space are the compo- 
sitions of basic transformations as: translation, rotation around one of the coordinate axes and scaling 
with respect to the origin. Using homogencous coordinates these transformations can be given by 4 x 4 
transformation matrices. As in the two-dimensional case, the composed transformations can be real- 
ized by matrix multiplications. 





234 3. Geometry 





1. Geometric Transformations 
The transformation of a point P(ap, yp, zp) is performed according to the following rule: 


Tp Mii M12 M13 M14 tp 
/ η d 
Yp — | ma d Moz Mea YP , ie vp'=Mzp. (3.450) 
Zp ™31 1232 M33 M34 ZP 
1 0 0 0 1 1 
The transformation matrices of the basic transformations are: 
1004 Sr 0 0 0 
010f 0 s 00 
T (te, ty, tz) = 001 t , (8.451) S(sz, Sy, 52) = 0 0 ES (3.452) 
000 0001 
Translation Scaling with respect to the origin 
1 0 0 0 cosa 0 sina 0 
0 cosa —sina 0 0 1.0 0 
Βα(α) -- 0 sina cosa 0 (3.453) Ry(a) = —sina 0 cosa 0] ’ (3.454) 
0 0 0 1 0 0 0 1 
Rotation around the x-axis Rotation around the y-axis 
cosa —sina 0 0 10 a, 0 
sina cosa 0 0 01a, 0 
R.(a) = 0 0 10 (3.455) γογίας, αγ) Ξ- 0 0 T 0 (3.456) 
0 0 01 000 1 
Rotation around the z-axis Shearing parallel to the x, y-plane 


For a positive a the rotation is counterclockwise looking from the positive half of the coordinate axis 
to the origin. For reverse transformations the following relations are valid: 


T (best te) = T(t hy, te), 8 (55555) —8(1/55,1/sy; 1/22); (3.457) 
R;!(o) = R,(—a), R; (a) Ξ-Ἡμί-α), Ἡ, (α) -- Β.(--α). (3.458) 


2. Coordinate Transformations 


Analogously to the two-dimensional case, the transformations of the coordinate systems have the same 
effects to the coordinate representations of a point as the reverse geometric transformation (see 3.5.4.3, 
p. 231). Hence, the matrices of the transformations are: 


T (tei ty, tz) = T(t, 4, 6) — T (lai ty ita), (3.459) 
R,(a;) -- Βαί(--α)) -- ΒΖ (αι), (3.460) 
R,(ay) = Ry(—ay) = Ry (0), (3.461) 
Β.(α.) -- Β.ί--α.)-- Β., (αἲ). (3.462) 
S(s,, sy, 5;) — S(1/ss, 1/5, 1/52) — S (55, Sy, 82) - (3.463) 


In practical applications it occurs quite often, that a particular transformation is replaced from a right- 
handed Cartesian coordinate system into another Cartesian coordinate system. The original one is 
often called as world coordinate system, the other one is called the local or object coordinate system . 1f 
the origin U (£u, Yu, Zu) and the unit vectors & — (Lh, mi,n31), &; — (Io, ma, n3) and &j — (Ia, ma, ma] 
of the local coordinate system are given in the world coordinate system, then the transformation from 
the world system into the local system and its reverse are given by the matrices 
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l m m ltuy — miy, — mz, L lg Ig ty 
πτ l9 ma n3 —lar, — may, — nz xrl m; m» ms 
M-|2m»na 73x. m3). 7 34 | — (3464) M =| m m2 M3 Yu | (4 165) 
l4 ma na —lar, — may, — naz nij Ng NZ Žu 


0 0 0 1 0. 0 0 1 


If a point P has the coordinates (xp, yp, zp) in the world coordinate system and it has the coordinates 
(xp, yp. zp) in the local system, then the following equalities are valid: 


ΜΙ xl 

zp Μαρ, (3.466) tp=M αρ. (3.467) 
If M; and M; denote the matrices of transformations from the world coordinate system into two local 
coordinate systems, then the transformation between the two local systems are given by the matrices: 


M-M,M; ad M'-M,M;. (3.468) 
W Determination of the matrix of a rotation by an angle 0 around the line passing through the points 
P(xp,yp,zp) and Q(£4, Yq, Zq) with direction vector Vo = 
{Uz Uy, Uz} = {ep — T4, YP — Yq ZP — 5ο}. It is easy to choose 
P and Q in a distance of one unit, so V is a unit vector. First the 
line is transformed into the z-axis of the coordinate system. Then 
the rotation follows, by an angle @ around the z-axis. Finally it 
follows the transformation of the line back into the original one. 
It is represented in Fig.3.208 how the transformation of the spa- 
tial line into the z-axis is performed. It consists of the following 
steps: 

1. Q is translated into the origins = My = T(—2q, —yq, —2q)- 
2. Rotation around the z-axis so that the rotation-axis is mapped 


into the y, z-plane: My = R.(a,) with cosa, = wyf /v2 + v2 and 


sina, = ve/ fw Tu 


Point P, has the coordinates (0, ,/v2 - v2, vz). Figure 3.208 





3. Rotation around the z-axis by the angle a, until the image of the direction vector V is on the z-axis: 
M; — R;(o;), where cosa; — v;/|V| and sina, — 4/v2 + v2 / [V]. The point P; has the coordinates 
(0, 0, |v]). 


The matrix of transforming the direction vector into the z-axis is with m = ,/v2 + v2 and |¥| = 1: 





Ma = Re(az) - Re(az) -T(—aq, να 5) 














Uy =U 
10 0 Ofm m ONAL Oe, 
_|0 u —m 0 ve Ὁν ϱϱ 010 ~y 
^" [0m v 0 m m 001 —z 
00 0 1/| 9 0 10]JN000 1 
0 0 01 
Uy —Ue 0 UrUg — UyTa Vy Vez -- 
m m m m m 9574 
γη j A 
M,= UpVz UyU; Hrs Uy UzXq + VyVz2Yq M; S TU. υμυς ΗΠ 
m m m > m m 
Uy Uy Uz —UzTq — VyYg — UzZq 0 —-m v %4 


0 0 0 1 0 001 
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Comparing M4 and Mj! with matrices (3.465) and (3.464) shows, that the transformation of the 
spatial line into the z-axis is identical to a coordinate transformation from the world coordinate system 
into a local coordinate system where the origin is Q and the z-axis-direction is V. In the local system 
the rotation is made around the z-axis. The matrix of transformation of the complete rotation is given 
by matrix (3.455) 


M-M;j-R,(0): Ma 














Vy Urls s m vy e 9 Όσα - Vyta 

m m ^? cosÜ —sinÓ 0 0 m m m 
oce Me. y sin? cos@ 0 0 Vode Vyve ^on mz, UsUzta + VyVzYq 
^| m m “ἡ 0 0 10 m m m 

0 —M Uz Zq 0 0 01 Us Uy Uz —Uglg — VyYq — Vz%q 

E E E 0 0 0 1 


In section 4.4, p. 289 an alternative method is given to describe rotation transformations by using the 
properties of quaternions. 





Figure 3.209 


3.5.4.6 Deformation Transformations 
The affine transformations discussed in section 3.5.4.5, p. 233 change the position of objects and result 
in stretching or compressing in given directions. If the elements of the transformation matrix M — 
(mij) are not constants, as they have been until now, but they are functions of position, then this will 
lead to a generalized class of structure-changing transformations. The elements of the matrix now have 
the form: 

Miz = Maj (x,y, z). (3.469) 
1. Contraction 
This transformation is a generalization of scaling. At a contraction in the direction of the z-axis the 
scaling parameters sz, sy are functions of z. The matrix of transformation is: 


s,(z 0 00 
Sis) )-| à 990 (3.470) 
0 0 01 


The functions s,(z) and sy(z) define the contraction-profile. If s,(z) 7 1, then the transformed object 
is stretched in the direction of the z-axis, if ;(z) « 1, then the object is compressed. Fig.3.209b shows 
the result of the transformation of the unit-cube in Fig.3.209a by functions s,(z) — sy(z) — 1/(1— 22). 
2. Torsion around the z-axis 
This transformation is the generalization of the rotation around the z-axis. The angle of rotation is 
changing along the z-axis. The matrix of transformation is: 
cosa(z) —sina(z) 0 0 
sina(z) cosa(z) 0 0 

0 0 10 

0 0 01 


R.(a(z)) = (3.471) 
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The function a(z) defines the angle of rotation along the z-axis. Fig.3.209c shows the rotation of the 


π 
unit-cube with a(z) — 4^ 


3. Bending Around the x-axis 
At bending the angle of rotation changes in a direction perpendicular to the rotation-axis. The matrix 
of transformation has the following form: 


1 0 0 


0 
0 cosa(z) —sina(z) 0 
0 
1 


R,(a(z)) = (3.472) 


0 sina(z) cosa(z 


0 0 0 
Bending of the unit-cube by a(z) = 55 is shown in Fig.3.209d. 


These deformations can be applied in a sequence like affine transformations. The object in Fig.3.209e 
is the result of a contraction then a torsion of the unit-cube. 


3.5.5 Planar Projections 


There are several methods to visualize three-dimensional objects in a two-dimensional media [3.22]. 
Among them planar projections are very important. A planar projection is a mapping, where points of 
a three-dimensional space are assigned to points of a plane. An image point is given as the intersection 
point of this plane and a ray connecting an observer with the space-point. The plane is called a pro- 
jection plane or a picture plane, the ray is called projecting ray, and its direction is the direction of the 
projection . 





Figure 3.210 Figure 3.211 Figure 3.212 


3.5.5.1 Classification of the projections 


1. Central Projection 

The central projection which is called also perspective projection the projecting rays start from a common 
center point (Fig.3.210). Objects being farther from the center of perspectivity C are represented 
smaller than the ones being closer to the center. Parallel lines, which are not parallel to the projection 
plane, become not parallel and they intersect in a so called vanishing point. Perspective projections 
give a realistic impression of the object for the viewer. But relations of lengths and angles are lost at 
this mapping. 

2. Parallel Projection 

At the parallel projection the projecting rays are parallel to each other (Fig.3.211). Line segments, 
which are not parallel to the projection plane are shortened, and angles are usually distorted. 

1. Orthogonal Parallel Projections A parallel projection is an orthogonal projection if the di- 
rection of the projecting ray is perpendicular to the picture plane. If furthermore the picture plane is 
perpendicular to one of the coordinate-axes, then it is an orthographic projection or principal projec- 
tion, which is well known from industrial designs. 
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Ifthe direction of projection is not perpendicular to any coordinate-axis, then the orthogonal projection 
is called an axonometric projection. 

2. Oblique Parallel Projections A parallel projection is an oblique projection if the direction of 
the projecting ray is not parallel to the normal vector of the picture plane (Fig.3.212). Special cases 
of oblique projections are Cavalier projection and cabinet projection . 

Sometimes parallel projections preserve the ratio of sizes, but they seem less realistic than perspective 
representations. 


3.5.5.2 Local or Projection Coordinate System 


Defining the direction of the projection plane and the coordinates of the result of a projection in the 
world coordinate system is not reasonable. It seems to be useful to apply a picture coordinate system 
whose z, y-plane is identical to the projection plane. This picture system represents the projection from 
the viewpoint of an observer who is looking perpendicularly to the projection plane. 

Let the projection plane be given by a reference point A(r,.,y.,z.) and a unit normalvector i = 
{ns ny; nz}. Then the picture coordinate system is defined in the following way. The origin is placed 
into R(z,, y», zr). Among the coordinate unit vectors &;, €; and 6j first &j =n is fixed. An additional 
information is needed to fix the coordinate vectors &; and & in the picture plane. An ,,upward” vector 


U is chosen in the world coordinate system, and its projection into the picture plane defines the vertical 
direction i.e. the y’-direction of the picture system. The normed vector product of ii and ri defines the 
vector &j. Summary: 

"E . xd 4 4 

ë =ñ, = - &; = &, x &. (3.473) 

[1 χη] 

The transformation matrices of the mapping from the world coordinate system into the picture system 
and their inverses are given as in (3.464) and (3.465) by substituting the corresponding coordinates of 


point R(z,, y», 2,) and vectors 6;, €; and 6j. 


3.5.5.3 Principal Projections 


The projection is perpendicular to a plane which is perpendicular to one of the coordinate-axes. De- 
pending on the direction of the projection and the viewing direction to the picture plane, the ground 
plan, the top view or one of the side views is formed on the projection plane. 

The matrix form of the orthogonal projection of a point P(xp, yp, zp) to the projection plane being 
parallel to the x, y-plane with equation z = 2p is 


zh 1000 


Tp 
yp|_ 1/0100 Yp d 
Zl-lós0allxl (3.474) 
1 0001 J 


In general, the projection plane is chosen zo — 0. The matrices of projections to the coordinate planes 
are 


0 0 0 
0 0 0 

Pus [3 riole dus (3.475) 
0 0 0 


rococo 
DOGO 


000 
100 
000 
001 


ooor 


1 
0 
0 
0 


3.5.5.4 Axonometric Projection 

In contrary to the orthographic projection, now the normal-vector ii — (nz, ny, nz) of the projection 
plane and the direction of the projection are not parallel to any of the coordinate axes. There are three 
different cases to consider: 

e Isometric: The angles of ii are the same with every coordinate axis. So, for the coordinates of ri, 
|nz| = |ny| = |nz|. The angle between the projected coordinate axes is 120°. Line segments being 
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parallel to the coordinate axes have the same distortion factor (Abb.3.213a). 
e Dimetric: ii has the same angle with two of the coordinate axes. In these directions equal distances 
remain equal. Two of the coordinates of ri have the same absolute value (Fig.3.213b). 


e Trimetric: ni has different angles with the coordinate axes. Consequently the coordinate axes have 
different distortion factors (Fig.3.213c). 


Ζ 





a) ^ y 


Figure 3.213 


3.5.5.5 Isometric Projection 


Consider the case when the projection plane contains the origin of the world coordinate system, and 


] oes 1 1 1 
when its normal vector is ri [s JS A 
In order to determine the matrix of projection, a coordinate transformation into the projection co- 
ordinate system is combined with a subsequent projection along the z'-axis to the x’, y/-plane. The 
definition of the picture coordinate system corresponds to (3.473). The upward vector is chosen as 
u = {0,0,1}. In this way the z-axis is mapped to the y’-axis. The unit basis-vectors of the picture 
coordinate system are: 


g- xu a Gy æ =ë xë = ea Ec 
aT UV S 5 95 l| o Ve 


σι 


S 1 1 1 
—n-4—z,—,—-,. 
i V3' V3' V3 
The transformation matrices of mapping from the picture system into the world coordinate system and 
reverses with (3.464) and (3.465) are: 





1 1 1 0 1 1 0 0 
V2. V6 V3 v2 v2 
1 Jo g 0 1 1 2 0 
Ma-| v2 νν |, Μι-Μ]-| νο ve ve |. (3.476) 
2 1 1 1 1 
0 — —&0 = a= c 
νό v3 ν3 νᾶ v3 
0 0 0 1 0 0 0 1 
Now, in the picture system the orthogonal projection along the z'-axis is 
1 1 
cuc ut Ud) 1 1 
2 v2 =z 5 009 
1000 v y 2 v2 v2 
Απ 1ος ye ae ve a | t 5g (3.477) 
Am eA — 100.00 l i 34 V6 V6 V6 : 
0001 —Ó — 0 0 0 0 0 
V3 v3 v3 0 0 9: 3 
0 0 0 1 
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The projection matrix P4 maps the points from the world coordinate system into the 2’, y/-plane of 
the picture system. By multiplication with matrix M 4 one obtains the points in the world coordinate 
system. The projection-matrix of the complete projection is: 
2 —1 —1 0 
1|-1 2-10 
3|-1-1 20 
0 0 03 


3.5.5.6 Oblique Parallel Projection 
At an oblique projection the projecting rays intersect the projection plane in an angle f. In Fig.3.214 
the oblique projection of point P(xp, yp, zp) is P'(xp, yp. zp) and its orthogonal projection is Pj. L 


P =M,P.M, = (3.478) 





is the length of the projected line segment P5 P'. The projection is characterized by two quantities. 


gives the factor by which the line segment perpendicular to the projection plane is 





1 
d-—- 
z tan 8 
scaled. a is the angle between the x-axis and the projected image of the perpendicular line segment. 
Then the rule for the coordinates of the whole projection is: 


tp = £p — zp dcosa, yp=yp—zpdsina, zp=0 or (3.479) 
Tp 1 0 —dcosa 0 xp 
yp | _ | 0 1 —dsina 0 Up | 
lal η [s (3.480) 
1 00 0 1 1 


If the projection plane is different from the x, y-plane, then a coordinate transformation into the picture 
system must be done before applying the projection-matrix (see 3.5.5.2, p. 238). 


Al τ 


LA y 





Figure 3.214 Figure 3.215 Figure 3.216 


W Cavalier projection of the unit cube with a = 45? and d — 1, i.e. 8 = 45°. Segments being perpen- 
dicular to the projection plane do not become shorter in this case (Fig.3.215). 
The four vertices not lying in the x, y-plane are put as columns into a matrix. This matrix multiplied 
by the matrix of the projection will give the coordinates of the projected points: 





1 1 1 1 1 
ο πω το ο 0 -— 0 
V2 V2 /2 /2 /2 0101 
Ld 1 1 i aloa L a 0011 
V2 ν3 νᾶ V2 V2 1111 
0 0 0 0 00 0 0 1111 
1 1 1 1 00 0 1 


E Cabinet projection of the unit cube with a — 30*, d — 1/2, i.e. 8 — 63.4*. The lengths of segments 
perpendicular to the projection plane are halved (Fig.3.216). 
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The coordinates of the projections of the four vertices not lying in the z, y-plane are calculated by 


4 4 4 4 4 τ 

1 1 8 8 1 
ES Ee e = = quse 

4 4 4 4 : 4 | 113114 
0 0 0 0 00 0 0 1111 
1 1 1 1 00 0 1 


3.5.5.7 Perspective Projection 
1. Mapping Formulation E 


The formulation of the mapping of a perspective projection can be given reasonably in the picture 
coordinate system. The origin is chosen so that the center of projection lies on the z-axis. 

As it can be seen in Fig.3.217, the relations between the coordinates of a point P(xp, yp, zp) and its 
projected image P’ (xp, yp, zp) can be calculated 
using equations of the intercept theorems (see 


3.1.8:2,5., p.135): 





tp 2 _ 1 yp 1 
: Zp? Zp? 
Xp ^ 29—z2p 1-2 y 1. 
20 20 
af — 
ab 0. 


The relations between the original and the im- 
age coordinates are not linear. However using 
the properties of homogeneous coordinates (see 
3.5.4.2, p. 231) the projection rule can be given 








Figure 3.217 in the following matrix form: 
Tp Tp : | i P up 
Ut yp 1 
s=] o 1=|O0 0 ol] eo (3.481) 
-%2 -1 
1 1-2 00 —z 1 1 


2. Vanishing points 

Perspective projections have the property, that parallel lines, which are not parallel to the picture 
plane, seem to intersect each other in a point. This point is called vanishing point. The vanishing 
points of lines being parallel to the coordinate axes are called principal points or principal vanishing 
points. 'The number of principal points is equal to the number of coordinate axes intersecting the picture 
plane. In Fig.3.218 there are perspective representations with one, two and three principal points. The 





Figure 3.218 
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principal points coincide with the intersection points of the picture plane and the rays being parallel to 
the coordinate axes. If the center of projection is C (xe, yc, Zc), a point of the picture plane is R(x, Yr, Zr) 
and the normal vector is Ñ = {nz, ny, nz}, then the coordinates of the principal points are: 


d d d 

Falte + —, Yes Ze), Fylte, Ye + —, Ze), Fe (Xe, Ye, % + —) where (3.482a) 
Ne ny Nz 

d = (È — F) - B = (£e — 2) ng + (Ye — Yr)Ny + (Ze — 2) Nz (3.482b) 


in the case they exist. If a coordinate of the normal vector ii is zero, then there is no principal point in 
that direction. 
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3.6 Differential Geometry 


In differential geometry planar curves and curves and surfaces in space are discussed by the methods 
of differential calculus. Therefore it is to be supposed that the functions describing the curves and 
surfaces are continuous and continuously differentiable as many times as necessary for discussion of 
the corresponding properties. The absence of these assumptions is allowed only at a few points of the 
curves and surfaces. These points are called singular points. 

During the discussion of geometric configurations with their equations there are to be distinguished 
properties depending on the choice of the coordinate system, such as intersection points with the co- 
ordinate axes, the slope or direction of tangent lines, maxima, minima; and invariant properties inde- 
pendent of coordinate transformations, such as inflection points, curvature, and cyclic points. There 
are also local properties, which are valid only for a small part of the curves and surfaces as the curva- s 
ture and differential of arc or area of surfaces, and there are properties belonging to the whole curve or 
surface, such as number of vertices, and arc length of a closed curve. 


3.6.1 Plane Curves 
3.6.1.1 Ways to Define a Plane Curve 


1. Coordinate Equations 
A plane curve can be analytically defined in the following ways. 
1. In Cartesian coordinates: 


a) Implicit: F(v,y) — 0, (3.483) 
b) Explicit: y — f(x), (3.484) 
2. In Parametric Form: c=x(t), y=y(t). (3.485) 
3. In Polar Coordinates: p= f(y). (3.486) 


2. Positive Direction on a Curve 

Ifa curve is given in the form (3.485), the positive direction is defined on it in which a point P (a(t), y(t)) 
of the curve moves for increasing values of the parameter t. If the curve is given in the form (3.484), 
then the abscissa can be considered as a parameter (x = x,y = f(x)), so the positive direction holds for 
increasing abscissa. For the form (3.486) the angle can be considered as a parameter o (x — f(y) cosy, 
y = f(y) sin), so the positive direction holds for increasing y, i.e., counterclockwise. 

E Fig.3.219a, b,c: A:x=??,y=t8, B:y=sina, C:p=ay. 


y 7 y 











a) b) 


Figure 3.219 


3.6.1.2 Local Elements of a Curve 

Depending on whether a changing point P on the curve is given in the form (3.484), (3.485) or (3.486), 
its position is defined by x, t or y. A point arbitrarily close to P with parameter values x + dx, t + dt 
or y + dy is denoted here by N. 

1. Differential of Arc 

If s denotes the length of the curve from a fixed point A to the point P, the infinitesimal increment 


As =PN can be expressed approximately by the differential ds of the arc length, the differential of 
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arc: 


2 
1+ (2) dx for the form (3.484), (3.487) 
ae 


V x? 4- y? dt for the form (3.485), (3.488) 
V p? — p? dp for the form (3.486). (3.489) 


B A: y-—sinz, ds — V1-4-cos?rdr.Wi B: z — 02, y — £, ds — tA 4- 98 dt. 
B C: p—ag (a0), ds-—ay1- e? de. 





Figure 3.220 Figure 3.221 Figure 3.222 


2. Tangent and Normal 

1. Tangent at a Point P to a Curve isalinein the limiting position of the secants PN for N — P; 
the normal is a line through P which is perpendicular to the tangent here (Fig. 3.220). 

2. The Equations of the Tangent and the Normal are given in Table 3.26 for the three cases 
(3.483), (3.484), and (3.485). Here x, y are the coordinates of P, and X, Y are the coordinates of the 
points of the tangent and normal. The values of the derivatives should be calculated at the point P. 


Table 3.26 Tangent and normal equations 




















Type of equation | Equation of the tangent Equation of the normal 
OF _ OF τα Y-y 
(3.483) ox (X —2)4 ay (Y -y)=0 IF IF 
Ox Oy 
3.484 ος ας Yoy- (a 
(3.484) EI - 1) -y= -q — 2) 
απ 
Y -1 X-z 
(3.485) ese a'(X 2) y(Y - y) 20 
y x 
Examples for equations of the tangent and normal for the following curves: 
E A: Circle x? +y? = 25 at the point P(3, 4): 





a) Equation of the tangent: 2r(X — x) -- 2y(Y — y) = 0 or Xa + Yy = 25 considering that the point 
P lies on the circle: 3X + 4Y = 25. 
. X-z Y-y y ] 4 
b) Equation of the normal: ——— = ~ or Y = —X; at the point P: Y = -X. 
2x 2y 2 3 





E B: Sine curve y = sin x at the point 0(0, 0): 
a) Equation of the tangent: Y — sinz — cosz(X — x) or Y — X cosx + sin x — z cos z; at the point 
(0,0): Y 2 X. 
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sin x 





1 
(X —r)orY X sec g +sin x + z sec x; at the point 


b) Equation of the normal: Y 
COS X: 


(0,0): Y 2 —X. 

Bl C: Curve with x = t2, y = at the point P(4, —8),t = —2: 
3 2 

a = E orY — SX — P at the point P: Y = —3X + 4. 

b) Equation of the normal: 2t (X — t?) + 3t? (Y — t) = 0 or 2X + 3tY = ? (2 + 913}; at the point 

P(4,—8): X —3Y = 28. 

3. Positive Direction of the Tangent and Normal of the Curve If the curve is given in one of 

the forms (3.484), (3.485), (3.486), the positive directions on the tangent and normal are defined in the 

following way: The positive direction of the tangent is the same as on the curve at the point of contact, 

and one gets the positive direction on the normal from the positive direction of the tangent by rotating 

it counterclockwise around P by an angle of 90° (Fig. 3.221). The tangent and the normal are divided 

into a positive and a negative half-line by the point P. 

4. The Slope of the Tangent can be determined 

a) by the angle of slope of the tangent a , between the positive directions of the axis of abscissae and 

the tangent, or 

b) if the curve is given in polar coordinates, by the angle jz, between the radius vector OP (OP = p) 

and the positive direction of the tangent (Fig. 3.222). For the angles a and µ the following formulas 

are valid, where ds is calculated according to (3.487)-(3.489): 








a) Equation of the tangent: 














tana — a , cosa = i , Sina— - ; (3.4902) 
tanu = i cos u = äp sin 4 = oe (3.490b) 
dp’ : ds! ` ds ` ` 
dy 
N 1 στ m cos © . 
WB A: y— sinz, tana — cosz, cosa = αι. : sina = JT GER j 
BB: c=, y=? πο ea es ας 
α Ο: ρ-αφ, tany = 9, Cos fl = : sin u = ? 


Vite’ 


Vite 


5. Segments of the Tangent and Normal, Subtangent and Subnormal (Fig. 3.223) 
a) In Cartesian Coordinates for the definitions in form (3.484), (3.485): 

















Dr y 2 : 

PT -|-4T ΄ segment of the tangent), 3.491a 
y +y (seg f gent) ( ) 

PN = |yy1 +y”? (segment of the normal), (3.491b) 

PU B (subtangent), (3.491c) P'N — |yy| (subnormal. (3.4914) 

b) In Polar Coordinates for the definitions in form (3.486): 

PT = R / p? 4- p? (segment of the polar tangent), (3.4922) 
p 

PN = e +p? (segment of the polar normal), (3.492b) 
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2 
OT' — 5 (polar subtangent), (3.492c) ON' — |p'| (polar subnormal). (3.492d) 


/ 


Bl A: y — coshz, y' — sinhz, /14- y? — coshz; PT = |coshzcothz|, PN — |cosh?z|, PT = 


|coth z|, P'N = |sinh x cosh z|. 
WB B: p — ago (a7 0), f 2a, yp? - p? a1 95; PT! 5 agy14 ge? aA 1l 4 q? 


[αρ] , ON! — a. 


PN = OLS 

















Figure 3.223 Figure 3.224 Figure 3.225 


6. Angle Between Two Curves The angle B between two curves T; and I at their intersection 
point P is defined as the angle between their tangents at the point P (Fig. 3.224). By this definition 
the calculation of the angle 6 has been reduced to the calculation of the angle between two lines with 
slopes 
d, d, 
πάν |E), (3.4932) acim | (3.493b) 
{τ }ρ dr jp 
Here y — fi(x) is the equation of /1 and y = fo(a) is the equation of I>, and the derivatives are 
calculated at the point P. One gets £ with the help of the formula 
tanos — tana, 
tan — tan(o4 — a3) — —————— —— . 3.494 
f (αι 2) 1 + tan q; tan @2 ( ) 


E Determine the angle between the parabolas y = yx and y = x” at the point P(1, 1): 


ανα 1 d (a? tan a — tar 3 
tana; — vz —-, tana,= (ο) —2, tanp = ΞΕ 
dx 1 2 dx {νι 1l+tana,tanag 4 








3. Convex and Concave Part of a Curve 


If a curve is given in the explicit form y = f(x), then a small part containing the point P can be 
examined if the curve is concave up or down here, except of course if P is an inflection point or a 
singular point (see 3.6.1.3, p. 249). If the second derivative f”(x) > 0 (if it exists), then the curve is 
concave up, i.e., in the direction of positive y (point P, in Fig. 3.225). If f”(x) < 0 holds (point Ρι), 
then the curve is concave down. In the case if f”(a) = 0 holds, it should be checked if it is an inflection 
point. 


Bl y — 2? (Fig. 2.15b); y" — 6z, for x > 0 the curve is concave up, for x < 0 concave down. 
4. Curvature and Radius of Curvature 


1. Curvature of a Curve The curvature K of a curve at the point P is the limit of the ratio of the 
angle 6 between the positive tangent directions at the points P and N (Fig. 3.226) and the arc length 
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PN for PN 0: 


K= lim E f (3.495) 
PN=>0 PN 


The sign of the curvature K depends on whether the curve bends toward the positive half of the normal 
(K > 0) or toward the negative half of it (K < 0) (see 3.6.1.1, 2., p. 245). In other words the center of 
curvature for K > 0is on the positive side of the normal, for K < 0 it ison the negative side. Sometimes 
the curvature K is considered only as a positive quantity. Then it is to take the absolute value of the 
limit above. 
2. Radius of Curvature of a Curve The radius of curvature R of a curve at the point P is the 
reciprocal value of the absolute value of the curvature: 
1 
R= —. 3.496 
K (3.496) 


The larger the curvature K is at a point P the smaller the radius of curvature R is. 








Β A: Fora circle with radius a the curvature K = 1/a and the radius of curvature R = a are constant 
for every point. 
B DB: For a line with K = 0 holds R = œ. 


3. Formulas for Curvature and Radius of Curvature 
With the notation 6 = da and PN= ds (Fig. 3.226) in gen- 












































eral 
. da ds 
— =j (3.497) 
For the different defining formulas of curves in 3.6.1.1, p. 243 
Figure 3.226 the different expressions for K and R are: 
3/2 
PE 
dy 14 (2) | 
uA x 
Definition as in (3.184): K = dx aye, R= Fy (3.498) 
dy πας 
1 — pe 
T (2) | dz 
1 CEA 
Definition as in (3.485): K = — 5" R= , (3.499) 
i 2 372 a^ oy 
(ω F $ ) | y" y" 
Pos Puy P 
Ένα Fy Fy 3/2 
F, F, 0 Εν; -- Εῷ 
Definition as in (3.483): K = 7 R= τω (3.500) 
(pq Fag Fay ὃς 
Y Fyz Fyy Fy 
EF, Fy 
2 + 292 — po" 2 72)3/2 
Definition as in (3.486): K — £7 —/P Wo (3.501) 
(p? 4- p?) p? t 2p? — pp 








248 9. Geometry 





1 3 Σ 0 


HA: y=coshr, K= : BB:c—-), y—0, K — —— — 
yee cosh? x’ t(A 4- 91292 
2 2 
2.2 5.2 32 a a : "m Padl yr +2 
BC: y-2=ca, = p E D: p= ay, K= ρε: 


5. Circle of Curvature and Center of Curvature 

1. Circle of Curvature at the point P is the limiting position of the circles passing through P and 
two points N and M of the curve from its neighborhood, for N —> P and M — P (Fig. 3.227). It 
goes through the point of the curve and here it has the same first and the same second derivative as the 
curve. Therefore it fits the curve at the point of contact especially well. It is also called the osculating 
circle. Its radius is the radius of curvature. It is obvious that it is the reciprocal value of the absolute 
value of the curvature. 

2. Center of Curvature The center C of the circle of curvature is the center of curvature of the 
point P. It is on the concave side of the curve, and on the normal of the curve. 

3. Coordinates of the Center of Curvature The coordinates (xc, yc) of the center of curvature 
for curves with defining equations in 3.6.1.1, p. 243 can be determined from the following formulas. 


dy dy\? dy\” 
es dli IV y 
dax | T (2) I+ (2) 

4 y+ : 









































Definition as in (3.484): rc Py i yc — Py (3.502) 
da? da? 
I ppl /2 a f ofa: 12 
Definition as in (3.485): rc —r— oar ; yc =y+ SU A (3.503) 
g" y" z" y" 
2 /2 ΕΙ Fas 
Definition as in (3.486): xo = pcos p gr P) pcosip +p sini) 
P+ 2p? Ie pp" 
- (p? + p”)(psin y — p' cosp) 
= ps ; .504 
yc = psin y B+ Dp — ppl (3.504) 
VM (i) ανν) 
efinition as in (3.483): xc = x+ TE. FS EC yc =y+ TF Fa Fl” (3.505) 
yz Fyy Fy yz Fyy Fy 
x y T y 
These formulas can be transformed into the form 
zc — r»— Rsino, yo=y+Rceosa Or (3.506) 
d dx 
πσ-α- Εν, yo=y+RZ (3.507) 
ds ds 





(Fig. 3.228), where Ft should be calculated as in (3.498)- (3.501). 





Figure 3.227 Figure 3.228 Figure 3.229 
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3.6.1.3 Special Points of a Curve 


In the following are to be discussed only the points remaining invariant during coordinate transforma- 
tions. In order to determine maxima and minima see 6.1.5.3, p. 444. 


1. Inflection Points and the Rules to Determine Them 

Inflection points are the points of the curve where the curvature changes its sign (Fig. 3.229) while 
a tangent exists. The tangent line at the inflection point intersects the curve, so the curve is on both 
sides of the line in this neighborhood. At the inflection point A = 0 and R = oo hold. 


1. Explicit Form (3.484) of the Curve y = f(a) 


a) A Necessary Condition for the existence of an inflection point is the zero value of the second 
derivative 

f"(x) =0 (3.508) 
if it exists at the inflection point (for the case of non-existent second derivative see b)). In order to 
determine the inflection points for existing second derivative all the roots of the equation f"(x) = 0 
with values 71, %2,...,Ui,-..,@n are to be considered, and to be substituted into the further deriva- 
tives. If for a value x; the first non-zero derivative has odd order, then there is an inflection point here. 
If the considered point is not an inflection point, because for the first non-disappearing derivative of 
kth order, k is an even number, then for f (z) « 0 the curve is concave down; for f®(x) > 0 it is 
concave up. If the higher-order derivatives are not be checked, for instance in the case they do not exist, 
see point b). 


b) A Sufficient Condition for the existence of an inflection point is the change of the sign of the 
second derivative f"(r) while traversing from the left neighborhood of this point to the right, if also 
a tangent exists here, of course. So the question, of whether the curve has an inflection point at the 
point with abscissa x;, can be answered by checking the sign of the second derivative traversing the 
considered point: If the sign changes during the traverse, there is an inflection point. (Since z; is a root 
of the second derivative, the function has a first derivative, and consequently the curve has a tangent.) 
This method can also be used in the case if y" — oo, together with the checking of the existence of a 
tangent line, e.g. in the case of a vertical tangent line. 

















1 1-32? 1 1-2? 
BA: y 1422’ f" (x) ΠΕΣΟ T12 = EVA f" (a) = “μπεκ 3 f" (213) # 0. 


Ἱ. -Ὁ 1 3 
Inflection points: Αἰ|--,-)], Β|--π,τ). 
B B: y — x^, /"(x) — 1222, z4 — 0, f"(x) — 24x, f"(z1) — 0, f!V(v) = 24; there is no inflection 
point. 

5, 52 , 4a d E 

BC: y=23,y= 3*5 y = y? 3; for r — 0 we have y" — oo. 
As the value of x changes from negative to positive, the second derivative changes its sign from “—” to 
“+” “so the curve has an inflection point at a = 0. 
Remark: In practice, if from the shape of the curve the existence of inflection points follows, for in- 
stance between a minimum and a maximum with continuous derivatives, then there are to be deter- 
mined only the points x; without checking the further derivatives. 
2. Other Defining Forms of the Curve The necessary condition (3.508) for the existence of an 
inflection point in the case of the defining form of the curve (3.484) will have the analytic form for the 
other defining formulas as follows: 


a) Definition in parametric form as in (3.485): =0. (3.509) 








b) Definition in polar coordinates as in (3.486): p? 4- 2p? — pp" — Q. (3.510) 
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Pig Fy Fo 
c) Definition in implicit form as in (3.483): F(x,y) =0 and |Fyzs Eyy Fy|= 0. (3.511) 
0 
a fy 


In these cases the system of solutions gives the possible coordinates of inflection points. 


1 1 
αλβ. τ-α (: κ sint) , y=a (1 — 5 008 t) (curtated cycloid (Fig. 2.68b), p. 102); 























x y'|_ @|2—cost sint|_ @ . . ' 1. e . 
a'y! © T sint cost —- 405t-1); cos tk 3 tk t3 t2kr (k =0,+1,+2,...). 














The curve has an infinite number of inflection points for the parameter values tg. 


MEME MEE 





ο E 1 V 12 Ho 2 5 E i fas As le 
HB: p= ug ρ' 2p" — pp 7 + mé πρ Ip (4y* — 1). The inflection point is at the angle 
p = 1/2. 

Bis 2 0 2g 
MC: «?—y? =a? (hyperbola). | - --|=|0 —2 —2y|= 82? — 8y?. The equations 2? — y? = a? 
. 2r —2y 0 


and 8(a? — y?) = 0 contradict each other, so the hyperbola has no inflection point. 


y 
Np" 
a) 





b) 


. Figure 3.230 
2. Vertices 


Vertices are the points of the curve where the curvature has a maximum or a minimum. The el- 
lipse has for instance four vertices A, B, C, D, the curve of the logarithm function has one vertex 
at E (1/2 , —In 2/2) (Fig. 3.230). The determination of vertices is reduced to the determination of 
the extreme values of K or, if it is simpler, the extreme values of R. The formulas (3.498)—(3.501) can 
be used for the calculations. 

3. Singular Points 

Singular point is a general notion for different special points of a curve. 

1. Types of Singular Points The points a), b), etc. to j) correspond to the representation in 
Fig. 3.231. 

a) Double Point: At a double point the curve intersects itself (Fig. 3.231a). 


b) Isolated Point: An isolated point satisfies the equation of the curve; but it is separated from the 
curve (Fig. 3.231b). 


c), d) Cuspidal point: Ata cuspidal point or briefly a cusp the orientation of the curve changes; ac- 
cording to the position of the tangent a cusp of the first kind and a cusp of the second kind (Fig.3.231c,d) 
can be distinguished. 


e) Tacnode or point of osculation: At the tacnode the curve contacts itself (Fig. 3.231e). 


f) Corner point: Ata corner point the curve suddenly changes its direction but in contrast to a cusp 
there are two different tangents for the two different branches of the curve here (Fig.3.231f). 


g) Terminal point: At a terminal point the curve terminates (Fig.3.231g). 


h) Asymptotic point: In the neighborhood of an asymptotic point the curve usually winds in and out 
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or around infinitely many times, while it approaches itself and the point arbitrarily close (Fig.3.231h). 


i), j) More Singularities: It is possible that the curve has two or more such singularities at the same 
point (Fig. 3.2311,j). 


P 
A ® M 3 τ 


Figure 3.231 








2. Determination of the Tacnode, Corner, Terminal, and Asymptotic Points Singularities 
of these types occur only on the curves of transcendental functions (see 3.5.2.5, p. 195). 

The corner point corresponds to a finite jump of the derivative dy/dx. 

Points where the function terminates correspond to the points of discontinuity of the function y = f(x) 
with a finite jump or to a direct termination. 

Asymptotic points can be determined in the easiest way in the case of curves given in polar coordinates 
asp = f(y). If for gp + co or y + —oo the limit is equal to zero (limp = 0), i.e., the pole is an 
asymptotic point. 





J: 


W A: The origin is a corner point for the curve y — 
l+ez 


1 
Wi B: The points (1,0) and (1, 1) are points of discontinuity of the function y — ———34- (Fig. 2.8) 


(see 2.1.4.5, p. 106). μα 

HC: The logarithmic spiral p = aet? (Fig. 2.75) (see 2.14.3, p. 106) has an asymptotic point at the 
origin. 

3. Determination of Multiple Points (Cases from a) to e), and i), and j)) Double points, 
triple points, etc. are denoted by the general term multiple points. To determine them, one starts with 
the equation of the curve of the form F(x,y) = 0. A point A with coordinates (1, y1) satisfying the 
three equations F — 0, F; — 0, and F,, — 0 is a double point if at least one of the three derivatives of 
second order F;,, F;,, and Fy, does not vanish. Otherwise A is a triple point or a point with higher 
multiplicity. 

The properties of a double point depend on the sign of the Jacobian determinant 


= sg Fu 
= [m ο (3.512) 


y—yi 
Case A < 0: For A < 0 the curve intersects itself at the point A; the slopes of the tangents at A are 
the roots of the equation 
Fyyk? + 2Fayk + Fe = 0. (3.513) 
Case A > 0: For A > 0 A is an isolated point. 
Case A = 0: For A = 0 Ais either a cusp or a tacnode; the slope of the tangent is 





Fy 
tana — —— , (3.514) 
yy 
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For more precise investigation about multiple points the origin can be translated to the point A, and 
rotate so that the x-axis becomes a tangent at A. Then from the form of the equation one can tell if it 
is a cusp of first or second kind, or if it is a tacnode. 

" A: F(z,y)&8 (i34 yy- 2a*(a* —y’) =0 (Lemniscate, Fig. 2.66, 2.12.6, p. 101); Fy = 4x(a? + 
i? — a2), Fy = 4y(x? + y? + a’); the equation system F, = 0, Fy = =) results i in the three solutions 














(0,0), (+a, 0), from which only the E one satisfies the condition pe — 0. Substituting (0, 0) into the 
second derivatives yields Fyr = —4a?, F;, — 0, Fj, — --4a?; A = —16a* < 0, i.e., at the origin the 
curve intersects itself; the slopes of thet tangents are tan a = 31, their equations are y = +2. 

E B: F(x,y) = 3? y? —a? — 9? — 0; F, — x(3x — 2), F, — y(3y — 2); among the t hs 0), 
(0, 2/3), (2/3, 0), m (2/3, 2/3) only the first one belongs to "the curve; there is Fy. = —2, = 0, 
Fy, = —2, A = 4 > 0, i.e., the origin is an isolated point. 


E C: F(x,y) = (y — x?) — x = 0. The equations F, = 0, F, = 0 result in only one solution (0, 0), 
it also satisfies the equation F = 0. Furthermore there is A = 0 and tana = 0, so the origin is a 
cusp of the second kind. This can be seen from the explicit form of the equation y = xz?(1 + vT). y 
is not defined for z « 0, while for 0 « x « 1 both values of y are positive; at the origin the tangent is 
horizontal. 





4. Algebraic Curves of Type F(a, y) = 0, F(x, y) Polynomial ina and y 

If the equation does not contain constant and first-degree terms, then the origin is a double point. The 
corresponding tangents can be determined by making the sum of the second-degree terms equal. 

Hi For the lemniscate (Fig. 2.66, p. 101) at the origin (0,0) the equations of the tangent y = +a follow 
from a? — y? — 0. 

If the equation does not contain second-degree terms as well but contains cubic terms, then the origin 
is a triple point. 





3.6.1.4. Asymptotes of Curves 
1. Definition 


An asymptote is a straight line to which the curve 
approaches while it moves away from the origin — 
(Fig. 3.232). 


The curve can approach the line from one side a) b) 
(Fig. 3.232a), or it can intersect it again and again ] 
(Fig. 3.232b). Figure 3.232 


Not every curve which goes infinitely far from the origin (infinite branch of the curve) has an asymptote. 
For instance the entire part of an improper rational expression is called an asymptotic approximation 
(see 1.1.7.2, p. 15). 


2. Functions Given in Parametric Form x = z(t), y = y(t) 


To determine the equation of the asymptote first those values are determined for which t > t; yields 
either x(t) — +too or y(t) — -Eoo (or both). 
There are the following cases: 








a) x(t) > œo for t> t; but y(t) =a #00: y=a. The asymptote is a horizontal line. — (3.515a) 


b) y(t) — oo for t — t; but z(t;) =a 400: x=a. The asymptote is a vertical line. (3.515b) 








t 
c) If both y(t) and z(t) tend to «oo, then the following limits are to be calculated: k — lim n and 
ur m 
b= lim [y(t) — ka(t)]. If both exist, the equation of the asymptote is 
st 


y= kr +b. (3.515c) 
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m π π : 
.y-—n(tant —t), t τς, t= , etc. Determine the asymptote at ti: 
cost 2 2 f 
n n 
x(ti) = y(t1) 2 oo, k — i E ment tcost) = ma 
T 
n m sint — tcost— 1 nT 

b= lim n(tant — t) - — = n lm. ————————— = --—. For the asymptote 

t> 1/2 m cost t> 1/2 cost 2 

. n no τος n no 
(3.515c) gives y = —ax — z For the second asymptote, etc. one gets similarly y= ——a + z“ 
m m 


3. Functions Given in Explicit Form y = f (æ) 
The vertical asymptotes are at the points of discontinuity where the function f(x) has an infinite jump 
(see 2.1.5.3, p. 59); the horizontal and oblique asymptotes have the equation 
y — kz +b with k = lim Λία) b= lim [f(z) — kx] (3.516) 
5 ` zr2oo q ^? z—00 ac ` 
4. Functions Given in Implicit Polynomial Form F(z, y) = 0 
1. To determine the horizontal and vertical asymptotes one chooses the highest-degree terms with 
degree m from the polynomial expression in x and y, then one separates them as a function (x, y) and 
solves, if it is possible, the equation d(x, y) — 0 for x and y: 
$(r,y)—0 yields α--φ(ῳ), υ--ψί(α). (3.517) 
The values yı = a for x — oo give the horizontal asymptotes y — a; the values z1 — b for y — oo the 
vertical ones x = b, if the limits exist. 
2. To determine the oblique asymptotes the equation of the line y = kx + b is to be substituted into 
the equation F(x,y), then the resulting polynomial is to be ordered according to the powers of x: 


F(a, ka +) = filk, b)a™ + falk, ba" +- . (3.518) 
The parameters k and b can be yield, if they exist, from the equations 
filk,b)=0, falk,b) = 0. (3.519) 


Β z’ +’ — 3axy = 0 (Cartesian folium Fig. 2.59, 2.11.3, p. 96). Based on equation F(x, kx + b) = 
(1+ k?)x? +3(k?b— ka)z? +- - - according to (3.519), fı(k, b) = 1 +k? = 0 and f2(k, b) = k?°b— ka = 0 
with the solutions k = —1, b = —a the equation of the asymptote is y — —x — a. 

3.6.1.5 General Discussion of a Curve Given by an Equation 

Curves given by their equations (3.483)-(3.486) are investigated in order to know their properties and 
shapes. 

1. Construction of a Curve Given by an Explicit Function y — f(x) 

a) Determination of the domain (sce 2.1.1, p. 48). 


b) Determination of the symmetry of the curve with respect to the origin or to the y-axis checking 
if the function is odd or even (see 2.1.3.4, p. 51). 





c) Determination of the behavior of the function at -too by calculating the limits lim f(a) and 
Ώ-}-οο 


ΝΙΝ f(x) (see 2.1.4.7, p. 55). 


d) Determination of the points of discontinuity (see 2.1.5.3, p. 59). 


e) Determination of the intersection points with the y-axis and with the x-axis calculating 
f (0) and solving the equation f(x) = 0. 

f) Determination of maxima and minima and finding the intervals of monotonicity where the 
function is increasing or decreasing. 


g) Determination of inflection points and the equations of tangents at these points (see 3.6.1.3, 
p. 249). 
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With these data the graph of the function can be sketched, and where it is necessary, some values can 
be calculated to make it more precise. 


W Sketch the graph of the function y = ΝΕ 

a) The function is defined for all « except x = 0. 

b) There is no symmetry. 

c) For z — —oo follows y — 2, and obviously y — 2 — 0, i.e., approach from below, while z — oo also 
holds y — 2, but y — 2 4- 0, an approach from above. 

d) x = 0 is a point of discontinuity such that the function from left and also from right tends to —oo, 
because y is negative for small values of x. 

e) Because f (0) = —oo holds, there is no intersection point with the y-axis, and from f(x) = 2x? + 
3a — 4 = 0 the intersection points with the z-axis are at x, © 0.85 and rg & —2.35. 

f) A maximum is at x = 8/3 ~ 2.66 and here y © 2.56. 

g) An inflection point is at 7 = 4, y = 2.5 with the slope of the tangent line tana = —1/16. 

h) After sketching the graph of the function based on these data (Fig. 3.233) one can calculate the 
intersection point of the curve and the asymptote, which is at x = 4/3 ~ 1.33 and y = 2. 

2. Construction of a Curve Given by an Implicit Function F(z, y) = 0 

There are no general rules for this case, because depending of the actual form of the function different 
steps can be or cannot be performed. If it is possible, the following steps are recommended: 


a) Determination of all the intersection points with the coordinate axes. 

b) Determination of the symmetry of the curve, replacing x by —x and y by —y. 

c) Determination of maxima and minima with respect to the x-axis and then interchanging a and 
y also with respect to the y-axis (see 6.1.5.3, p. 444). 

d) Determination of the inflection points and the slopes of tangents there (see 3.6.1.3, p. 249). 
e) Determination of singular points (sce 3.6.1.3, 3., p. 250). 


f) Determination of vertices (see 3.6.1.3, 2., p. 250) and the corresponding circles of curvature (see 
3.6.1.2, 4., p. 246). It often happens that the curve's arc can hardly be distinguish from the circular 
segment of the circle of curvature on a relatively large segment. 


g) Determination of the equations of asymptotes (see 3.6.1.4, p. 252) and the position of the 
curve branches related to the asymptotes. 








evolvent 


Figure 3.233 Figure 3.234 Figure 3.235 


3.6.1.6 Evolutes and Evolvents 

1. Evolute 

The evolute is a second curve which is the locus of the centers of circles of curvature of the first curve 
(see 3.6.1.3, 5., p. 248); at the same time it is the envelope of the normals of the first curve (see also 


3.6 Differential Geometry 255 





3.6.1.7, p. 255). The parametric form of the evolute can be derived from (3.503) or (3.504) for the 
center of curvature if xc and yc are considered as running coordinates. If it is possible to eliminate 
the parameter (x, t or y) from (3.502), (3.503), (3.504), then the equation of the evolute is obtained in 
Cartesian coordinates. 





2x(1 4- 42? 
Bl Determine the evolute of the parabola y = z? (Fig. 3.234). From X = x — 2 = 
1+ 42? 1+ 62? 
4x?, Y = 2? 4 9 ας 5 Z and considering X and Y as running coordinates the evolute 
1 Xx 
Y=- F3 (3) 
2 4 


2. Evolvent or Involute 
'The evolvent, also called involute, of a curve 75 is a curve 7}, whose evolute is l3. Here every normal 


PC of the evolvent is a tangent of the evolute (Fig. 3.234), and the length of arc CC, of the evolute 
is equal to the increment of the radius of curvature of the evolvent: 


CC;- B.C, - PC. (3.520) 
These properties show that the evolvent 71 can be regarded as the curve traced by the end of a stretched 
thread unspooling from 7/5. A given evolute corresponds to a family of curves, where every curve is de- 
termined by the initial length of the thread (Fig. 3.235). 

The equation of the evolute is obtained by the integration of a system of differential equations corre- 
sponding to its evolute. For the equation of the evolvent of the circle see 2.14.4, p. 106. 


W The catenoid is the evolute of the tractrix; the tractrix is the evolvent of the catenoid (see 2.15.1, 





p. 108). 
2 Q7 
a) 
ΝΥ d) 
b) 
a) c) rr 
Figure 3.236 Figure 3.237 


3.6.1.7 Envelope of a Family of Curves 


1. Characteristic Point 
Consider the one-parameter family of curves with equation 

F(x,y,a) — 0. (3.521) 
Every two close curves of this family corresponding to the values of parameter a and a + Aa have points 
K of nearest approach. Such a point is either a point of intersection of the curves (a) and (a + Aa) or 
a point of the curve (a) whose distance from the curve (a + Aq) along the normal is an infinitesimal 
quantity of higher order than Aa (Fig. 3.236a,b). For Aa — 0 the curve (a + Aq) tends to the curve 
(a), where in some cases the point ΚΝ approaches a limit position, the characteristic point. 
2. Geometric Locus of the Characteristic Points of a Family of Curves 
Equation (3.521) may represent one or more curves. They are formed by the approaching points or by 
the boundary points of the family (Fig. 3.237a), or they form an envelope of the family, i.e., a curve 
which contacts tangentially every curve of the family (Fig. 3.237b). A combination of these two cases 
is also possible (Fig. 3.237c,d). 
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3. Equation of the Envelope 

The equation of the envelope can be calculated from (3.521), where o can be eliminated from the fol- 
lowing equation system: 
OF — 
m^ 


HM Determine the equation of the family of 


vil 
straight lines arising when the ends of a line seg- 
ment AB with |AB| = l are sliding along the 
coordinate axes (Fig. 3.238a). The equation 
. . T y 1 
of the family of curves is — + = 1 y 
lsina  lcosa x 
or F = xcosa + ysina — lsinacosa — 0, A 1 
OF : 2 i3 
m —xsina+ycosa—Icos* a+lsin* a = 0. 
a 
Bx b) 


Eliminating a gives z?/? 4- y?/? — [2 asan en- — a) ῷ 
velope, which is an astroid (Fig. 3.238b, see 
also p. 103). Figure 3.238 


F=0, 0. (3.522) 








3.6.2 Space Curves 

3.6.2.1 Ways to Define a Space Curve 

1. Coordinate Equations 

To define a space curve there are the following possibilities: 

a) Intersection of Two Surfaces: F(r,y,z) 20, d(v,y,z) — 0. (3.523 
b) Parametric Form: x= z(t), y=y(t), z= z(t), (3.524 
with t as an arbitrary parameter; mostly t = x, y or z are used. 

c) Parametric Form: «=2(s), y=y(s), z= 2(s), (3.525a 
with the arc length s between a fixed point A and the running point P as the parameter: 


t 2 2 2 

u dx dy dz em 

s = (5) + (2) + (2) dt. (3.525b 
0 


2. Vector Equations 
With r as radius vector of an arbitrary point of the curve (see 3.5.1.1, 6., p. 182) the equation (3.524 
can be written in the form 

r=r(t), where r(t) = x(t)i - y(t)j 4- z(t)k, (3.526 
and (3.525a) in the form 

F=7(s), where £(s) =2(s)i+y(s)j + 2(s)k. (3.527 
3. Positive Direction 
This is the direction of increasing parameter t for a curve given in the form (3.524) and (3.526); for 
(3.525a) and (3.527) it is the direction in which the arc length increases. 


3.6.2.2 Moving Trihedral 
1. Definitions 


At every point P of a space curve can be defined three lines and three planes, apart from singular points. 
They intersect each other at the point P, and they are perpendicular to each other (Fig.3.239). 
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1. Tangent is the limiting position of the secants PN 
for N — P (Fig.3.220), p. 244. 

2. Normal Plane is a plane perpendicular to the tan- 
gent (3.239). Every line passing through the point P and 
contained by this plane is called a normal of the curve at the 
point P. 

Pn ---- 3. Osculating Plane is the limiting position of the 
planes passing through three neighboring points M, P and 
N, for N — P and M — P (Fig.3.240). The tangent line 
incipal normal is contained by the osculating plane. 

4. Principal Normal is the intersection line of the nor- 
mal plane mal and the osculating plane, i.e., it is the normal contained 
by the osculating plane. 

5. Binormal is the line perpendicular to the osculating 
Figure 3.239 plane through the point P. 


binormal rectifying plane 








6. Rectifying Plane is the plane spanned by the tangent and binormal 
lines. 

The positive directions on the lines (1.), (4.) and (5.) are defined as follows: 
7. Moving trihedral The positive directions on the lines tangent, prin- 
cipal normal and binormal are defined as follows: 


a) On the tangent line it is given by the positive direction of the curve; the 
Figure 3.240 unit tangent vector t has this direction. 
b) On the principal normal it is given by the sign of the curvature of the curve, and given by the unit 
normal vector ri. 


c) On the binormal it is defined by the unit vector 


b-txi, (3.528) 
where the three vectors t, ii, and b form a right-handed rectangular coordinate system, which is called 
the moving trihedral. 

2. Position of the Curve Related to the Moving Trihedral 


For the usual points of the curve the space curve is on one side of the rectifying plane at the point P, and 
intersects both the normal and osculating planes (Fig.3.241a). The projections of a small segment of 
the curve at the point P on the three planes have approximately the following shapes: 


1. On the osculating plane it is similar to a quadratic parabola (Fig.3.241b). 

2. On the rectifying plane it is similar to a cubic parabola (Fig.3.241c). 

3. On the normal plane it is similar to a semi cubical parabola (Fig.3.241d). 

If the curvature or the torsion of the curve are equal to zero at P or if P is a singular point, i.e., if 
a(t) = y(t) = z'(t) — 0 hold, then the curve may have a different shape. 








5 E F 
=< > 
t t "n 
b) c) d) 


Figure 3.241 
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3. Equations of the Elements of the Moving Trihedral 
1. The Curve is Defined in the Form (3.523) For the tangent see (3.529), for the normal plane 
see (3.530): 


X-r Υ-ν 4-2 - X-zY-yZ-z 
jor aF|-[ar aF| jor or; G9?) OF OF OF 
ὃν ðz| [ὃς ὃτι |ὃτ ὃν Ox Oy Oz |=0. (3.530) 
δ4 ὑφ! |o» e| os ot ae os oð 
Oy Oz| [181 ὃτ |Ox Əy Or Oy Oz 


Here x, y, z are the coordinates of the point P of the curve and X,Y, Z are the running coordinates of 
the tangent or the normal plane; the partial derivatives belong to the point P. 

2. The Curve is Defined in the Form (3.524), (3.526) In Table 3.27 the coordinate and vector 
equations belonging to the point P are given with z, y, z and also with r. The running coordinates and 
the radius vector of the running point are denoted by X,Y, Z and R. The derivatives with respect to 
the parameter f£ refer to the point P. 

3. The Curve is Defined in the Form (3.525a, 3.527) If the parameter is the arc length s, for 
the tangent and binormal, and for the normal and osculating plane the same equations are valid as in 
case 2, just t must be replaced by s. The equations of the principal normal and the rectifying plane will 
be simpler (Table 3.28). 


3.6.2.3 Curvature and Torsion 
1. Curvature of a Curve and Radius of Curvature 


The curvature of a curve at the point P is a measure which describes the 
deviation of the curve from a straight line in the very close neighborhood of 


this point. The exact definition is by the help of the tangent vector t = aS 
9 


(see Fig.3.242): 





AË R — 
K= lm —=—. (3.531) Figure 3.242 


PN 20 PN ds 
1. Radius of Curvature The radius of curvature is the reciprocal value of the absolute value of the 


curvature: 
1 








p= > (3.532) 
2. Formulas to calculate K and p 
a) If the curve is defined in the form (3.5252): 
P er ο TS 
K| E a 11/2 + y? + gm. (3.533) 
ds? 








where the derivatives are with respect to s. 
b) If the curve is defined in the form (3.524): 


drj (du (der 
m dt dt? dt dt? (x? 4 y? dE at yat a y? a 2/2) — (z'z" abs yy" 4 227. 
K = C PE = (a2 F y2 F z2) 
r 
a) 


The derivatives are calculated here with respect to t. 





(3.534) 
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Table 3.27 Vector and coordinate equations of accompanying configurations of a space curve 










































































Vector equation Coordinate equation 
Tangent: 
R-roAE X-v Y-y Z-z 
dt g' y a 
Normal plane: 
= dr 
(R-i),-0 z'(X —z)-y(Y—-y)-z(Z—2)20 
Osculating plane: 
= d? r i X-zY-yZ-—z 
—p——-20 2 y z |=0 
( a dt? x" y" ^I 
Binormal: 
3 dr dr Z-z 
R=r+à| -xX = 
(7 35) voy 
q" y" 
Rectifying plane: 
ια Φε moms gane 
(π-0-!πεΧπα]- i a y Zz |=0, 
dt \dt dt m s 
with l = y'z' —y'z 
m = 2a" — "t, 
n= aly!" 2s ally! 
Principal normal: 
> dr dr dr Y-y Z-z 
R=r+A— x | — x — = = 
dt (s τ) y z 2! α' m 
m n nl lm 
T position vector of the space curve, R position vector of the accomp. configuration 
! For the mixed product of three vectors see 3.5.1.6, p. 185 








Tabelle 3.28 Vector and coordinate equations of accompanying configurations 


























as functions of the arc length 
| Element of trihedral | Vector equation Coordinate equation 
3 dy X-2 Y-: Z-z 
Principal normal R=r+ MS x y y τη 
eos D E dr "(ντ "(ντ n 
Rectifying plane (R- D Ξ0 |z"(X—z)-c-y'(Y y) -z"(Z—2)-0 
ds 
Y position vector of the space curve, R position vector of the accomp. configuration 
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3. Helix The equations 

1 -—acost, y=asint, z=bt (a>0,b>0) (3.535) 
describe a so-called helix (Fig.3.243) as a right screw. If the observer is 
looking into the positive direction of the z-axis, which is at the same time 
the axis of the screw, then the screw climbs in a counter-clockwise direction. 
A helix winding itself in the opposite orientation is called a left screw. 
WB Determine the curvature of the helix (3.535). The parameter t is to 


be replaced by s = tya? +b. Then holds xv = acos———— AES 





E bs a 
Sin ————— , £ — —————— , and according to (3.533), K — ————, p— 
Va? +l? va? +b? gto( ) a? +b? d 
2 b2 
E . Both quantities K and p are constants. Figure 3.243 





Another method, without the parameter transformation in (3.534), produces the same result. 


2. 'Torsion of a Curve and Radius of the Torsion Circe 
The torsion of a curve at the point P is a measure which describes the deviation of the curve from 
a plane curve in the very close neighborhood of this point. The exact definition by the help of the 


binormal vector b (3.528), p. 257 (sce also Fig.3.244) is: 








Ab db 
T= lim = ds (3.536) 
PuNo PN “S 
The radius of torsion is 
1 
T= =. (3.537) 
IT] Figure 3.244 
1. Formulas for Calculating T and T 
a) If the curve is defined in the form (3.525a): 
a y z 
a" y 
dr a d q" tatt g!" 
T=p 
P (2 ds? E (2/2 4- y? 4- 272)" (3:538) 


where the derivatives are taken with respect to s. 
b) If the curve is defined in the form (3.524): 





$ r y 2 
dé PF BE a" y” z" 
νεα. gl! m 21 
T= pdt d? d? _ 2 y -. (3.539) 
3 (0 + y? + 22)3 





dr 
dt 
where p should be calculated by (3.532) and (3.533). 
The torsion calculated by (3.538), (3.539) can be positive or negative. In the case T > 0 an observer 
standing on the principal normal parallel to the binormal sees that the curve has a right turn; in the 


case T < 0 it has a left turn. 
Β The torsion of a helix is constant. With the notation R for the right screw and L for the left screw 








*For the mixed product of three vectors see 3.5.1.6, 2., p. 185. 
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the torsion is 

—asint acost b 

—acost —asint 0 
a? +b? asint —acost 0 b a? +b? b 

E = T ; —— : 

B a ((—a sin t)? 4- (a cos t)? 4- pp a? + b? b 5 a2 +b 

3. Frenet Formulas 4 

The derivatives of the vectors t, n, and b can be expressed by the Frenet formulas: 
t 3 di t€ b db ç 
meg tocco, (3.540) 
ds p ds p ds T 














Here p is the radius of curvature, and 7 is the radius of torsion. 
4. Darboux Vector 
The Frenet formulas (3.540) can be represented in the clearly arranged form (Darboux formulas) 


di 3 2 dü 4 db + - 
=dxt =dx =dx b. .541 

ds ! ds n, ds ο. 
Here d is the Darboux vector, which has the form 

2 qp tH 

d= -t + -b. (3.542) 

T p 

Remarks: 


1. With the help of the Darboux vector the Frenet formulas can be interpreted in the sense of kinematics 
(see [3.4]). 


2. The modulus of the Darboux vector equals the so-called total curvator A of a space curve: 
1 1 i 
à= 7 Fo Id]. (3.543) 
3.6.3 Surfaces 
3.6.3.1 Ways to Define a Surface 


1. Equation of a Surface 
Surfaces can be defined in different ways: 


a) Implicit Form: F(zv,y,z) — 0. (3.544) 
b) Explicit Form : z — f(z,y). (3.545) 
c) Parametric Form : v = z(u,v), y= y(ļu,v), z= z(u,v). (3.546) 
d) Vector Form : F=F(u,v) with T= π(ω, ο) + y(u, v)j + z(u, v)k. (3.547) 


Running the parameters u and v over all allowed values gives the coordinates and the radius vectors 
of all points of the surface from (3.546) and (3.547). The elimination of the parameters u and v from 
the parametric form (3.546) (if possible) yields the implicit form (3.544). The explicit form (3.545) is 
a special case of the parametric form with u = x and v = y. 


Β The equation of the sphere in Cartesian coordinates, parametric form, and vector form (Fig. 3.246): 
r? +y? +z? —a?=0; (3.548a) x = acosusinv, y = asinusinv, z = acosv; (3.548b) 
F = a(cosu sin vi + sin usin vj + cos vk). (3.548c) 
2. Curvilinear Coordinates on a Surface 


If a surface is given in the form (3.546) or (3.547), and the values of the parameter wu can be changed 
while the other parameter v = vo is fixed, then the points r(x, y, z) describe a curve r = r(u, vo) on 
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the surface. Substituting for v different but fixed values v — v1, v — v3,..., v — v, one after the other, 
gives a family of curves on the surface. When moving along a curve with v — const only u is changing, 
this curve is called the u-line (Fig. 3.245). Analogously one gets another family of curves, the v-lines, 
by varying v and keeping u = const fixed with wy, u2,..., Un. In this way a net of coordinate lines is 
defined on the surface (3.546), where the two fixed numbers u = u; and v = vp are the curvilinear or 
Gauss coordinates of the point P on the surface. 

If a surface is given in the form (3.545), the coordinate lines are the intersection curves of the surface 
with the planes x = const and y = const. With equations in implicit form F(u,v) = 0 or with the 
parametric equations u = u(t) and v = v(t) of these coordinates, curves on the surfaces can be defined. 
W In the parametric equations of the sphere (3.548b,c) u means the geographical longitude of a point 
P, and v means its polar distance. The v lines are here the meridians APB; the u lines are the parallel 
circles CPD (Fig. 3.246). 


3.6.3.2 Tangent Plane and Surface Normal 

1. Definitions 

1. Tangent Plane The precise general mathematical definition of the tangent plane is rather com- 
plicated, so here the investigation is to be restricted to the case, when the surface is defined by two 
parameters. Suppose, for a neighborhood of the point P(x, y, z), the mapping (u, v) — r(u, v) is in- 


: . PRUO P or 5 Or . 
vertible, the partial derivatives r, = Bu andr, = δν are continuous, and not parallel to each other. 
u 


u ) 
Then P(x, y, z) is called a regular point of the surface. If P is regular, then the tangents of all curves 
passing through P, and having a tangent here, are in the same plane, and this plane is called the tangent 
plane of the surface at P. If this happens, the partial derivatives ru, rẹ are parallel (or zero) only for 
certain parametrization of the surface. If they are parallel for every parametrization, the point is called 
a singular point (see 3., p. 263). 





surface normal 





Figure 3.245 Figure 3.246 Figure 3.247 

2. Surface Normal The line perpendicular to the tangent plane going through the point P is called 
surface normal at the point P (Fig. 3.247). 
3. Normal Vector The tangent plane is spanned by two vectors, by the tangent vectors 
, Or , OF 

Te Bu? δου 
of the u- and v-lines. The vector product of the tangent vectors f, x T, is a vector in the direction of 
the surface normal. Its unit vector 

ILI. 

| £, x Εν | 





(9.549) 
No = (3.549b) 


is called the normal vector. Its direction to one or other side of the surface depends on which variable 
is the first and which one is the second coordinate among u and v. 
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2. Equations of the Tangent Plane and the Surface Normal (see Table 3.29) 
W A: For the sphere with equation (3.548) is valid: 


a) as tangent plane: 2”(X — x) + 2y(Y — y) + 22(Z — z) =0 or aX +y¥ +2Z—a? =0, (3.550a) 
τα Y-y Z-z X Y Z 
B = or — = : 








b) as surface normal: om 2j 2; gU e (3.550b) 

W B: For the sphere with equation (3.548b) is valid: 

a) as tangent plane: X cosusinv + Y sin usinv + Z cosv — a, (3.550c) 
X Y Z 

b) as surface normal: ------------------ (3.550d) 





cosusinv  sinusinv σοβυ᾽ 


3. Singular Points of the Surface 
If for a point with coordinates © = zi, y — yi, z — zi of the surface with equation (3.544) all the 
equalities 

OF OF OF 

— = 0, — = 0, — = 0, F(z,y,z)=0 3-551 

Fe 7 gy =O ge m Fes) (3.551) 
are fulfilled, i.e., if every first-order derivative is zero at the point P(1, γι, 21), then this point is called 
a singular point. All tangents going through here do not form a plane, but a cone of second order with 
the equation 











OF OF OF OF 
E (X — a4 Oy? (Y - s)? 4 ag (Z a)’ + 2a (x x1)(Y — νι) 
OF OF 
au yi)(Z — 2) 4 25:0: 4 z1)(X — 21) — 0, (3.552) 





where the derivatives belong to the point P(x1, y:, 21). If also the second derivatives are equal to zero 
at this point, then there is a more complicated type of singularity. Then the tangents form a cone of 
third or even higher order. 


3.6.3.3 Line Elements of a Surface 


1. Differential of Arc 
Consider a surface given in the form (3.546) or (3.547). Let P(u, v) an arbitrary point and N(u+du, v+ 


dv) another one close to P, both on the surface. The arclength of the arc segment PN on the surface 
can be approximately calculated by the differential of an arc or the line element of the surface with the 
formula 

ds* = Edu? + 2F dudv + G dv’, (3.553a) 
where the three coefficients 


δελ” Oy E az\" OxOx  OyOy | OzOz 
ee [9 UL EORR - y 01 
B = (=) (25) T (2) p Ven Qu Qv T Ou Ov + ðu ðv ’ 


2 2 2 
Ox Oy Oz 
m 22 = di e 
G=r = (5) (2) + (2) (3.553b) 
are calculated at the point P. The right-hand side (3.553a) is called the first quadratic fundamental 


form of the surface. 
W A: For the sphere given in the form (3.548c) clearly 


E=oa'sin?v, F=0, G=a?, ds* =a?(sin?vdu? + dv’). (3.554) 
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W D: For a surface given in the form (3.545) clearly 
Oz Oz 


E=1+4+p*, F=pq, G=14+¢ with p=, q=. 
Ox Oy 


Table 3.29 Equations of the tangent plane and surface normal 
































Type of equation Tangent plane Surface normal 
OF OF Χ-α Y-y Z-z 
—(X- —(Y — εσας πο ος 
(3.544) Jz ( ) δι (Y - y) ΖΡ ar BF 
OF Ox Oy Oz 
+—(Z-2)= i y 
Oz 
X -=i Y —1 Z—z 
(3.545) Z—-z-p(X a) q(Y — y) EIAS ας 
p q =l 
X-rY-yZ-z X-2  Y-y _ Z-z 
Or Oy 3e [όν =] ~ [oe Be] ~ [Or By] 
(3.546) Ou ðu ðu |=0 ðu ðu ðu ðu ðu ðu 
ðv ðv Ov Ov Ov Ov Ov Ov Ov 
(3.547) m E DEUS EQ R Ji A κ 
or (R-r)N=0 or R=r+ AN 
In this table z, y, z and r are the coordinates and radius vector of the points P of the curve; 
X,Y,Z and R are the running coordinates and radius vectors of the points of the tangent 
ὃ ὃ > 
plane and surface normal in the point P; furthermore p = as q= 9s and N is the 
x y 
normal vector. Besides ry = . and r} = δ p= q= P $ 
1 For the mixed product of three vectors see 3.5.1.6, 2., p. 185 








2. Measurement on the Surface 
1. The Arclength ofasurface curve u = u(t), v = v(t) for to < t < tı is calculated by the formula 


5 5 du? du dv dv V? 

i= / ds =/ E n +G dt. (3.556) 
i j dt dt 
0 0 





dt dt 
2. The Angle Between Two Curves r; — r(ui(t), v(t)) and r9 — 
T(u2(t), v2(t)) on the surface T — r(u, v) isthe angle between their tangents 
with the direction vectors f, and f (Fig. 3.248). It is given by the formula 
îi fo 
Vf, & 
2222 S ow a ἡ 2 > 
Eù ù, + F(üjvo 4- v1à3) 4- Gv1V2 


Figure 3.248 V Eá; 4 2Fü,V, 4 GV, V Eà; t 2Füsvs - GV; 


cosa = 








(3.557) 








Here the coefficients E, F and G are calculated at the point P and th, ty, V, and V3 represent the first 
derivatives of u4(t), u(t), v1(t) and vo(t), calculated for the value of the parameter t at the point. P. 
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If the numerator of (3.557) vanishes, the two curves are perpendicular to each other. The condition of 
orthogonality for the coordinate lines v — const and u — const is F — 0. 

3. The Area ofa Surface Patch S bounded by an arbitrary curve which is on the surface can be 
calculated by the double integral 


s= I ds (3.5584) with dS — VEG — F?du dv. (3.558b) 
(S) 


dS is called the surface element or element of area. 

The calculation of length, angle, and area on a surface is possible with (3.556), (3.557), (3.558a,b) if the 
coefficients E, F, and G of the first fundamental form are known. So the first quadratic fundamental 
form defines a metric on the surface. 

3. Applicability of Surfaces by Bending 

If a surface is deformed by bending, without stretching, compression or tearing, then its metric remains 
unchanged. In other words, the first quadratic fundamental form is invariant under bendings. Two 
surfaces having the same first quadratic fundamental form can be rolled onto each other. 


ON 
2 
i AZ 





Figure 3.249 


3.6.3.4 Curvature of a Surface 


1. Curvatures of Curves on a Surface 

If different curves I’ are drawn through a point P of the surface (Fig. 3.249), their radii of curvature 
p at the point P are related as follows: 

1. The Radius of Curvature p of a curve I’ at the point P is equal to the radius of curvature of a 
curve C, which is the intersection of the surface and the osculating plane of the curve 7" at the point P 
(Fig. 3.2492). 

2. Meusnier's Theorem For every plane section curve C of a surface (Fig. 3.249b) the radius of 
curvature can be calculated by the formula 


p = Rcos(ii, N). (3.559) 
Here R is the radius of curvature of the normal section Cnorm, which goes through the same tangent 
NQ as C and also contains the unit vector N of the surface normal; X(n, N) is the angle between 
the unit vector ri of the principal normal section. of the curve C and the unit vector N of the surface 
normal. The sign of p in (3.559) is positive, if N is on the concave side of the curve Cnorm and negative 
otherwise. 
3. Euler's Formula The curvature of a surface for every normal section Cnorm at the point P can 
be calculated by the Euler formula 


1 cosa ας sin? a (3.560) 
R R Hy | 
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where R; and R2 are the radii of principal curvatures (see (3.562a)) and a is the angle between the 
planes of the section C and C| (Fig. 3.249c). 


2. Radii of Principal Curvature 

The radii of principal curvatures of a surface are the radii with maximum and minimum values. They 

can be calculated by the principal normal sections Cı and C2 (Fig. 3.249c). The planes of C1? and C3 
. "T as di ; 

are perpendicular to each other, their directions are defined by the value B which can be calculated 


from the quadratic equation 


dy dy 


2 
d) ete -ro edad 


where the parameters p, q, r, s, t are defined in (3.562b). If the surface is given in the explicit form 
(3.545), Ry and fis are the roots of the quadratic equation 


(rt — sS?) R? - h[2pqs — (1+ p?)t-(1+@)r]R+h*=0 with (3.562a) 


Oz Oz Oz Oz Pz 
pere By’ Re: P Dem t= Oy? and h=V/1l+p?+@.  (3.562b) 


The signs of R, Ry and R3 can be determined by the same rule as in (3.559). 
If the surface is given in vector form (3.547), then instead of (3.561) and (3.562a) the corresponding 
equations hold 








[ipa = sti +) È H [s(1 + p°) — rpq] = 0, (3.561) 

















2 
(GM Εν) (αι) Fat EN): r (FL— EM) — 0, (3.5632) 


du qu 

(LN — M?)R? — (EN — 2FM -- GD)R 4 (EG — F?) — 0, (3.563b) 
with the coefficients L, M, N of the second quadratic fundamental form. They are given by the equal- 
ities 

E d E d! E d" 
L=fyR , M-r,K , N-rQE ' 
VEG — F? VEG — F? VEG — F? 

Here the vectors Fuu, ,,, and r,, are the second-order partial derivatives of the radius vector r with 
respect to the parameters u and v. In the numerators there are the determinants 

















(3.563c) 














Ou? Ou? ðu? OuOv Oudv OuOv Ov? Ov? Ov? 
ἀ-|9α Oy δὶ gal Oe Oy z| gw | Ox ὃν Oz! (3 5634) 
Ou Ou Ou Ou ðu Ou Ou Ou Ou 
Ov Ov Ov Ov Ov Ov ðv Ov Ov 
The expression 
Ldu? + 2Mdudu + Ndv* (3.563e) 


is called the second quadratic fundamental form. It contains the curvature properties of the surface. 
Lines of curvature are the curves on the surface which have the direction of the principal normal section 
at every point. Their equations can be got by integrating (3.561) or (3.563a). 

3. Classification of the Points of Surfaces 


1. Elliptic and Umbilical Points If at a point P the radii Ry and Ry of the principal curvatures 
have the same sign, then every point of the surface is on the same side of the tangent plane in a close 
neighborhood of this point, and P is called an elliptic point (Fig. 3.250a) on the surface. This fact 
can be expressed analytically by the relation 


LN — M? » 0. (3.5642) 
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a) 





Figure 3.250 


2. Circular or Umbilical Point This is the point P of the surface where the radii of the principal 
curvatures are equal: 

Ry = Ro. (3.564b) 
Then for the normal sections R = const. 
3. Hyperbolic Point In the case of different signs of the radii Ry and Rez of the principal curvatures 
the concave sides of the principal normal sections are in opposite directions. The tangent plane inter- 
sects the surface, so the surface has a saddle form in the neighborhood of P. P is called a hyperbolic 
point (Fig. 3.250b); the analytic mark of this point is the relation 


LN — M? « 0. (3.564c) 
4. Parabolic Point If one of the two radii of principal curvatures R; or Rə is equal to oo, then either 
one of the principal normal sections has an inflection point here, or it is a straight line. At P there is a 
parabolic point (Fig. 3.250c) of the surface with the analytic mark 

LN — M? « 0. (3.564d) 
E ^l] the points of an ellipsoid are elliptic, those of a hyperboloid of one sheet are hyperbolic, and those 
of a cylinder are parabolic. 
4. Curvature ofa Surface 
Two quantities are used mostly to characterize the curvature of a surface: 


1/1 1 
1. Mean Curvature of a surface at the point P: H = = (= + x) i (3.565a) 
2\Ri Re 
1 


z 
; 565k 
RRs (3.565b) 


2. Gauss Curvature of a surface at the point P: K = 





E A: For the circular cylinder with the radius a these are H = πα and K — 0. 
a 


E B: For elliptic points K > 0, for hyperbolic points kK < 0, and for parabolic points A = 0 hold. 
3. Calculation of H and K, ifthe surface is given in the form z = f(x,y): 


r(1 + q’) — 2pqs + t(1 + p°) s .  ri-s : 
arera ~ E K= (3.566b) 


pg 
For the meaning of p, q, r, s, t see (3.562b). 
4. Classification of the Surfaces According to their Curvature 


H = 


1. Minimal Surfaces are surfaces with a zero mean curvature H at every point, i.e., with Rı = — R2. 
2. Surfaces of Constant Curvature have a constant Gauss curvature K = const. 
W A: K > 0, for instance the sphere. 
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WB B: K - O, for instance the pseudosphere (Fig. 3.251), i.e., the 
surface obtained by rotating the tractrix around the symmetry axis 
(Fig. 2.79). 


3.6.3.5 Ruled Surfaces and Developable Surfaces 
1. Ruled Surface 


A surface is called a ruled surface if it can be represented by moving a 
line in space. 

2. Developable Surface 

If a ruled surface can be developed upon a plane, i.e., rolled out without Figure 3.251 
stretching or contracting any part ofit, it is called a developable surface. 

Not every ruled surface is developable. 

Developable surfaces have the following characteristic properties: 





a) For all points the Gauss curvature is equal to zero, and 

b) if the surface is given in the explicit form z = f(a, y) the conditions of developability are fulfilled: 
a) K=0,  b) rt—s?=0. (3.567) 

For the meaning of r, t, and s see (3.562b). 

W A: The cone (Fig. 3.192) and cylinder (Fig. 3.198) are developable surfaces. 

BW B: The hyperboloid of one sheet (Fig. 3.196) and hyperbolic paraboloid (Fig. 3.197) are ruled 

surfaces but they cannot be developed upon a plane. 

3.6.3.6 Geodesic Lines on a Surface 


1. Concept of Geodesic Line 
(see also 3.4.1.1, 3., p. 160). A theoretic curve of the surface can go through every point P(u, v) of the 


surface in every direction determined by the differential quotient τ and it is called a geodesic line. It 
has the same role on the surface as the straight line on the plane, and it has the following properties: 
1. Geodesic lines are the shortest curves between two points on the surface. 

2. If a material point is moving on a surface drawn by another material point on the same surface, and 
no other forces have influence on it, then it is moving along a geodesic line. 

3. If an elastic thread is stretched on a given surface, then it has the shape of a geodesic line. 

2. Definition 


A geodesic line on a surface is a curve such that its principal normal at every point has the same direction 
as the surface normal. 


W On a circular cylinder the geodesic lines are circular helices. 
3. Equation of the Geodesic Line 


If the surface is given in the explicit form z — f(x, y), the differential equation of the geodesic lines is 
a, mg dy V ΓΝ dy 

++p +g) = pt + (2ps — qt) + (pr — 2qs) qr. (3.568) 
da? da: da: da 


If the surface is given in parametric form (3.546), then the differential equation of the geodesic lines is 
fairly complicated. For the meaning of p, q, r, s, and t see (3.562b). 





4 Linear Algebra 


4.1 Matrices 
4.1.1 Notion of Matrix 
1. Matrices A of Size (m, n) or Briefly A(m,n) 


are systems of m times n elements, e.g., real or complex numbers, or functions, derivatives, vectors, 
arranged in m rows and n columns: 


jj 042 `t: Ain «+ Ist row 
Acus a21 AH B d «+  2ndrow (i 
amı ama το bx + m-th row 
qo Um T 
Ist 2nd n-th column. 


With the notion size of a matrix matrices are classified according to their number of rows m and number 
of columns n: A of size (m,n). A matrix is called a square matriz if the number of rows and columns is 
equal, otherwise it is a rectangular matrix. 

2. Realand Complex Matrices 

Real matrices have real elements, complex matrices have complex elements. If a matrix has complex 
elements 


Guy T ib, (4.2a) 
it can be decomposed into the form 
A+iB (4.2b) 


where A and B have real elements only (arithmetical operations see 4.1.4, p. 272). 
Ifa matrix A has complex elements, then its conjugate complex matrix A* has the elements 


ᾱμν = Re (Qu) — ilm (ayy). (4.2c) 
3. Transposed Matrices AT 


Changing the rows and columns of a matrix A of size (m,n) gives the transposed matrix AT. This has 
the size (n,m) and 


(ayy) = (aw) (4.3) 
is valid. 
4. Adjoint Matrices 
The adjoint matriz AF. of a complex matrix A is the transpose of its conjugate complex matrix A* 
(which can not be confused with the adjoint matrix A55, see 4.2.2, p. 278): 

AP — (A*)T, (4.4) 
5. Zero Matrix 


A matrix 0 is called a zero matrix if it has only zero elements: 


00---0 
00---0 

0-|.: i14 (4.5) 
00-0 
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4.1.2 Square Matrices 
1. Definition 


Square matrices have the same number of rows and columns, i.e., m = n: 
011 777 Gin 
A= Amys | ic i| (4.6) 
Ani 7 Gnn 
The elements a,, of a matrix A in the diagonal from the left upper corner to the right lower one are 
the elements of the main diagonal. They are denoted by a11, a2, . .. , nn, i-e., they are all the elements 
Quy with y =v. 
2. Diagonal Matrices 


A square matrix D is called a diagonal matrix if all of the non-diagonal elements are equal to zero: 


ay, 0 0 ayy ο 
0 ag- 0 a 

Gjy, —0 fo u£v: D-|. = zx = ; i : (4.7) 
0 0 pa ὃν Oo Ann 


3. Scalar Matrix 
A diagonal matrix S is called a scalar matriz if all the diagonal elements are the same real or complex 
number c: 


c0---0 
Ος:::0 

ο... (4.8) 
00--.c 


4. Trace or Spur of a Matrix 


For a square matrix, the trace or spur of the matrix is defined as the sum of the main diagonal elements: 


Tr (A) = an 9 a22 9... + Ann = Ἂς Opp- (4.9) 
μ-ι 


5. Symmetric Matrices 
A square matrix A is symmetric if it is equal to its own transpose: 


A — AT, (4.10) 
For the elements lying in symmetric positions with respect to the main diagonal 

Gju = yy (4.11) 
is valid. 


6. Normal Matrices 
satisfy the equality 
AFA = AAF. (4.12) 
(For the product of matrices see 4.1.4, p. 272.) 
7. Antisymmetric or Skew-Symmetric Matrices 
are the square matrices A with the property: 


A — —AT, (4.13a) 
For the elements a,, of an antisymmetric matrix the equalities 
ᾱμν — --ἂνμ, App =Q (4.135) 


are valid, so the trace of an antisymmetric matrix vanishes: 


Tr (A) — 0. (4.13c) 
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The elements lying in symmetric positions with respect to the main diagonal differ from each other only 
in sign. 

Every square matrix A can be decomposed into the sum of asymmetric matrix A, and an antisymmetric 
matrix Ags: 


A=A,+A,, with A = PA + A™); Aas = L(a — AT). (4.13d) 


8. Hermitian Matrices or Self- Adjoint Matrices 
are square matrices A equal to their own adjoints : 

A = (A*)T — AB, (4.14) 
Over the real numbers the concepts of symmetric and Hermitian matrices are the same. The determi- 
nant of a Hermitian matrix is real. 
9. Anti-Hermitian or Skew-Hermitian Matrices 
are the square matrices equal to their negative adjoints: 


A — —(A*)T — — AE, (4.15a) 
For the elements a,, and the trace of an anti-Hermitian matrix the equalities 
Aw = —AQy, Gp = 0; Tr(A) =0 (4.15b) 


are valid. Every square matrix A can be decomposed into a sum of a Hermitian matrix Ay and an 
anti-Hermitian matrix Aan: 
A=A,+A,n with A, = 


(A +A"), Am = Z(A — AB). (4.156) 


NI = 
NI = 


10. Identity Matrix I 
is a diagonal matrix such that every diagonal element is equal to 1 and all of the non-diagonal elements 
are equal to zero: 


iosi 
01-..0 : 0 for V, 

[νετ τσ) νι ταν (4.16) 
Diss 


The symbol dy, is called the Kronecker symbol. 
11. Triangular Matrix 


1. Upper Triangular Matrix, U, is a square matrix such that all the elements under the main 
diagonal are equal to zero: 


R= (rw) with rj, for p>v. (4.17) 


2. Lower Triangular Matrix, L, is a square matrix such that all the elements above the main 
diagonal are equal to zero: 
L-(l,) with l,,—0 for μεν. (4.18) 


4.1.3 Vectors 


Matrices of size (n, 1) are one-column matrices or column vectors of dimension n. Matrices of size (1, n) 
are one-row matrices or row vectors of dimension n: 


ay 
α 
Column Vector: a — E , (4.19a) Row Vector: aT — (a4,a5,...,a4). (4.19b) 


απ 


By transposing, a column vector is changed into a row vector and conversely. A row or column vector 
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of dimension n can determine a point in the n dimensional Euclidean space IR". 
'The zero vector is denoted by 0 or 0 respectively. 


4.1.4 Arithmetical Operations with Matrices 


1. Equality of Matrices 
Two matrices A = (a,,) and B = (b,,) are equal if they have the same size and the corresponding 
elements are equal: 

A=B, when a,,—b,, for p=1,...,m;v=1,...,n. (4.20) 
2. Addition and Subtraction 
Matrices can be added or subtracted only if they have the same size. The sum/difference of two matrices 
is done by adding/subtracting the corresponding elements: 

AB = (aw) = (bw) = (Qu E bw). (4.214) 


i «5:7 3-50 4-27 
i (aia) tG 1250 qu). 
For the addition of matrices the commutative law and the associative law are valid: 


a) Commutative Law: A+B=B+A. (4.21b) 
b) Associative Law: (A+ B)+C=A+(B+C). (4.21c) 














3. Multiplication of a Matrix by a Number 
A matrix A of size (m, n) is multiplied by a real or complex number a by multiplying every element of 
A by a: 

aA = a (ayy) = (Ady). (4.22a) 


1 37 3 921 
" 3(ο -1 i) = Lo —3 Ti 
From (4.22a) it is obvious that one can factor out a constant multiplier contained by every element of 


a matrix. For the multiplication of a matrix by a scalar the commutative, associative and distributive 
laws for multiplication are valid: 








a) Commutative Law: aA = Aa; (4.22b) 
b) Associative Law: a(f8A) = (a3)A; (4.22c) 
c) Distributive Law: (at B)A=aA+ 6A; a(A+B)=aA+aB. (4.22d) 














4. Division of a Matrix by a Number 

The division of a matrix by a scalar y Z 0 is the same as multiplication by a = 1/7. 

5. Multiplication of Two Matrices 

1. The Product AB of two matrices A and B can be calculated only if the number of columns of 
the factor A on the left-hand side is equal to the number of rows of the factor B on the right-hand side. 
If A is a matrix of size (m, n), then the matrix B must have size (n, p), and the product A B is a matrix 
C — (cy) of size (m, p). The element c, is equal to the scalar product of the j-th row of the factor A 
on the left with the A-th column of the factor B on the right: 

















AB = (X ambn) = (cpa) ed (A2 1,2,...)m; A—1,2,...,2). (4.23) 
v=1 
T 3 T 3 9 
WA-—-|2-141/||,B-^| -—5/|1|]|. Theelement co» ofthe product matrix C in accordance with 
=7 0 1 0 13 











(4.23) is cg =2-2—-1-144-3=15. 
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2. Inequality of Matrix Products Even if both products A B and B A exist, usually AB ABA, 
i.e., in general the commutative law for multiplication is not valid. If the equality A B — B A holds, 
then one says that the matrices A and B are commutable or commute with each other. 

3. Falk Scheme Multiplication of matrices A B — C can be performed using the Falk scheme (Fig. 
4.1). The element c, of the product matrix C appears exactly at the intersection point of the p-th 
row of A with the A-th column of B. 

WB Multiplication of the matrices A(s,3; and B(,5) is shown in Fig. 4.2 using the Falk scheme. 
































Ρ 3 |2 
5 |1|B 
n” | B 0 [3 
πο 137 | -12 26 
m A |AB A 2434 11 [15] AB 
101/23 1 
Figure 4.1 Figure 4.2 


4. Multiplication of the Matrices K; and K, with Complex Elements For multiplication of 
two matrices with complex elements can be used their decompositions into real and imaginary parts 
according to (4.2b): K, = Ay +iB,, Ky = As + iB2. Here A1, A2, B1, B2 are real matrices. After 
this decomposition, the multiplication results in a sum of matrices whose terms are products of real 
matrices. 

E (A+ iB)(A — iB) = A? + B? + i(BA — AB) (Powers of Matrices see 4.1.5,8., p. 276). Of course 
when multiplying these matrices it must be considered that the commutative law for multiplication is 
not valid in general, i.e., the matrices A and B do not usually commute with each other. 





6. Scalar and Dyadic Product of Two Vectors 

If the vectors a and b are considered as one-row and one-column matrices, respectively, then there are 
two possibilities to multiply them according to the rules of matrix multiplication: 

If a has size (1, 1) and b has size (n, 1) then their product has size (1, 1), i.e. it is a number. It is called 
the scalar product of two vectors. If conversely, a has size (n, 1) and b has size (1, m), then the product 
has size (n, m), i.e., it is a matrix. This matrix is called the dyadic product of the two vectors. 


1. Scalar Product of Two Vectors "The scalar product of a row vector aT — (a4, a5,..., a4) with 
a column vector b = (by, be,..., bn)? — both having n elements — is defined as the number 
n 
alb — bla — abı + azb + -+ © + anbn = 5 ab. (4.24) 
μ-1 


The commutative law for multiplication is not valid for a product of vectors in general, so one must 
keep the exact order of a and b. If the order of multiplication is reversed, then the product ba? is a 
dyadic product. 

2. Dyadic Product or Tensor Product of Two Vectors The dyadic product of a column vector 
a — (a1, 02, ..., Q4)? of dimension n with a row vector bT = (b1, b2, . . . , bm) of dimension m is defined 
as the following matrix: 


αιδι 10» sons αιδῦπι 
azbı azb +++ Agbm 

abr = ae : 2 * (4.25) 
ath PEN ues ane 


of size (n,m). Also here the commutative law for multiplication is not valid in general. 


3. Hints on the Notion of Vector Products of Two Vectors In the domain of multi-vectors 
or alternating tensors there is a so-called outer product whose three-dimensional version is the well- 
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known vector product or cross product (see 3.5.1.5, 2., p. 184 ff). In this book the outer product of 
multi-vectors of higher rank is not discussed. 


7. Rank ofa Matrix 


1. Definition In a matrix A the maximum number r of linearly independent column vectors is equal 
to the maximum number of linearly independent row vectors. This number r is called the rank of the 
matriz and it is denoted by rank (A) = r. 

2. Statements about the Rank of a Matrix 

a) Because in a vector space of dimension m there exist no more than m linearly independent m- 
dimensional row or column vectors (see 5.3.8.2, p. 366), the rank r of a matrix A of size (m, n) cannot 
be greater, than the smaller of m and n: 


rank (A(5,)) — r € min (m, n). (4-26) 
b) A square matrix A, is called a regular matriz if 
rank (Anm) =r =n. (4.26b) 


A square matrix of size (n, n) is regular if and only if its determinant differs from zero, i.e., det A Æ 0 
(see 4.2.2, 3., p. 279). Otherwise it is a singular matriz. 
c) Consequently for the rank of a singular square matrix A(p,n), ie., det A = 0 

rank(A(5)) —r «n (4.26c) 
is valid. 
d) The rank of the zero matrix 0 is equal to zero: 


rank (0) = r = 0. (4.26d) 
e) The rank of the sum and product of matrices satisfies the relations 

[rank(A) — rank(B)| € rank(A + B) € rank(A) 4 rank(B), (4.260) 

rank(AB) X min(rank(A), rank(B)). (4.26f) 


3. Rules to Determine the Rank Elementary transformations do not change the rank of matrices. 
Elementary transformations in this relation are: 


a) Interchanging two columns or two rows. 
b) Multiplication of a row or column by a number. 
c) Addition of a row to another row or a column to an other column. 


In order to determine their ranks every matrix can be transformed by appropriate linear combinations 
of rows into a form such that in the j-th row (w = 2,3,..., m), at least the first j|. — 1 elements are equal 
to zero (the principle of Gauss algorithm, see 4.5.2.4, p. 312). The number of row vectors different from 
the zero vector in the transformed matrix is equal to the rank r of the matrix. 


8. Inverse Matrix 
For a regular matrix A — (a,,) there is always an inverse matriz A! (with respect to multiplication), 
i.e., the multiplication of a matrix by its inverse yields the identity matrix: 


AA C-AA-I (4.2772) 
The elements of A^* — (5,,) are 

c Avy 4 

Bu = Bx (4.27b) 


where A,,, is the cofactor belonging to the a,, element of the matrix A (see 4.2.1, 1., p. 278). For a 
practical calculation of A^! the method given in 4.2.2, 2., p. 278 should be used. In the case of a matrix 
of size (2, 2) holds: 


fab -1_ 1 d —b : 
a= (ca) a -uox C 2) (228) 
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Remark: Why not define division among matrices but instead use the inverse for calculations? This 
is connected to the fact that division cannot be defined uniquely. The solutions of the equations 

BX,=A (B isi} Xı = BA 

regular), 

XB = A . Xz = AB- 
are in general different. 
9. Orthogonal Matrices 
If the relation 

ΑἹ ΑΓ ο AAT=ATA=I (4.30) 
holds for a square matrix A, then it is called an orthogonal matrix, i.e., the scalar product of a row and 
the transpose of another one, or the scalar product of the transpose of a column and another one are 
zero, while the scalar product of a row with its own transpose or of the transpose of a column with itself 
are equal to one. 
Orthogonal matrices have the following properties: 


(4.29) 


a) The transpose and the inverse of an orthogonal matrix A are also orthogonal; furthermore, the 
determinant is 


det A = +1. (4.31) 
b) Products of orthogonal matrices are also orthogonal. 





E The rotation matrix D, which is used to describe the rotation of a coordinate system, and whose 
elements are the direction cosines of the new direction of axes (see 3.5.3.3,2. p. 212), is also an orthogonal 
matrix. 


10. Unitary Matrix 


If for a matrix A with complex elements 
(A*)T AC Or A(A*)T 2 (A*)TA —I (4.32) 


holds it is called a unitary matriz. In the real case unitary and orthogonal matrices are the same. 


4.1.5 Rules of Calculation for Matrices 

The following rules are valid of course only in the case when the operations can be performed, for in- 
stance the identity matrix I always has a size corresponding to the requirements of the given operation. 
1. Multiplication of a Matrix by the Identity Matrix 

is also called the identical transformation: 

AI=IA=A. (4.33) 
(This does not mean that the commutative law is valid in general, because the sizes of the matrix I on 
the left- and on the right-hand side may be different.) 

2. Multiplication of a Square Matrix A by a Scalar Matrix S 
or by the identity matrix I is commutative 


AS=SA=cA with S givenin (4.8), (4.340) AI=IA=A. (4.34b) 
3. Multiplication of a Matrix A by the Zero Matrix 0 


results in the zero matrix: 

A0—0, 0A — O0. (4.35) 
(The zero matrices above may have different sizes.) The converse statement is not true in general, i.e., 
from AB = O0 it does not follow that A — 0 or B — 0. 
4. Vanishing Product of Two Matrices 


The product of two matrices A and B can be the zero matrix even if neither of 
them is a zero matrix: 


AB=0 or BA=0 or both, although A Z0, BZz O. (4.36) 


11 
0 0 
BENE RN 
0 1/0 0 
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5. Multiplication of Three Matrices 





(A B)C — A (BC) (4.37 
i.e., the associative law of multiplication is valid. 
6. Transposing of a Sum or a Product of Two Matrices 
(A -- B)T — AT 4 BT, (AB)T — B7 AT, (AT)T — A. (4.38a 
For square invertible matrices A (nn): 
(AT)! — (A71)T (4.38b 
holds. 
7. Inverse of a Product of Two Matrices 
(AB) !-B^1A-^, (4.39 
8. Powers of Matrices 
AP— AA...A with p » 0, integer, (4.40a 
gri re. 
p factors 
A9?—I  (detA Z0), (4.40b 
AC —(AC? (p>0, integer; det A ¥ 0), (4.40c 
AP*— APA (p,q integer). (4.40d 


9. Kronecker Product 
The Kronecker product of two matrices A = (apn) of the type (m, n) and B = (bpv) of the type (p, r) is 
defined as the rule 

A 8 B= (aw B). (4.41) 
The result is a new matrix of type (m - p, n- r), arising from the multiplication of every element of A 
by the matrix B. 


3-50 13 
πλ-(} 1 3) of type (2,3), B = (5.7, of type (2,2). 


»( 2) GÀ)» y 3 9 —-5 -150 0 
2-1 2-1 2-1 6-3 -10 5 0 0 
ΑΦΒ-], γι 3 1.(2 3 3.(1i 31] 1260 1 3 39] 
πλ τα 2-1 2-1 4=2 2 =i 6-=3 
gives a matrix of type (4,6) . 
For the transpose and the trace are valid the equalities: 
(A & B)T — AT & BT, (4.42) 
Tr(A ® B) = Tr(A)-Tr(B). (4.43) 


10. Differentiation of a Matrix 
Ifa matrix A = A(t) = (a,,(t)) has differentiable elements a,,,(¢) of a parameter t then its derivative 
with respect to f is given as 


dA — (249) — (a^, (0). (4.44) 





dt dt μν 


4.1.6 Vector and Matrix Norms 


The norm of a vector or of a matrix can be considered as a generalization of the absolute value (mag- 
nitude) of numbers. Therefore a real number is assigned as ||v|| (Norm x) to the vector x or as ||A || 
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(Norm A) to a matrix A. These numbers must satisfy the norm axioms (see 12.3.1.1, p. 669). For 
vectors x € R^ they are: 


1. ||x|| > 0 for every x; ||xj|| 2 0 if and only if x = 0. (4.45) 


2. ||Ax|| = |A] ||x|| for every x and every real number A. (4.46) 


3. ||x+ yl] < ||xI| + |ly|| for every x and y (triangle inequality) (see also 3.5.1.1, 1., p. 182). (4.47) 
There are many different ways to define norms for vectors and matrices. But for practical reasons it is 
better to define a matrix norm ||A|| and a vector norm ||x|| so that they might satisfy the inequality 
Ακ «ΙΑ [Ix]. (4.48) 
This inequality is very useful for error estimations. If the matrix and vector norms satisfy this inequality, 
then one says that they are consistent with each other. If there is a non-zero vector x for every A such 
that the equality holds in (4.48), then one says the matriz norm || A]|| is the subordinate to the vector 
norm ||x| 


4.1.6.1 Vector Norms 


If x — (21, 22,..., 2,)T is a real vector of n dimensions, i.e., x € IR^ , then the most often used vector 
norms are: 





1. Euclidean Norm 


[κ||-- [[κ||2:-- |»; αἵ. (4.49) 
i=l 


2. Supremum or Uniform Norm 
[|x|] = |[X|]oo == max|-r;). (4.50) 
1<i<n 


3. Sum Norm 


Ix] — [xls :— 7 Exil. (4.51) 
i=1 


Bl In Γι), in elementary vector calculus ||x|| is considered as the magnitude of the vector x. The 
magnitude |x| = ||x||2 gives the length of the vector x. 


4.1.6.2. Matrix Norms 
1. Spectral Norm for Real Matrices 


Al] = |[Al]2 :— y Ama (ATA). (4.52) 


Here Ana (AT A) denotes the greatest eigenvalue (see 4.6.1, p. 314) of the matrix ATA. 
2. Row-Sum Norm 


ΑΙ -- ΙΑ] := maz iil. 4.53 
IAI =[1Albo = max X5 fas (4.53) 


3. Column-Sum Norm 


[Al] = ||Alla = m Σ |a;;]. (4.54) 
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It can be proved that the matrix norm (4.52) is the subordinate norm to the vector norm (4.49). The 
same is true for (4.53) and (4.50), and for (4.54) and (4.51). 


4.2 Determinants 
4.2.1 Definitions 


1. Determinants 
Determinants are real or complex numbers uniquely associated with square matrices. The determinant 
of order n associated with the (n, n) matrix A = (aw), 


Q11 412 °** Gin 
a21 022 *** Q2 

D = det A = det (ap) =| >.. i (4.55) 
n1 Qn2 *** Ann 


is calculated in a recursive way using the Laplace expansion rule: 


det A = Y aj, Ay, (ys fixed, expansion along the j-th row), (4.56a) 
ν-1 

det A= SO ayy Ay (v fixed, expansion along the v-th column). (4.56b) 
μι 


Here A,, is the subdeterminant belonging to the element a, multiplied by the sign factor (—1)^*". 

Ayr is called the cofactor or algebraic complement. 

2. Subdeterminants 

The subdeterminant of order (n — 1) belonging to the element a,,, of a determinant of order n is the 

determinant obtained by deleting the j-th row and the v-th column. 

W Expansion of a determinant of order four along the third row: 
Q41 012 013 014 
a21 Q22 Q23 Q24 
Q31 032 033 (134 
Q41 Q42 Q43 Q44 


Q11 012 013 
021 022 023 
Q41 G42 G43 


Q11 Q12 Q14 
021 022 024 
Q41 42 G44 


Q11 413 Q14 
21 023 424 
Q41 Q43 A44 


012 013 G14 
022 023 024 
442 443 G44 


= a31 — a32 + 433 — d34 . 


























4.2.2 Rules of Calculation for Determinants 

Because of the Laplace expansion the following statements about rows are valid also for columns. 

1. Independence of the Value of a Determinant 

The value of a determinant does not depend on which row was chosen. 

2. Substitution of Cofactors 

If during the expansion of a determinant the cofactors of a row are replaced by the cofactors of another 
one, then one gets zero: 


MawAy -0 (u, A fixed; À Z u). (4.57) 


v=1 
This relation and the Laplace expansion result in 

Aaa A = A Aaa = (det A)T. (4.58) 
The adjoint matrix of A, which is the transpose of the matrix made from the cofactors of A, is denoted 
by Aaa. There must not be a confusion of this adjoint matrix with the transposed conjugate of a 
complex matrix AP (see (4.4), p. 269). From the previous equality one gets the inverse matrix 


Ad- 


1 
= Taa A , (4.59) 
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3. Zero Value of a Determinant 
A determinant is equal to zero if 
a) a row contains zero elements only, or 
b) two rows are equal to each other, or 
C) a row is a linear combination of the others. 
4. Changes and Additions 
The value of the determinant does not change if 
a) its rows are exchanged for its columns, i.e., reflection in the main diagonal does not affect the value 
of it: 
det A — det AT, (4.60) 
b) any row is added to or subtracted from another one, or 
c) a multiple of any row is added to or subtracted from another one, or 
d) a linear combination of other rows is added to any row. 
5. Sign on Changing Rows 
If two rows are interchanged in a determinant, then the sign of the determinant changes. 
6. Multiplication of a Determinant by a Number 
The value of a determinant will be multiplied by a if the elements of a row are multiplied by this number. 


The next formula shows the difference between this and the multiplication of a matrix A of size (τε, τι) 
by a number a 


det (aA) = a"det A. (4.61) 


7. Multiplication of Two Determinants 
The multiplication of two determinants can be reduced to the multiplication of their matrices: 


(det A)(det B) = det (AB). (4.62) 
Since det A — det AT (see (4.60)), we have the equalities 
(det A)(det B) 7 det (AB) — det (ABT) — det (ATB) — det (ATBT), (4.63) 


i.e., it is permissible to take the scalar product of rows with columns, rows with rows, columns with 
rows or columns with columns. 

8. Differentiation of a Determinant 

Suppose the elements of a determinant of order n are differentiable functions of a parameter t, i.e., 
Quy = p(t) . In order to differentiate the determinant with respect to t, one differentiates one row at 
one time and finally one adds the n determinants. 


W For a determinant of size (3, 3) follows: 























P 1 / 
Q11 012 013 G1; 019 013 011 012 13 Q11 0312 413 
/ 1 , 
dt a21 022 Q23 | = | 021 022 423 | T | 091 055 053 | + | Q21 022 023 | . 
$ / / 
031 032 033 031 032 033 a31 432 Q33 Q31 035 033 


4.2.3 Evaluation of Determinants 


1. Value ofa Determinant of Second Order 


411 012 
a21 022 








= 011022 — 021012. (4.64) 


2. Value of a Determinant of Third Order 
The Sarrus rule gives a convenient scheme for the calculations, but it is valid only for determinants of 
order three. It is the following: 
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a21 daa. deal Ari. 029 — 011022033 c 012023031 + 013021032 
a31 Q32 @33| a31 32 





a Soa te ai ae 


— (3122043 d- 4320423411 + 433421412). (4.65) 
The first two columns are copied after the determinant, then the sum of the products of the elements 


along the redrawn declining segments is calculated, then the sum of the products of the elements along 
the dotted inclining segments is subtracted. 


3. Value ofa Determinant of n-th Order 

By the expansion rule the calculation of the value of a determinant of order n is reduced to the evalu- 
ation of n determinants of order (n — 1). But for practical reasons (to reduce the number of required 
operations), first one transforms the determinant with the help of the rules discussed above into a form 
such that it contains as many zeros as possible. 























2994 |2 59 4 2 53 4 
m 2-312 8 2-75 s ..2-74 8|-&| sr-s|-elai.s 
4 83-—5| 4 03-—5| "|4 01-5 “hie a ia d 
126 4| |1 06 4 102 4 Lael 
(rule 4) (rule 6) = 0 (rule 3) 
11 0 - 
—-2141-—5 --2(51 -|i i) = 147. 
12 4 
(rule 4) 


Remark: An especially efficient method to determine the value of a determinant of order n can be 
obtained by transforming it in the same way as it is done in order to determine the rank of a matrix 
(see 4.1.4, 7., p. 274), i.e., all the elements under the diagonal a11, a22, - . . , Ann are equal to zero. Then 
the value of the determinant is the product of the diagonal elements of the transformed determinant. 


4.3 Tensors 


4.3.1 Transformation of Coordinate Systems 


1. Linear Transformation 
By the linear transformation 


11 — Q1124 + 01222 4- 01333 
— Ax Or 19 — Aq Xz 4- 02312 + A2313 (4.66) 
74 = 3111 + 03232 + 03333 


[αι 


a coordinate transformation is defined in the three-dimensional space. Here x, and %,, (4s = 1, 2,3) are 
the coordinates of the same point but in different coordinate systems K and K . 

2. Einstein's Summation Convention 

Instead of (4.66) one can write 


3 
T= M arx, — (n—1,2,3) (4.672) 
ν-1 


or briefly by Einstein 

Ly = Guy Ly, (4.67b) 
i.e., it is to calculate the sum with respect to the repeated index v and put down the result for j, — 1, 2,3. 
In general, the summation convention means that if an index appears twice in an expression, then the 
expression is added for all values of this index. If an index appears only once in the expressions of an 
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equation, for instance jz in (4.67b), then it means that the equality is valid for all possible values of this 
index. 


3. Rotation of a Coordinate System 
If the Cartesian coordinate system K is given by rotation of the system K, then for the transformation 
matrix in (4.66) A = D is valid. Here D = (d,,,) is the orthogonal rotation matrix. The orthogona 
rotation matrix D has the property 

D” =D". (4.68a 
The elements d,,, of D are the direction cosines of the angles between the old and new coordinate axes. 
From the orthogonality of D, i.e., from 

DD'=I and  DTD-I, (4.68b 
it follows that 


3 3 
Y dada = Suv, Y drudiv: = ὃμν (p, v — 1,2,3). (4.68c 
i-l k-l 
The equalities in (4.68c) show that the row and column vectors of the matrix D are orthonormalized, 
because ὄμν is the Kronecker symbol (see 4.1.2, 10., p. 271). 
The elements d, of the rotation matrix can be determined by the Cardan angles (see 3.5.3.5, p. 214 
or Euler angles (see 3.5.3.6, p. 215). For rotation in the plane see 3.5.2.2,2., p. 191; in space see 3.5.3.3, 
p. 213. 





4.3.2 Tensors in Cartesian Coordinates 


1. Definition 

A mathematical or a physical quantity T can be described in a Cartesian coordinate system K by 3” 
elements tjj..,5,, the so-called translation invariants. Here the number of indices i, j, . . . , m is exactly 
equal to n (n 2 0). The indices are ordered, and every of them takes the values 1,2 and 3. 


If under a coordinate transformation from K to K for the elements tij..m according to (4.66) 


a3 3 
linens Dr ee 5 Gjat * * * Gpmtij--m » (4.69) 


i-1j-1 πιεῖ 


is valid, then T' is called a tensor of rank n, and the elements tj;...m (mostly numbers) with ordered 
indices are the components of the tensor T. 


2. 'Tensor of Rank 0 


A tensor of rank zero has only one component, i.e., it is a scalar. Because its value is the same in every 
coordinate system, one talks about the invariance of scalars or about an invariant scalar. 


3. Tensor of Rank 1 
A tensor of rank 1 has three components t4, t9 and ta. The transformation law (4.69) is now 


3 
μπι αμ (uw = 1,2,3). (4.70) 
i=1 


It is the transformation law for vectors, i.e., a vector is a tensor of rank 1. 
4. Tensor of Rank 2 


If n — 2, then the tensor T has nine components t;;, which can be arranged in a matrix 


ti tia tis 
T =T= {91 t22 tog š (4.71a) 


t31 032 t33 
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The transformation law (4.70) is now: 


3 3 
tu = ΣΣ Quidyjlij (p, v — 1,2,3). (4.71b) 
i=l j=l 


So, a tensor of rank 2 can be represented as a matrix. 
E A: The moment of inertia O, of a solid with respect to the line g, which goes through the origin 
and has direction vector 4 = a’, can be represented in the form 


Oz =O Ozz 
Θ,-- α.Θα (4.722) with O —-(05)-|-O€, y -Oy |, (4.72b) 
—O;. -Oy. O, 
the so-called inertia tensor. Here Oz, Oy and ©, are the moments of inertia with respect to the coordi- 
nate axes, and Oy, Oz, and Oy, are the deviation moments with respect to the coordinate axes. 
E B: The load-up conditions of an elastically deformed body can be given by the tension tensor 


011 012 913 
g — 021 022 093 š (4.73) 
031 032 033 

The elements oj, (i, k = 1,2,3) are determined in the following way: At a point P of the elastic body 
a small plane surface element is chosen whose normal vector points to the direction of the x-axis of 
a right-angle Cartesian coordinate system. The power per surface unit on this element, depending on 
the material, is a vector with coordinates 041, 012 and 013. The other components can be explained 
similarly. 
5. Rules of Calculation 
1. Elementary Algebraic Operations The multiplication of a tensor by a number, and addition 
and subtraction of tensors of the same rank are defined componentwise, similarly to the corresponding 
operations for vectors and matrices. 
2. Tensor Product Suppose there are given a tensor A of rank m and a tensor B. of rank n with 
components aj;... and b,,... respectively. Then the 3"**" scalars 

Cij--rs = αι Ups... (4.74a) 
give the components of a tensor C of rank m+n. It is denoted by C = AB and it is called the tensor 
product of A and B. The associative and distributive laws are valid: 

(AB)C=A(BC), A(B+C)=AB+AC. (4.74b) 


3. Dyadic Product The product of two tensors of rank 1A = (aj, a@2,a3) and B = (by, ba, bs) 
gives a tensor of rank 2 with the elements 


Gj =b; — (5j—1,2,3), (4.752) 
i.e., the tensor product results in the matrix 
αιὖι aiba αιὖα 
αοδι abe agb3 3 (4.75b) 
ag3by, azb a3b3 


This will be denoted as the dyadic product of the two vectors A and B. 


4. Contraction Setting two indices equal to each other in a tensor of rank m (m > 2), and summing 
with respect to them, then one gets a tensor of rank m — 2, which is called the contraction of the tensor. 
W The tensor C of rank 2 of (4.75a) with c;; — ajb; , which is the tensor product of the vectors A = 
(a1, a2, 43) and B — (by, bg, bs), can be contracted by the indices i and 7, 


aibi = a,b, + agbe + a3b3 (4.76) 
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giving a scalar, which is a tensor of rank 0. This gives the scalar product of vectors A and B. 


4.3.3 Tensors with Special Properties 
4.3.3.1 Tensors of Rank 2 


1. Rules of Calculation 


For tensors of rank 2 the same rules are valid as for matrices. In particular, every tensor T' can be 
decomposed into the sum of a symmetric and a skew-symmetric tensor: 


T= (+77) + : (P=2"). (4.77a) 
A tensor T' — (tij) is called symmetric if 

lia for all i and j (4.77b) 
holds. In the case 

lij 2 —tj for all ? and j (4.77c) 


it is called skew- or antisymmetric. Obviously the elements t1 , 133 and t33 of a skew-symmetric tensor 
are equal to zero. The notion of symmetry and antisymmetry can be extended for tensors of higher 
rank if refering to certain pairs of elements. 

2. Transformation of Principal Axes 

For a symmetric tensor T, i.e., if tu = ty, holds, there is always an orthogonal transformation D such 
that after the transformation the tensor has a diagonal form: 


- iu 0 0 
T =| 0 i» 0 |. (4.78a) 
0 0 t33 


The elements f;; , t22 and £33 are called the eigenvalues of the tensor T. They are equal to the roots A1, 
Àa and As of the algebraic equation of third degree in A: 
Πλ ti tis 

91 99 -- λ tog =U, (4.78b) 

t31 t32 t33 — A 
The column vectors d; , dy and dg of the transformation matrix D are called the eigenvectors corre- 
sponding to the eigenvalues, and they satisfy the equations 

Td, — Xd, (v = 1,2,3). (4.78c) 

Their directions are called the directions of the principal axes, and the transformation T to diagonal 
form is called the transformation of the principal axes. 


4.3.3.2 Invariant Tensors 
1. Definition 


A Cartesian tensor is called invariant if its components are the same in all Cartesian coordinate sys- 
tems. Physical quantities such as scalars and vectors, which are special tensors, do not depend on the 
coordinate system in which they are determined; they must not change their value either under trans- 
lation of the origin or rotation of a coordinate system K. One talks about translation invariance and 
about rotation invariance or in general about transformation invariance. 


2. Generalized Kronecker Delta or Delta Tensor 
If the elements t;; of a tensor of rank 2 are the Kronecker symbols, i.e., 
1 fori-—j, 


tig = Oy = { 0 otherwise, (4.792) 
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then from the transformation law (4.71b) in the case of a rotation of the coordinate system considering 
(4.68c) follows 


tay = ἁμιάν] Ξ- ὃμν, (4.79b) 
i.e., the elements are rotation invariant. Putting them into a coordinate system so that they are inde- 
pendent of the choice of the origin, i.e., they will be translation invariant, then the numbers 5;; form a 
tensor of rank 2, the so-called generalized Kronecker delta or delta tensor. 
3. Alternating Tensor 
If &j, €; and €j are unit vectors in the directions of the axes of a right-angle coordinate system, then for 
the mixed product (see 3.5.1.6, 2., p. 185) holds 
l, ifüj,k cyclic (right-hand rule), 
tk — &i (€; x &) — 4 —1l, ifi, j,k anticyclic, (4.80a) 
0, otherwise. 
Altogether there are 3? = 27 elements, which are the elements of a tensor of rank 3. In the case of a 
rotation of the coordinate system from the transformation law (4.69) it follows that 
dy dvi dor 
dio (να do 
ἆμα ΠΡ (ρα 


= wp: (4.80b) 


tuvp = dyidyjdpkeijk = 








i.e., the elements are rotation invariant. Putting them into a coordinate system so that they are in- 
dependent of the choice of the origin, i.e., they are translation invariant, then the numbers cjj; form a 
tensor of rank 3, the so-called alternating tensor. 

4. Tensor Invariants 

There must not be a confusion between tensor invariants and invariant tensors. Tensor invariants are 
functions of the components of tensors whose forms and values do not change during the rotation of 
the coordinate system. 


ΒΑ: If for instance the tensor T = (t;;) is transformed in T = (y) by a rotation, then the trace 
(spur) of it does not change: 

Tr(T) — ti + t22 + tas = tr + tro + - (4.81) 
The trace of the tensor T is equal to the sum of the eigenvalues (see 4.1.2, 4., p. 270). 
E B: For the determinant of the tensor T = (t;;) 


ti tia tis iu lo ha 
t21 022 (23 | — | t21 t22 t23 (4.82) 
t31 t32 t33 t31 t32 t33 








is valid. The determinant of the tensor is equal to the product of the eigenvalues. 


4.3.4 Tensors in Curvilinear Coordinate Systems 

4.3.4.1 Covariant and Contravariant Basis Vectors 

1. Covariant Basis 

By the help of the variable position vector are introduced the general curvilinear coordinates u, v, w: 
T= T (u,v, w) = z(u,v, w)&; -- y(u, v, w)&, 4- z(u, v, w)e;. (4.83a) 

The coordinate surfaces corresponding to this system can be got by fixing the independent variables 

u,v, w in F(u, v, w), one at a time. There are three coordinate surfaces passing through every point of 

the considered region of space, and any two of them intersect each other in a coordinate line, and of 

course these curves pass through the considered point, too. The three vectors 


Or Or Or 


ο... (4.83b) 
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point along the directions of the coordinate lines in the considered point. They form the covariant basis 
of the curvilinear coordinate system. 


2. Contravariant Basis 


The three vectors 


1 ar Or 1. ὃτ η or 1 Ir or (4.844) 
DiOv dw)’ D\dw” du}? D\du~ dv - 


with the functional determinant (Jacobian determinant see 2.18.2.6,3., p. 123) 





D(x, y,z Vy Vy Tw 
D= Demne) Yu Yo Yo (4.84b) 
D(u, v, w) 2. £z 








are always perpendicular to the coordinate surfaces at the considered surface element and they form 
the so-called contravariant basis of the curvilinear coordinate system. 


Remark: In the case of orthogonal curvilinear coordinates, i.e., if 














ty tre ty: ái s E dA ο. - a 
D(x, y, z 11 t12 t13 tu t12 t3 | gF gr Or 8r or Or 
P = | t21 t22 t23 | = | t21 t22 t23 Bu de 0, OI E 0, ms mo 0, (4.85) 
ae t31 t32 t33 tz1 t32 t33 








then the directions of the covariant and contravariant basis are coincident. 


4.3.4.2 Covariant and Contravariant Coordinates of Tensors of Rank 1 


In order to be able to apply the summation convention of Einstein the following notation is introduced 
for the covariant and contravariant basis: 











Or ., 987 Or οπή 
δα gi v 92; D g3 anı NT 
1 oF OF -. 1 Br ΠΡ ag d or | Or ud (4.86) 
D\dv Ow I> Daw * du 9* piu" 8v E 

Then the following representations hold for v: 
T=V'j +V? +V’ jh =V"ĝ o —Wj4Vg?4 Vig. (4.87) 


The components V* are the contravariant coordinates, the components V; are the covariant coordinates 
of the vector v. For these coordinates the equalities 


V*—g"VW and | W- guV! (4.883) 


are valid, where 


Ikl = Jik = Fk ` Ju and g” =g" =9*-g' (4.880) 
respectively. Furthermore using the Kronecker symbol the equality 

k- F’ = δει, (4.89a) 
holds, and consequently 

9" Jim = Srm- (4.89b) 


The transition from V* to V, or from V; to V* according to (4.88b) is described by raising or lowering 
the indices by oversliding. 


Remark: In Cartesian coordinate systems covariant and contravariant coordinates are equal to each 
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other. 


4.3.4.3 Covariant, Contravariant and Mixed Coordinates of Tensors of 
Rank 2 
1. Coordinate Transformation 


In a Cartesian coordinate system with basis vectors e; , €? and €3 a tensor T of rank 2 can be represented 
as a matrix 


ti tie tis 
T = | tar ta tog |. (4.90) 
t31 t32 t33 
To introduce curvilinear coordinates u4, U2, u3 the following vector is used: 
T = © 1 (Uy, Ug, Ug)e1 + zo(u3, ua, Ug )E2 + V3 (U1, Us, U3) e3 - (4.91) 
The new basis is denoted by the vectors gi, J2 and ga. Now it holds: 
or Ox, επι, ὅσα,  ὅσες 











9 Om Ou n Tu "E Qui Qui Sh (£92) 
Substituting ē& = ğ' , then follows g; and g' as covariant and contravariant basis vectors. 
2. Linear Vector Function 
In a fixed coordinate system with the tensor T given as in (4.90) by the equality 

Ü =T (4.93a) 
the following vector representations 

T= VJ" =V" fk, T= Weg" = W" Ẹk (4.93b) 


define a linear relation between the vectors & and 1). So (4.93a) is to be considered as a linear vector 
function. 


3. Mixed Coordinates 
Changing the coordinate system, the equality (4.93a) will have the form 
ὦ ΤΟ. (4.944 
The relation between the components of T and T is the following: 
E Quy Otn 
tu= Or, Ou’ 
Um, l 
Introducing the notation 
fy — T*, (4.94c 
one talks about mixed coordinates of the tensor; k contravariant index, | covariant index. For the com- 
ponents of vectors U and w holds 
w= TEV. (4.944 
ὦ the yu iant basis gj, is replaced by the contravariant basis g*, then one gets similarly to (4.94b) and 
4.946 





(4.94b 


Ota, Ou ! 

Quy Óx, ^^ 

and (4.94d) is transformed into 
W, — T;'V. (4.95b 

For the mixed coordinates 7; and T* holds the formula 


TE sg aT. (4.95c 


Tq (4.95a 
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4. Pure Covariant and Pure Contravariant Coordinates 
Substituting in (4.95b) for V; the relation Vi = gimV™, then one gets 


We = Ty gim V” = Tem V”, (4.96a) 
also considering that 
Tk! gum = Thm - (4.96b) 


The Tym are called the covariant coordinates of the tensor T, because both indices are covariant. Sim- 
ilarly one gets the contravariant coordinates 





Tem 2 gnipk. (4.97) 
The explicit forms are: 
OLm Org Qu, Ou 
Tg = tans 4.982 TH —* ος 4.981 
E Qui, Qui ( a) OXm IEn ( i 


4.3.4.4 Rules of Calculation 


In addition to the rules described on 4.3.2, 5., p. 283, the following rules of calculations are valid: 

1. Addition, Subtraction Tensors of the same rank whose corresponding indices are both covariant 
or contravariant can be added or subtracted elementwise, and the result is a tensor of the same rank. 
2. Multiplication The multiplication of the coordinates of a tensor of rank n by the coordinates of 
a tensor of rank m results in a tensor of rank m+n. 

3. Contraction If making the indices of a covariant and a contravariant coordinates of a tensor of 
rank n (n 7 2) equal, then can be used the Einstein summation convention for this index, and one gets 
a tensor of rank n — 2. This operation is called contraction. 

4. Oversliding Oversliding of two tensors is the following operation: Multiply both, then making a 
contraction so that the indices by which the contraction is made belong to different factors. 

5. Symmetry A tensor is called symmetric with respect to two covariant or two contravariant stand- 
ing indices if when exchanging them the tensor does not change. 

6. Skew-Symmety A tensor is called skew-symmetric with respect to two covariant or two con- 
travariant standing indices if when exchanging them the tensor is multiplied by —1. 

W The alternating tensor (see 4.3.3.2, 3.,p. 284) is skew-symmetric with respect to two arbitrary co- 
variant or contravariant indices. 


4.3.5 Pseudotensors 


The reflection of a tensor plays a special role in physics. Because of their different behavior with respect 
to reflection polar and axial vectors are distinguished (see 3.5.1.1, 2., p. 181), although mathematically 
they can be handled in the same way. Axial and polar vectors differ from each other in their determi- 
nation, because axial vectors can be represented by an orientation in addition to length and direction. 
Axial vectors are also called pseudovectors. Since vectors can be considered as tensors, the general 
notion of pseudotensors is introduced. 


4.3.5.1 Symmetry with Respect to the Origin 


1. Behavior of Tensors under Space Inversion 
1. Notion of Space Inversion The reflection of the position coordinates of points in space with 
respect to the origin is called space inversion or coordinate inversion. In a three-dimensional Cartesian 
coordinate system space inversion means the change of the sign of the coordinates: 

(x,y,z) > (=x, =y, —2). (4.99) 
By this a right-hand coordinate system becomes a left-hand system. Similar rules are valid for other 
coordinate systems. In the spherical coordinate system holds: 

(r, 9,9) — (—r,* — 9, 4- 7). (4.100) 
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Under this type of reflection the length of the vectors and the angles between them do not change. The 
transition can be given by a linear transformation. 

2. Transformation Matrix According to (4.66), the transformation matrix A = (a,,) of a linear 
transformation of three-dimensional space has the following properties in the case of space inversion: 


αμν - --ὃμν. detA — —1. (4.101a) 
For the components of a tensor of rank n (4.69) 
iu i uso (4.101b) 


holds. That is: In the case of point symmetry with respect to the origin a tensor of rank 0 remains a 
scalar, unchanged; a tensor of rank 1 remains a vector with a change of sign; a tensor of 
rank 2 remains unchanged, etc. 


z M z ae 
; 180 , D q 
Y t.i | 
} [ € r^ ` | 
ο t ΄ 


Figure 4.3 





2. Geometric Representation 

The inversion of space in a three-dimensional Cartesian coordinate system can be realized in two steps 
(Fig.4.3): 

1. By reflection with respect to the coordinate plane, for instance the x, z plane, the coordinate system 
x,y,z turns into the coordinate system x, —y, z. A right-hand system becomes a left-hand system (see 
3.5.3.1,2., p. 209). 

2. By a rotation of the system x, y, z around the y-axis by 180° we have the complete coordinate system 
x,y,z reflected with respect to the origin. This coordinate system stays left-handed, as it was after the 
first step. 

Conclusion: Space inversion changes the orientation of a polar vector by 180°, while an axial vector 
keeps its orientation. 


4.3.5.2 Introduction to the Notion of Pseudotensors 


1. Vector Product under Space Inversion Under space inversion two polar vectors a and b 
are transformed into the vectors —a and — b, i.e., their components satisfy the transformation formula 
(4.101b) for tensors of rank 1. However, if considering the vector product c — a x b as an example 
of an axial vector, then one gets c — c under reflection with respect to the origin. This is a violation 
of the transformation formula (4.101a) for tensors of rank 1. Therefore the axial vector c is called a 
pseudovector or generally a pseudotensor. 

W The vector products r x v, 7 x Ë, V x v= rot ð with the position vector f, the speed vector v, the 
power vector F and the nabla operator V are examples of axial vectors, which have “false” behavior 
under reflection. 

2. Scalar Product under Space Inversion If using space inversion for a scalar product of a 
polar and an axial vector, then again there is a case of violation of the transformation formula (4.101b) 
for tensors of rank 1. Because the result of a scalar product is a scalar, and a scalar should be the 
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same in every coordinate system, here it is a very special scalar, which is called a pseudoscalar. It has 
the property that it changes its sign under space inversion. Pseudoscalars do not have the rotation 
invariance property of scalars. 

W The scalar product of the polar vectors F (position vector) and v (speed vector) by the axial vector 
@ (angular velocity vector) results in the scalars F- & and 7- ð, which have the “false” behavior under 
reflection, so they are pseudoscalars. 

3. Mixed Product under Space Inversion The mixed product (a x b): c (see 3.5.1.6, 2., p. 185) 
of polar vectors a, b, and c is a pseudoscalar according to (2.), because the factor (a x b) is an axial 
vector. The sign of the mixed product changes under space inversion. 

4. Pseudovector and Skew-Symmetric Tensor of Rank 2 The tensor product of axial vectors 
a = (a1, a2,a3)” and b = (b1, b2, b3)” results in a tensor of rank 2 with components t;; = a;b; (i,j = 
1, 2,3) according to (4.74a). Since every tensor of rank 2 can be decomposed into a sum of a symmetric 
and a skew-symmetric tensor of rank 2, according to (4.81) 


1 1 a 

tij = 9 (ajbj + ajbi) + 9 (ab; = ajbi) (i j = 1. 2, 3) (4.102) 
holds. The skew-symmetric part of (4.102) contains exactly the components of the vector product 
(a x b) multiplied by H so the axial vector c = (a x b) with components ci, c», cz can be considered 
as a skew-symmetric tensor of rank 2 

0 cp «8 C23 = a2b3 — a3b2 = c1, 
C=c=|—c2 0 c3 (4.103a) where C31 — aab, — a1b3 — €, (4.103b) 

—c13 —C23 0 C12 — 403 — a2b1 = C3, 


whose components satisfy the transformation formula (4.101b) for tensors of rank 2. 


Consequently every axial vector (pseudovector or pseudotensor of rank 1) c — (c1, c», c3)? can be con- 
sidered as a skew-symmetric tensor C of rank 2: 
0 ὦ -ῶ 
σ-ε-[|-α 0 ει |. (4.104) 
ο --σ 0 


5. Pseudotensors of Rank n The generalization of the notion of pseudoscalar and pseudovector 
is a pseudotensor of rank n. It has the same property under rotation as a tensor of rank n (rotation 
matrix D with det D = 1) but it has a (—1) factor under reflection through the origin. Examples of 
pseudotensors of higher rank can be found in the literature, e.g., [4.1]. 


4.4 Quaternions and Applications 


Quaternions were defined by Sir William Rowan Hamilton in 1843. The basic question which resulted 
the discovering of quaternions was that how could division of vectors in the three dimensional Euclidian 
space be defined. It is possible by embedding them into R^, and introducing the quaternion multipli- 
cation, what leads to the division ring of quaternions. 

Quaternions, like complex numbers, both are special cases of a Clifford-algebra of order n, with 2” 
generalized numbers as basis (see [4.5], [22.22]): 


2^ 
ΑΞ Σ ijaj (ij hyper-complex elements, a; complex numbers). (4.1052) 
151 


The following special cases have practical importance: 
n — 1: 2-dimensional complex numbers with 

i = 1,ip =i (iimaginary unit), à1,a9 (real numbers). (4.105b) 
n — 2: Quaternions as 4-dimensional numbers with hyper-complex elements 

i = 1, iz = İ, i = j, i4 = k (hyper-complex elements), a4,82,03,a4 (real numbers) (4.105c) 
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and the rules of multiplication (4.107). In physics the PAULI’s spin matrices and spinors are represented 
as quaternions. 

n = 3: Biquaternions (s. 4.4.3.6, 1., p. 306) 

n — 4: Clifford-numbers are known in physics as DIRAC-matrices. 

Quaternions are used most often to describe rotations. The advantages of the quaternions are: 

e the rotation is performed directly around the required axis, 

e the gimbal-lock problem does not occur. Gimbal is a pivoted support allowing the rotation around 
a single axis (e.g. gyrocompass), and gimbal lock means that the axes of two of the three gimbals are 
driven into parallel configuration. 

The disadvantage of quaternions is that only rotations can be described with them. To represent trans- 
lations, scaling or projections matrices are needed. This disadvantage can be overcome by biquater- 
nions, by which every motion of rigid bodies can be represented. 

Quaternion are used in computer-graphics, satellite-navigation, in vector analysis, in physics and in 
mechanics. 


4.4.1 Quaternions 
4.4.1.1 Definition and Representation 
1. Imaginary Units 


Quaternions are generalized complex numbers in the form 


w+ ia +jy+kz, (4.106) 
where w, x, y, z are real numbers, and the generalized imaginary units are i, j,k, which satisfy the 
following rules of multiplication: 






































PsP Sk =<, ij=k=-—ji, jk=i=-kj, ki=j ik. (4.107) 
ilj[k| k The multiplication of the generalized units is shown in 
i|[-1| k |-;j| / d the accompanying table of multiplication. Alternatively 
j|-k|-1] i | the multiplication rules can be represented by the cycle 
k| j |- i|-1) i 5j shown in Fig.4.4. Multiplication in the direction of an 
arrow results in a positive sign, opposite to the arrow di- 

Multiplication table Figure 4.4 rection results in a negative sign. 


Consequently, the multiplication is not commutative but associative. The four-dimensional Euclidian 
vector space R4 provided with quaternion-multiplication is denoted by H in honour of R.W. HAMIL- 
TON. Quaternions form an algebra, namely the division ring of quaternions. 

2. Representation of Quaternions 

There are different representation of quaternions: 

e as hyper-complex numbers q = w + ix + jy + kz = qo + q with scalar part qo = Scq and vector part 
q — Vecq, 

e as four dimensional vector q = (w, x, y, z)? = (qo, q)7 consisting of the number w € R and the vector 
(z,y, 2)" € E? 

e in trigonometric form q = r(cosy + ngsiny), where r = |g| = Vw? + x? + y? + 2? is the length of 
p 1 


: . ] w 
the four dimensional vector in Rf, and cos y = —, and nq = lesa] 
2 (yz 


lal 


(x,y, 2)". ng is a unit vector 


in R?, depending on q. 
Remark: The multiplication rules for quaternions differ from the usually introduced rules in R? and 
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RR (see (4.109b), (4.114), (4.115)). 


3. Relation between Hyper-complex Number and Trigonometric Form 


q-— qo - q — || (£ | 2) = |q| (5 | a a = r(cosy + nasin y), (4.108a) 
lal lal la| lal lal a 


if |q| A 0. If|q| = 0, then there is 








q — % = 4ο] (4.108b) 
4. Pure Quaternions 

A pure quaternion has a zero scalar part: go = 0. The set of pure quaternions is denoted by Ho. It is 
often useful to identify the pure quaternions q with the geometric vector q € RË, i.e. 


do — [qo] — [qo] cos for qo > 0, 
lao] [ao] (—1) — |qo| cos s for qo < 0, 


q, if q represents a pure quaternion, 
q, if q is interpreted as a geometric vector. 


q=%+ { (4.109a) 


The multiplication rule for p, q € Ho is 

pa--p:dctpxd, (4.109b) 
where - and x denote the dot-product and cross-product in R?, respectively. The result of (4.109b) is 
to interpret as a quaternion. 
Ov Ov OF s = T 
H Let V = bo + 55 + 3z“ be the nabla-operator (see 13.2.6.1, p. 715), and let V — vi(x, y, z)i ἠ- 

a D z 

valz, Y, EE vs(x, y, z)k be a vector-field. Here i,j, K are unit-vectors being parallel to the coordinate 


axes in a Cartesian coordinate system. If V and V are interpreted as pure quaternions then according 
to (4.107) their product is: 


ον vı vz Qus Y. Ov  OvyV  g(Ovi Ova], k Ove Ov, 
Or Oy Oz | ὃν Oz i. ὃς Ox) | Or Oy) 


This quaternion can be written in vector interpretation: 
Vy — —div V 4 rot V, 





but the result should be considered as a quaternion. 

5. Unit Quaternions 

A quaternion q is a unit quaternion if |g| = 1. The set of unit quaternions is denoted by H;. H; is 
a so-called multiplicative Lie-group. The set H4 can be identified with the three dimensional sphere 
S? — (x e R : x| 2 1). 

4.4.1.2 Matrix Representation of Quaternions 

1. Real Matrices 

If the number 1 is identified with the identity matrix 


1000 


19 


0100 
TR (4.1102) 
0001 


furthermore 
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nA 
= 


[» 


k^ 


looo 


= 


0 0001 
0 0 0-10 
1 0100 
0 -1000 


O Omo 
ooo! 


1 


then a quaternion g = w + ix + jy + kz can be represented as a matrix 
w =g =y 2 

aA| £ w —z—y 

“ly z vr 
=3 y -g W 


q 


2. Complex Matrices 


Quaternions can be represented by complex matrices: 
sA f 0i .a (0—1 k^ —i 0 
1742307»: 9711075» * V0 ij 


w-—iz —ir—y 
—ir+y w+iz j` 


> 


So 
ου 


Remarks: 


(4.110b) 


(4.110c) 


(4.111a) 


(4.111b) 


1. On the right hand side of equations (4.111a,b) i represents the imaginary unit of complex numbers. 
2. Matrix representation of quaternions is not unique, i.e. it is possible to give representations different 


from the ones in (4.110b,c) and (4.111a,b). 
3. Conjugate and inverse Element 
The conjugate of quaternion q = w + ix + jy + kz is the quaternion 
q= w -— irv — jy — kz. 
Obviously 
la? =a9=Gqew te ty +2. 
Consequently every quaternion q € H V (0) has an inverse element 
p q 
q= rz. 
lal? 
4.4.1.3 Calculation Rules 


1. Addition and Subtraction 
Addition and subtraction of two or more quaternions are defined as 


αι tq2—43... 

















(wi + wz — w3 +...) +il(£1 + £2 — £3 +...) +j(y1 + Y2 — y3 +...) 
+k(z1 + 22 — z3 + ...). 
Quaternions are added and subtracted as vectors in R4, or as matrices. 
2. Multiplication 
The multiplication is associative, so 
dido — ( wi ira 9 jui 4- Kzi)(wa 4- iva + jye + kz) 
— (wwe. — x1t3 — yiya — 2122) + (weg + wai 4- y1z2 — zy3) 1- 
4 j(wiya 4- wayi 1- 2129 -- 2211) ἠ- k(w1z2 + W221 + iya — vayi). 
Using the usual vector products in R? (see 3.5.1.5, p. 184) it can be written in the form 
9142 = (Go1 +4,) (402 + dy) = q01402 — di: da * di X d; 


(wy + ivy + jy: + kz) + (we + ive + jye + kze) — (w3 + iz3 + jys + kz3) +... 
) 


(4.112a) 


(4.112b) 


(4.112c) 


(4.113) 


(4.114) 


(4.115) 
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where d; - d» is the dot-product, and d1 x d is the cross-product of the vectors di, d € R?. Next 
is to identify the IR? with the space Ho of the pure quaternions. 

Remark: Multiplication of quaternions is not commutative! 

'The product q1q2 corresponds to the matrix multiplication of matrix L,, with vector q2, and it is equal 
to the product of matrix R,, with qi: 


Ὄωι πι —-Y1 δι Wa W2 —T2 —Y2 —22 w 
= —| 71 Wi —41 Yı T2 | . —| Ὁ; 29 —y2 Ti j 
qıq2 =La q2 = z w-mllwl"Pehth7liu wc τν 46) 
uou 1 σι ya Y2 2 W2 T2 yı 
£j —9i πι Wy E 2ο. ο -ἵο 10ο zı 


3. Division 
The definition of division of two quaternions is based on the multiplication: q1, q € H, q2 4 0, 


q E q: 
2 = qp =a. (4.117) 
q2 [A 


The order of the factors is important. 


1 
E Let qı = 1 +j, q = (1 — k), then |q2| = 1 , qa = (1 +k) and so 


gs v2 


q 1 
ia (1=i+j+k). 


qd. 12 g= 
lao? V2 


1 
= := q oa = (1 4+i+j+k 
da πω zí J )# 


4. Generalized Moivre Formula 





Let q € H, whrer q = qo +q = r(cosy - ng sing) with r = |q| and p = arccos IB , cosy = ia’ 
~ - ᾳ q 
ing = αἱ ae 
sing — la then for arbitrary k € IN : 
qE = rrd? = r*(cos(k p») 4- ng sin(k o)) . (4.118) 
5. Exponential Function 
For q = qo + q € H its exponential expression is defined as 
oo „k 
d=) T — e^ (cos |q| - n sin [q]). (4.119) 
Properties of the exponential function: 
For any q € H holds: 
e 4e? —],  (4120a) ef #0,  (4.120b) el = eVta — euet, (4.120c) 
ea" = —1, especially e" = e" = e" = —1. (4.120d) 
Unit quaternion u and V € R : e?" — cos -F usinó. (4.120e) 
If qı q2 = go qm then e™+# = e® e® . But it does not follow from e™+# = e™ e? that q@q@ = qo qi. 
H Since (ir)(jr) = kn? 4 —kr? = (jr) (ir) therefore also holds 
οἴπ οἷπ.. x , if: ae = pin+jn __ , irj X 
ee" = (cos 7)(cos 7) = (—1)(—1) = 1, but e = (s (ντ) + a sin (νο) τε], 
6. Trigonometric Functions 
For q € H let 
1 n —n i n —n, 
c08g:— 5 Gi +e ==} ; sing :— —nq G —e A : (4.121) 
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cos q is an even function, against which sin q is an odd function. 
Addition formula: It is valid for any q = qo + q € H 


cos q = cos qo cos q — sin qgosinq, sing = sin qo cos q + cos qo sin q . (4.122) 
7. Hyperbolic Functions 
For q € H let 

1 

cosh q := 5 (et + es) s sinhq := -04 (e — ε) : (4.123) 
cosh q is an even function, against which sinh q is an odd function. 
Addition formula: It is valid for any q = qo + q € H 

cosh g = cosh qo cos q — sinh qosinhq, sinh q = sinh qo cos q + cosh qo sinh q. (4.124) 


8. Logarithmic Function 
For q = q +q = r(cosy + nasin y) € H and k € Z the k-th branch of the logarithmic function is 
defined as 





Inr +naly +2kr), |a| #0 or |aļ|= 0 and qo > 0, 
TI 3 a a 2 
logy a { not defined for |q|/=0 and q <0. (4.125) 
Properties of the logarithmic function: 
e'& 1 — ¢ for any q € H, for which log, qis defined , (4.126a) 
logg ef = q for any q € H with |q| z (21 -- 1), 1 € Z, (4.126b) 
: T. : T. π 
logg;1—0,  (4.126c) logoi = 3b logoj = 5^ log; k = ΖΚ (4.126d) 


In the case when log q1 and log qo or q1 and qo commute, then, if k is suitable defined, the following 
known equality (4.127) holds: 


log(q: 2) = log qı + log q2. (4.127) 
For the unit quaternions q € H, holds |g| — 1 and q — cos -- nq sin and so 


logq := logo q = ng Y for q # —1, (4.128) 


9. Power function 
Let q € H and a €E R., then 


g* :— eelosa, (4.129) 


4.4.2 Representation of Rotations in IR? 

Spatial rotations are performed around an axis, the so-called rotation axis. It goes through the origin. 
It is oriented by a direction vector à 4 0 (on the axis). The positive direction on this axis is chosen 
by a. The positive rotation (rotation angle v > 0) is a counterclockwise rotation with respect to the 
positive direction. The direction vector is usually given normed, i.e. |a| = 1. 


The equality 

w=Rv, (4.130a) 
means vector w arises from vector V by the rotation matrix R, i.e., the rotation matrix R, transforms 
vector V into W. Since rotation matrices are orthogonal matrices holds 


R =R" (4.130b) 
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and (4.130a) is equivalent to 

V=R W = RW. (4.130c) 
Remark: At spatial transformations it is necessary to distinguish between the followings: 
a) geometric transformations, i.e. when geometric objects are transformed with respect to a fixed co- 
ordinate system, and 
b) coordinate transformations, i.e. the object is fixed while the coordinate system is transformed with 
respect to the object (see 3.5.4, p. 229). 
Now, geometric transformations are handled with quaternions. 


4.4.2.1 Rotations of an Object About the Coordinate Axes 


In a Cartesian coordinate system the axes are oriented by the basis vectors. The rotation around the 
x axis is given by matrix R;, around the y axis by R, and around the z axis by R, , where: 


1 0 0 cosB 0 sing cosy — sin y 0 
R,(a) := (° cos a -sna ) , R,(8) := ( 0 1 0 j , R(7) = (5m COS ^ o) . (4131) 
0 sina cosa — sin fj 0 cos B 0 0 1 


The relation between a rotation of an object and the rotation of the coordinate system (see 3.5.3.3,3., 
p. 213)) is 


R,(a) = D?(a) ,R,(8) = D7(8) ,R.(7) = DT(7). (4.132) 


Remark: The representation of the rotation matrices in homogenous coordinates is given in 3.5.4.5,1.,2., 
p. 234. 


4.4.2.2 Cardan-Angles 

Every rotation R around an axis passing through the origin can be given as a sequence of rotations 
around the coordinate axes in a given coordinate system (see also 3.5.3.5, p. 214), where here 

e the first rotation is around the x axis by an angle ag, then 

e the second rotation is around the y axis by an angle Bc, then 

e the third rotation is around the z axis by an angle yo. 

The angles oc, Bc, yc are called Cardan-angles. Then the rotation matrix is 


R = Re := Β.(γο)Βι(βο)Β.(αο) (4.133a) 


cos Be cos yc Sinagsin Be cos yc — cosacsinyc cosacsin Bo cos yc + sin ac sin yc 
= | cos fcsinye sin ac sin fc sin yc + cos ac cos ye cos ac sin Bo sin yc — sin ac cos yc | .(4.133b) 
— sin fc sin ag cos Bo cos ac cos fc 


Adventages: 

e very popular representation of rotations, 

e clear structure. 

Disadventages: 

e the order of rotations is important i.e. in general holds 


R;(ac)R,(6c)R:(4c) 7 R:(3c) R(Bc)Rs(ac) , (4.1330) 
e the representation is not unique since R(ac, bc, yc) = R(—ac + 180°, Bc + 180°, yo + 180°) , 
e not suitable for rotations after each other (e.g. at animations), 
e gimbal lock can happen (rotation of an axis by 90° goes into an other axis) 
E Gimbal Lock case: rotation around the y axis by 90° 


0 sin(ac—^c)  cos(ac — Yc) 

R(ac, 90°, yc)= | 0 cos(ac — 3c) -—sin(ac — c) | . 
-1 0 0 

It can be seen that one degree of freedom is lost. In practical applications it can lead to unpredictable 

motions. 




















(4.133d) 
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Remark: It can be realized that Cardan-angles are called sometimes as Euler-angles, in the literature 
however their definitions can be different (see 3.5.3.6, p. 215). 


4.4.2.3 Euler Angles 


The Euler-angles 7), J, i» are often introduced as follows (see 3.5.3.6, p. 215): 
e the first rotation around the z axis by angle wv, 

e the second rotation around the image of x axis by angle 0, 

e the third rotation around the image of z axis by angle y. 

The rotation matrix is 


R — Rg :— E, (g)R,(0)RÁ() (4.134a) 
cosy cosy — sinycos¥siny | — cos sing — sin cosücosq sin sin Ü 
sin Y cosp + cosy cos siny  — sin sing «- cosi cosü cos | — cosi sind | .(4.134b) 
sin V sin y sin Ü cos q cos 


4.4.2.4 Rotation Around an Arbitrary Zero Point Axis 


The counterclockwise rotation around a normed vector 4 = (az, ay, a;) with |a] — 1 by an angle q is 
made in 5 steps: 

1. Rotation of a around the y axis until reaching the x,y plane: à^ — R4& using R4 according to 
(4.135a). The result: vector a’ is in the z, y plane. 

2. Rotation of a’ around the z axis until becoming parallel to the x axis a’’ = Rea’ by Rz according 
to (4.135b). The result: vector à" is parallel to the x axis. 


ας az 
E. j t 
1/02 + a2 a2 + a? ya +a? ay 0 
— 1 4 5a = 
Rı = i. , (4.135a) R —ay ya +a? 0|. (4135b) 





IU juo ese - 0 0 1 
Vata a a2 


3. Rotation around the x axis by an angle y by Ra: 


1 0 0 
Rs; = R,(y) = | 0 cosy —siny }. (4.135c) 
0 sinp cosy 


Rotations R4 and R are inverted in the following two steps. 
4. Inverse rotation of Rs, i.e. rotation around the z axis by the angle 9 where sin 9 — ay, cos B — 


4/a2 + a? according to (4.135d). 


5. Inverse rotation of Rj, i.e. rotation around the y axis by —o where 








sin(—a) = = cos(—a) = ^? — around the y axis according to (4.1350). 
a2 4- a2 yè ta 
ας us: 
yt -ay 0 y a -- a2 \/a2 + a2 
Ry! = ay [2 1α20}], (41354) Rī'= 9 1 0 .  (4.135e) 
0 0 1 Z 0 ------ 
a? + a2 a2 4- a2 








Finally the composition matrix is: 


R(a, o) 2 Rj'R;'!R4R3R; — (4.135f) 


4.4 Quaternions and Applications 297 





cos y 4- a2(1 — cos q) Azdy(1— cosy) —a,sing a,a,(1— cosy) + ay sing 
dya,(1—cosp)+a,sing cospy+a mel — cos q) aya;(l— cosq) — a;sing | . (4.135g) 
a;a,(l— cosp) — aysing  a;ay(1— cosq)-Fa,sinp cosy + a2(1 — cos y) 


Matrix R(&, ) is an orthogonal matrix, i.e. its inverse is equal to its transpose: R71(a, y) = R™(a, y). 
The following formulas are also valid: 


Rx = R(a, y)x 








(cos y)¥ + (1 cse) j^ | TTE” X (4.136a) 
a 
= (cos p)X + (1 — cos y)Xg + (sin Oia x X. (4.136b) 


In these formulas the vector X is decomposed into two components, one is parallel, the other is perpen- 


- 22 X8. I οσο ος 
dicular to a. The parallel part is Xz = dap the perpendicular part is r — X — Xa. The orthogonal 
a 


part is in a plane whose normal vector is a, so its image is cos y¥ + sin yr*, where r* is obtained from 





E 
T by a 90? rotation in positive direction: r* ale x r. The result of the rotation of vector X is 
- a c 365 x : d 
Xa + cos yr + sin yr* = lap apa (cos y) + (sin MI xr (4.136c) 
a a 
with ax F=ax (K— η =axx. (4.136d) 


Advantages: 

e ,,Standard representation” in computer-graphics, 
e CARDAN-angles should not be determined, 

e no gimbal lock. 


Disadvantage: Not suitable for animation, i.e. for interpolation of rotations. 


4.4.2.5 Rotation and Quaternions 


If the unit vector & in (4.135f) is identified as the pure quaternion a (while the angle of rotation yp 
remains the same), then one gets: 


qi --d—d2—43 2q1q» — 2qoqs 2q193 + 2q042 
R(a.v) — | 20142 4- 2q0q3 dà — di di — dà 29293 = 20 | -: Μία) (4.1372) 
2q1q3 — 2qoq2 2q2q3 + 2qoq1 ας - qd — d$ +H 


where go = cos a and q = (q1, q2, 93)" = (de, dy, az)" sin? „i.e. gis the unit quaternion g = g(a, y) = 
cos 2 + asin T € H, . If vector X is considered as R? 5 X — zji-- 25j 4- 4k € Ho , then 


Ria, y)x = R(q)x = xq. (4.137b) 
Especially the columns of the rotation matrix are vectors q e; 7: 


1 0 0 
rep- (afos (i)a a(o)a] =u qjq qkq). (4.1370) 


Consequences: 


e The matrix of rotation can be determined with the help of quaternion q — cos a + asin F 
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e For the rotated vector R(a, y)x holds R(a, y)x — qxq in the sense of quaternion multiplication 
and identifying IR? with the set of pure quaternions Hp . 

For every unit quaternion q € H; q and —q determine the same rotation, so H; is a double covering 
of SO(3). Performing rotations one after the other corresponds to multiplication of quaternions, i.e. 


Βί(ῳ) Β(αι) - ΒίΦ αι); (4.138) 
and the conjugate quaternion corresponds to the inverse rotation: 
R-'(j) - R(). (4.139) 
H Rotation by 60° around the axis defined by (1, 1, 1)T. First the direction vector should be normed: 
1 3 1 
a = ——(1,1,1)T. Then with siny = sin60° = NE and cosy = cos60° = = the rotation matrix 
v3 2 2 
becomes 


i 7 1{2-12 
R{—~(1,1,1)7, 60°?) == { 2 2-1). 
(5 ) 312312 2 


The quaternion describing the rotation is: 


q=4 (50 1,1)", αν} — cos 30? 4- iG +j + k) sin 30° 








V3 
- aa “πῶ τὴ Hk) = τ | n ΣΣ; 
Furthermore 
1 
ο. (C+ 2a i+j ος. eere) 
0 6 2 6 
- (2X6 i+j +w) aite gE rz) 


6 
24. 24, 12, 1 1 
Lb] k= 3 (21 + 2j Ἣ 





36 36 36 
The two other columns are determined analogously: 


^ -1 
ο ο ο ο ο 


9 
τς τν ZI Seni) ται ica S (31). 


ο ο ο ο) 


4.4.2.6 Quaternions and Cardan Angles 
The rotation matrix in Cardan angles (see (4.133a,b), p. 295) is exactly a matrix of rotation with a unit 
quaternion q € Hi. 


Re(ac, Bo, ¥c) ^ Rz(yc) Rj(&c)R. (ac) (4.1402) 




















w| = 
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cosficcos^go sinagsin/fjocos^c — cosagsin^c cosac sin fo cos ^o 4- sin ac sin ^c 
Ξ{ εο5βοδίπγο sin ac sin fc sin yc + cos ac cos ye cos ac sin fcsin^o — sinagcos?o | (4.140b) 


— sin fc sin ag cos Bo cos ag cos ag 
2 +q -— q2- 2q -2 2q193 + 2 
" do - di — d5 — 93 qd 2 093 3 193 Joq2 
= [riglij-1 = | 20192 + 2¢098 d) — di-- 42 — q3 24293 — 2G0% = R(q) (4.140c) 
24193 — 24092 2q2q3 + 2qoq1 dj—di— 4i di 


ld) Qe) o 


Comparing the matrix elements one gets 


T21 . T32 
tanc — —, sinfgo- --ται, tanac = —. (4.141a) 
11 T33 
In general, the solution is not unique, which is typical in trigonometric problems. However the unique- 


ness of angles can be reached by discussion of the defined domains. 


Reversed, it is easy to get the unit quaternion from the rotation matrix. 
4qoqı = r32 — r23; 4qoq2 = ria — T31, 49093 = 721-12, (4.141b) 





4ο -1-- 40 - ος - ος- οὗ G = r11 + r22 + T33- (4.141c) 
Since q and —q define the same rotation, go can be determined as 


1 
do = ον Τα T T23 c T33 d- l. (4.141d) 


'The other components are 








T32 — T23 Tis = Ta Toy — T12 
= = 4.141e 
τ 4g P 4do ' 440 ( 9 
W Let the rotation matrix be the following: 
1 1 
v2 =3V6 5V2 
1 
E 1 1 
R=5| v3 3v6 -3⁄2 
0 1 νὸ 
1. Determination of the Cardan angles: Based on the above formulas sin fo — —ra1 — 0, so fo = kr, 
k € Z. Furthermore tan yc = c 1, so yc = e + kr, k € Z, and from tan ac = e it follows 
ri 4 r33 v3 


T 
that ag = é + kr, k € Z. The angles are unique, if they are determined as the ,,possible smallest” 


: ' π 
ones, i.e. the rotation whose angles have an absolute value < 7 So the angles are 


T T 
w= g Bo — 0, > 


2. Determination of the unit quaternion which results this rotation: 


1 1 1 1 1 
Ad -1- 5 (V9 5 V6 v3) also go = 5\[1 + 5(V2 5 V6 -- V3) s 0,8924 — cos £. 


The (possible smallest) angle is p = 53.6474°, so sin £ = 0.4512. 


3. Determination of the further components of q and the direction of the axis of rotation 
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a — (as, Gy, dz)": 











| 1 
T32 — T23 (5 + 372) qı 
qj — ————— = -= g 0,2391 80 d, — —5 R: 0,5299, 
4qo Aqo sin $ 
1 1 
, Totya 
Τα Τι 2 2 q2 
LI IR 1 S = x 0,21 
qo Ads i 0, 099 SO dy sin 2 0, 2195, 
1 1 
mm 3(V2*3V6) à 
Q = ——— = =~__=—_~ #0, 3696 so a, = — 5 & 0,8192. 
4qo 4qo sin $ 


Remark: At the calculation of the components in (4.141e) it can be a problem when qo is zero or 
close to zero. In this case the unit quaternion can not be determined by the formulas in (4.141e). To 
understand this situation one discusses the trace of the rotation matrix: 


TrR = ri + r22 + r33 = 4q — 1. (4.142a) 


1 
If Tr R > 0, then qo = gv TrR +1 > 0, and the formulas (4.1416) can be used without any problem. 


If TrR. € 0, then qo can be close to zero. In this case the greatest element of the main diagonal is 
considered. Assume, it is rjj. Then |g;| is greater than |go| or |g3]. The components qi, q2, q3 also can 
be determined from the elements of the main diagonal of the rotation matrix. Choosing the positive 
sign for the square-roots follows: 








1 1 1 
a= gv! F rir — 122 — 133, 42 = ονι H T22 — T11 — T33, 43 gv! Hras — P11 — T22 -(4.142b) 


Calculation rules: From this facts the following calculation rules are derived: 


e If Tr R. E 0 and rij 2 729 and rjj 2. 0733, then q1 has the greatest absolute value, so 











T32 — T23 Τοι + ria rig +731 
qo = ; = > G= : 4.142c 
4qı Aq Aq ( ) 
e If TrR < 0 and r22 > r1, and r22 È r33, then q2 has the greatest absolute value, so 
T13 — T31 ΤΟΙ d T12 T23 + T32 
do = » a= >: ΒΞ : 4.142d 
Aq» 48 48 ) 
e If TrR < 0 and r33 > r1ı and r33 > r22, then q3 has the greatest absolute value, so 
T21 — T12 Tài +113 T23 + T32 
go = > a= > Q= . 4.142e 
48 44 448 


Since the CARDAN-angles define the rotations around the corresponding axes, one can find the assign- 
ments given in the following table. Then the rotation 


R(a, 8, 7) = R((0,0, 1)", y)R((0, 1,0)", 8)R((1, 0,0)", a) (4.142f) 
corresponds to the unit quaternion 


q-Q; Qy Qr. (4.142g) 
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rotation Cardan angle | around quaternion | 

a a 
Ro((1, 0,0)", ac) ac zaxis | Qa := cos + isin £ | 
Ro((0, 1,0)", Bc) Bo y axis | Q, :— cos E + jsin m | 
Ro((0, 0, 1)7, yc) ^c zaxis | Q,:— cos B + ksin > | 





: T T ' ium ; ; 
E Knowing the Cardan angles «o — —, fc — 0, yc = T the quaternion describing this rotation can 


6 
be determined in the following way: 
a .. Q πο, π 
Q, — cos € Lisin € — cos — + isin ; 
2 2 12 12 














[- 

9 

Ων -- cos ^£. + jsin £ = cos0 +jsin0 = 1, 
Q, = cos W 4 ksin © = cos = + ksin z 


2 2 8 8' 
The final result coincides with that given on page 299: 


q:— Q:QyQx (ss + ksin 5) 1 (cos i + isin =) 








π π : To. esu TUS ccv o TE ELS π 
= cos—- cos +1cos —=-sin + Jsin = - sin H ksin — - cos 


8 12 8 12 8 12 8 ^ 12 
0, 8924 + 0, 2391i -- 0, 0991j -- 0, 3696k . 


4.4.2.7 Efficiency ofthe Algorithms 


To estimate the efficiency of the algorithms standard operations are defined from which the more com- 
plex operations are originated. For complicated comparisons with other methods see [4.12]. 

Let 

e M: number of multiplications, 

e A: number of additions and subtractions, 

e D: number of divisions, 

e S: number of standard functions calls, e.g. trigonometric functions, which are composed of a consid- 
erable number of multiplications, divisions and additions, 

e C: number of comparisons of expressions, which increase the computing time by interrupting the al- 
gorithm. 



































Operation AIMIDISIC | 
Quaternion into Matrix 12 | 12 | 
Matrix into Quaternion (TrR >0)| 6/5]1]1|1 | 
Matrixinto Quaternion (IrFR €0) | 6 | 5 |1|14,3 | 
Rotation of a vector | A | M Remarks 





with rotation matrix | 6 | 9 


with unit quaternion | 24 | 32 | normal quaternion multiplication | 








with unit quaternion | 17 | 24 | fast quaternion multiplication 
with unit quaternion | 18 | 21 changing into rotation matrix 
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Rotation of n vectors A M Remarks 

with rotation matrix 6n 9n 

with unit quaternion | 24n 32n | normal quaternion multiplication 
with unit quaternion l7n 24n fast quaternion multiplication 
with unit quaternion | 12+6n | 12+9n changing into rotation matrix 








composition of two rotations | A | M 





with rotation matrix 18 | 27 

















with unit quaternion 12 | 16 





Summary: An algorithm based on quaternions is faster only when rotations are performed after each 
other. It occurs mainly in computer graphics at animations, i.e. at approximations of rotations. 


4.4.3 Applications of Quaternions 
4.4.3.1 3D Rotations in Computer Graphics 


To describe motion flows interpolation of rotations are used. Since the 3D-rotations can be represented 
by unit quaternions, algorithms are developed for interpolation of rotations in computer graphics. The 
easiest idea is to start analogously to the definition of linear interpolation in Euclidian-spaces. Basic 
algorithms are Lerp, Slerp and Squad. 
1. Lerp (linear interpolation) 
Let p,q € H4 and t € [0, 1], then 

Lerp(p, q, t) 2 p(1 — t) t qt. (4.143) 
e This is a linear segment in R^ , connecting p € H, — S? C R* with q € Hı » S? c R*. 
e This segment is inside of the unit sphere in IR^, and does not represent any connecting curve on the 
unit sphere $? ~ Hı. 
e Therefore the rotation is determined by normalizing the found quaternion. 


This simple algorithm is almost perfect. The only problem is that after finding the interpolation points 
on the secant between the given points and normalizing the found quaternions, the resulted unit quater- 
nions are not equidistant quaternions. This problem is solved by the following algorithm. 


2. Slerp (Spherical linear interpolation) 
Let p,q € Hi, t € [0, 1] and (0 € y < r) the angle between p and q. Then 
= 1-t sin(t 
Slerp(p, q, t) — p(pa)! 2 pq — o| (1-99) ] +q | ί 2| ; (4.144) 


sin Y sin y 





e Interpolation along the great circle on the unit 
sphere S3 C R4, p and q are connected; 
e The shortest connection is chosen, —Sc (pq) = 
(p, q) > 0 must hold (where (, ) denotes the dot prod- 
uct of p and q in R?). 
In Fig.4.5 the interpolations according to Lerp(a) 
and Slerp(b) are compared. 

Figure 4.5 


Special case p = 1: Let p = 1 = (1, 0, 0, 0)" and g — cose -- nq sin o, then 
Slerp(p, q, t) = cos(ty) -- nqsin(t v). (4.145) 
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Special case equidistant grids: Let y = m then 
n 


1 


sin Y 





k 
qk := Slerp(p, q, a = (sin(y — k y) p + sin(kv)q), k-—0,1,...,m. (4.146) 


Interpretation of the Slerp interpolation: To show the equivalence of the two expressions in 


(4.144) first Q — p^!q — "uu = pq is calculated. Since p,q € H; the scalar part is 
p 
Qo = ScQ = Sc (pq) = (p, ο) = cosy. (4.147) 


Since p = p- 1, and q = pp^!q = pQ, the interpolation between 1 and Q is multiplied by p to keep the 
interpolation between p and q. 
Qt) = sin((1 — t)p) c gente) ο sin((1 — thy) end sin(tp) ji agony) 
sin sin q sin q sin y sin 
sin y cos(tp) — sin(ty) cosy + sin(ty) cos | ., sin(tio)sing 
= Ti 
sin y i sin y 

cos(ty) + rig sin(ty) = elige — gtlosQ Qt, (4.148) 


It follows that 


a(t) = pQ(t) =p 





sin((1— t)y) E sin(ty) 
sin y sin Y 





=pQ = ppg} = pi. (4.149) 


3. Squad (spherical and quadrangle) 
For qi, qi+1 € H4 and t € [0, 1] the rule is 

Squad(qi qi+1, Si, Si+1, t) = Slerp(Slerp(qi, qi+1, t), Slerp(si, Si+1, t), 2t(1 — t)) (4.150) 
log(q; qii) “- eus eal] 





4 

e The resulted curve is similar to a Bézier curve, but it keeps the spherical instead of the linear inter- 
polation. 

e The algorithm produces an interpolation curve for a sequence of quaternions qo, q1, ---, qN- 

e The expression is not defined in the first and last interval, since g_1 is necessary to calculate so and 
qN44 to calculate sy . A possible way out is to choose so = qo and sy = qn, (or to define g_; and qy+1). 


with si = qi exp ( 


There are additional algorithms based on quaternions: nlerp, log-lerp, islerp, quaternion de Casteljau- 
splines. 


4.4.3.2 Interpolation by Rotation matrices 
The Slerp-algorithm can be described completely analogously with the help of rotation matrices. The 
logarithm of a 3 x 3 rotation matrix R is needed (i.e. an element of group SO(3)) and it is defined by a 
group-theoretical context as the skew-symmetric matrix r (i.e. an element of the Lie group so(3)), for 
which e" = R. Then the Slerp-algorithm can be used to interpolate between rotation matrices Ro and 
R,,which is described as 

R(t) 2 Ro(R; Ra) — Roexp(tlog(R5!Ru)). (4151) 
In general it is more simple to use the quaternions based algorithm and to determine R(t) from q(t) 
according to the calculations of the rotation matrix representing the unit quaternion. 


4.4.3.3 Stereographic Projection 
If 1 € Hı ~ S? is taken as the North pole of the three dimensional sphere 5? , then unit quaternions 
or elements of the three dimensional sphere can be mapped by the stereographic projection H; 3 q > 
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14 q)(1- q)*! € Hy ~ R? into pure quaternions or into IR? respectively. The corresponding inverse 
3 8 
mapping is 


R? ~ Ho p> (p-1)(p+1) EH ~ S. (4.152) 
4.4.3.4 Satellite navigation 


The orientation of an artificial satellite 
circulating around the Earth is to be de- 
termined. The fixed stars are consid- 
ered to be infinitely far away, so their 
direction with respect to the Earth and 
C the satellite are identical (see Fig.4.6). 
satellite Any difference in measurements can be 

deduced from the different coordinate 

systems and so from the relative rota- 
Figure 4.6 tion of the coordinate systems. 


fixed coordinate 
system 


i-th fixed star, infinitely 
far away 












Let a; be the unit vector pointing into the direction of the i-th fixed star from in the Earth’s fixed 
coordinate system, and b; be the unit vector pointing into the direction of the i-th fixed star in the 
satellite’s fixed coordinate system. The relative rotation of both coordinate systems can be described 
by a unit quaternion h € Hy: 

b; —h&ih. (4.153) 
If more fixed stars are considered, and the data are overlapped by measuring errors, then the solution 
is determined by the least squares method, i.e. as the minimum of (4.154), where h is a unit quaternion 


and a; — à; and b; — b; are unit vectors: 


Q = So |b; — hash? = 7 (b; — ha; h) - (b; — ha; h) 
i=1 i=1 
- Y bi-hajh)(b; -ha;jh) - 3 (2— b;ha;h — ha;hby). (4.154) 
i=1 i—l 


Since the group H, of the unit quaternions form a Lie group, the critical points of Q? can be determined 
by the help of derivative 








o9vp 
ðh = lim Eo o a (v, h quaternions, J real) (4.155) 
0—0 9 
from 
8,Q?! 2 —V (b; vha;h t bjha,(vh) - vha;hb,; 4 ha;(v h) b) =0. (4.156) 
ici 
Here v, b; and ha; h are pure quaternions, so V — —v, and therefore (4.156) can be simplified: 
aR = —4v - D ha;hx bi) =0. (4.157) 
ici 
Since here v is arbitrary, this expression vanishes if 
Mhahxb,-0. (4.158) 
ici 


Let R. be the rotation matrix represented by the unit quaternion h, i.e. hajh = Ra;. With the 3 x 3 
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matrix 


0 —az; ay 
K(a)={ a, 0 —a, (4.159) 
—ay az 0 

defined by vector 4 = (az, ay, az) € R? for any vector b c R? one gets: 

K(a)b=axb, K(K(a)b) =ba™— ab". (4.160) 
From this relation the critical points of the minimum problems are determined: 

So K (Ra; x bi) =0 = S1(b; aT R™—- Ra, bt) =O — ΒΡ -- ΡΤΗΣ (4.161) 

i=l i=l 
where P — 3748; br . If P is decomposed into the product P = RIS, where matrix S is symmetric 
and P = R, is a rotation matrix, then from (4.161) follows 

RRIS -SR,R?, (4.162) 
and 

R-R, (4.163) 
is obviously a solution, since in this case R;,RTS = S = SR;RT, because R;RT — E. However there 
are three other solutions, namely 

R=R;R, (j =1,2,3), (4.164) 
where R, denotes the rotation by 7 around the j-th eigenvector of S , i.e., there is R;|SR; = S. That 
R = RR, is a solution of (4.162) which can be seen from R;R,R>S — SR;RTR7 —5 RjS — 
SR? <> R,SR;=S. 
The solution for which Q? is minimal is 

R= Rp, falls detP>0, 

~ | R,, Rp, falls detP <0, 

where R; is the rotation by m around the eigenvector of S associated with the eigenvalue of the smallest 
absolute value. 
4.4.3.5 Vector Analysis 


If the V operator (see (13.67), 13.2.6.1, p. 715) and a vector V (see 13.1.3, p. 704) are identified with 
VQ and v in quaternion calculus, i.e. 


(4.165) 


ð ð ὃ 
ει εἰς. 41 
Vo i aw áp (4.166 
y(x, y, z) = v(x, y, zji + v(x, y, z)j + v(x, y, z)k (4.167 


with i,j,k (according to (4.107), p. 290), then the multiplication rule for quaternions (see (4.109b), 
p. 291) gives 


Voy — —V - V 4- V x V — —div V 4 rot V, (4.168 
(see also Bl in 4.4.1.1, 4. p. 291). 
Substituting 
ὃ ὃ ὃ ὃ 
D==—+4+i—+j—+k— d 4.169a 
at ἶδα δρ τ 5; an (4.169a 


w(t, x, y, z) = wolt, v, y, z) + w(t, z, y, zji 4- wa(t, x, y, z)] 4- wa(t, x, y, z)k 
— wo(t, v, y, z) + w(t, x, y, z), (4.169b 
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then 


o 
Dw = gT divw + rot w + grad wo. (4.169c) 


Especially, for an arbitrary twice continuously differentiable function f(t, x, y, z) 
Pf Pf Pf 
ðr? Oy ^02 
OF Of PF 
VeVoaf=-VVf= : =—Asgf, 4.170k 
eVaf f διῦ Oy? 2 T ( D 
where A denotes the Laplace operator in IR? (see (13.75) in 13.2.6.5, p. 716). 
LINES Op op of of 
DDf =DDf = f =A 4.170c 
f / ot? T Ox? Oy? T 0z? af ( c) 
where A; denotes the Laplace-operator in R4. VQ, just as D is often called the Dirac- or Cauchy- 
Riemann operator. 


4.4.3.6 Normalized Quaternions and Rigid Body Motion 


1. Biquaternions 


VoVo f 7 VoVo f 2 VV f 





— Ag f und (4.170a) 








A biquaternion À has the form 

h=ho+eh, with ho, hie H (4.171) 
Here € is the dual unit, that commutes with every quaternion, furthermore e = 0. The multiplication 
is the usual quaternion multiplication (see 4.115, p. 292). 
2. Rigid Body Motion 


By the help of unit biquaternions, i.e. biquaternions with 


























TI T T ho ho =l 
hh = (h hı )(h hj))-1 € T τ. 4.172 
ih = (ho 4- ehi)(ho 4- ehi) bunt is (4.172) 
rigid-body motion (rotation and translation after each other) can be described in IR?. 
Table 4.1 Rigid body motions with biquaternions 
Element Representation by 
point p = (Pz, Py, Pz) in space Ď = 1 + pe with p = pri + pyj + p-k 
rotations r € H; unit quaternions 
: 1 : : : 
translations t = (tz, ty, tz) 1+ gue witht =t,i+tyj+t,k 
The unit biquaternions 
ο 1 1 
h=ho+me=(1+5te) r=r+ re tcHo, rcH;, (4.173) 


give a double covering over the group SE(3) of the rigid-body motion in IR? since À and — describe 
the same rigid-body motion. 
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4.5 Systems of Linear Equations 


4.5.1 Linear Systems, Pivoting 
4.5.1.1 Linear Systems 

















A linear system contains m linear forms 
Yı = anTı + atz +: + anin + a 
= dat + aggte +++: + amin + a 
Ya 2111 22:02 2nUn 2 or y - Axa (4.1742) 
Ym = Amit, + Ame%2 + ++ + Amntn + dq 
with 
Qi 412 *** Gin αι, Ti, yi 
021 022 *'** Gan a2, T2, ya; 
A-|. as]. [bp x-[|.]|. y=]. (4.174b) 
Am1 Am2 ``’ Amn Am Tn Ym 
The elements ay, of the matrix A of size (m,n) and the components a, (4 = 1,2,...,m) of the column 
vector a are constants. The components x, (m = 1,2,...,n) of the column vector x are the independent 


variables, the components y, (u — 1,2,..., m) of the column vector y are the dependent variables. 


4.5.1.2 Pivoting 


1. Scheme of Pivoting 

If an element a; is not equal to zero in (4.1742), the variable y; can be exchanged for an independent one 
and the variable x; for a dependent one in a so-called pivoting step. The pivoting step is the basic ele- 
ment of pivoting, the method by which for instance systems of linear equations and linear optimization 
problems can be solved. The pivoting step is achieved by the schemes 

















αι Ug +++ Le cts In 1 
Ένα αἱ t Qik +++ Gin αι 
Y2|G21 022 *:* Q2k *** Aan a2 
ΡΟ νιν yet 4.175 
Yi | Qil Qi |QOik| t7 Gin Qi’ 
Ym|Ami m2 *'* Ümk "7" Ümn Üm. 
Tk|O Oi *** Qik *** On Oh 





where the left scheme corresponds to the system (4.1742). 


2. Pivoting Rules 

The framed element in the scheme aj, (ai, 4 0) is called the pivot element; it is at the intersection of 
the pivot column and pivot row. The elements a and q, of the new scheme on the right-hand side 
will be calculated by the following pivoting rules: 





1 Auk ; 
l. ogg--——, (41763) 2. oy -— HR (u —1,...,muz i), (4.176b) 
Qik Qik 
Qiy αι 
3. αν----, oj—-—— (v=1,2,... n;v £ k), (4.176c) 
Qik Qik 
Qiy 
4. αμν-αμν- uk = Oy Ἔαμκαν, Qp = αμ o αμκαι 
ik 


(for every pp ZZ i and every v Z k). — (4.176d) 
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To make the calculations easier (rule 4) one writes the elements aj, in the (m-+1)-th row of the pivoting 
scheme (cellar row). With this pivoting rule one can change further variables. 


4.5.1.3 Linear Dependence 

The linear forms (4.1742) are linearly independent (see 9.1.2.3, 2., p. 553), if all y, can be changed for 
an independent variable z, . The linear independence will be used, for instance to determine the rank 
of a matrix. Otherwise, the dependence relation can be found directly from the scheme. 


After three pivoting steps 
(for instance y4 — x4, 

Uo — Xi, yi — c3) the 
table becomes: 

















No further change is possible because a32 = 0, and one can see the dependence relation ya — 2yi — 
3y2 + 10. Also for another sequence of pivoting, there remains one pair of not exchangeable variables. 


4.5.1.4 Calculation of the Inverse of a Matrix 
If A is a regular matrix of size (n, n), then the inverse matrix A^! can be obtained after n steps using 
the pivoting procedure for the system y — Ax. 
351 [v1 v2 23 [να 21 23 | y3 πο νο | vs my 
4 351 4| 3-1-—5 11) 18|-1|-5 z| 13 -I -5 
αλ- 2415 5. 8 : : 
215) = 4493 45* 3 0 ]|1' z|-2 0 1' z3 -2 0 1 
wl 22  zz1-2-2 m 5 -2-2 «|-21 2 8 






































2 8-21 
After rearranging the elements one gets A1 — -1-5 13 . (The columns are to be arranged with 
0 1 -2 


respect to the indices of y;, the rows with respect to the indices of xp.) 


4.5.2 Solution of Systems of Linear Equations 
4.5.2.1 Definition and Solvability 


1. System of Linear Equations 


A system of m linear equations with n unknowns z1, £2, .. . , En 
3121 c 01232 c ::: c Ainin = 04 
is Tidetur da cd or briefly Ax = a, (4.177a) 
m + mata + -** + ürmmin = üm 





is called a linear equation system. Here the following designations are used: 


Q11 412 *** Gin αι, Ti, 
a21 G22 *** Gan a2, T2, 

A-|. » a=]. |, x= |. (4.177b) 
Gm1 Am2 `` Amn Am Tn 


If the column vector a is the zero vector (a = 0), then the system of equations is called a homogeneous 
system, otherwise (a # 0) it is called an inhomogeneous system of equations. The coefficients a,,, of the 
system are the elements of the so-called matrix of coefficients A, and the components a,, of the column 
vector a are the constant terms (absolute terms). 
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2. Solvability of a Linear System of Equations 
A linear system of equations is called solvable or consistent or compatible if it has a solution, i.e., there 
exists at least one vector x — « such that (4.177a) is an identity. Otherwise, it is called inconsistent. 
The existence and uniqueness of the solution depend on the rank of the augmented matriz (A, a). One 
gets the augmented matrix by attaching the vector a to the matrix A as its (n 4- 1)-th column. 
1. General Rules for Inhomogeneous Linear Systems of Equations An inhomogeneous linear 
system of equations Ax — a has at least one solution if 

rank (A) = rank (A, a), (4.178a) 
is valid. Furthermore, if r denotes the rank of A, i.e., r = rank (A), then 

a) for r = n the system has a unique solution, (4.178b) 


b) for r < n the system has infinitely many solutions, (4.178c) 


i.e., the values of n — r unknowns as parameters can be chosen freely. 


ΒΑ: 


21 -- 222 1- 973 -- πι 2ης 2 The rank of A is 2, the rank of the augmented matrix of co- 
321 — 2X2 + 523 — 334 — 45 = 6 efficients (A, a) is 3, i.e., the system is inconsistent. 
2a, + wo + 2x3 — 2x4 — 325 = 8 
E B: : 
ee | Both the matrices A and (A,a) have rank equal to 3. Be- 
i th ie cause r = n = 3 the system has a unique solution: xı = 
@— 2% — t= 2 . 
; 2 EI = 10 1 2 
321 — X9 cT 9X3 — 3 =, T2 = — >, q3-—-—-. 
—2z; 4 229 -- 3:3 — —4 T 7 7 





Both the matrices A and (A, a) have rank equal to 2. The 


HC: system is consistent but because r < n it does not have 
tı — t2 + Z3 — t4= 1 a unique solution. Therefore n — r — 2 unknowns can 
tı — 99 — + t= Ü be considered as free parameters: r9 — 21 — E 3 = 
σι -- z2— 223 4 24 — -1 i 

z4 + y (z1,x4 arbitrary values). 
HD: 

21 {22} -- 13+ t=1 'There is the same number of equations as unknowns but 
2r, — X_ + 243 + 2x4 = 2 the system has no solution because rank(A) = 2, and 
32, + T2 + rg 344 = 3 rank (A , a) — 3. 

dc 920 "P 323 T = 0 








2. Trivial Solution and Fundamental System of Homogeneous Systems 
a) The homogeneous system of equations Ax = 0 always has a solution, the so-called trivial solution 
Gy = 29—...— 24-0. (4.179a) 
(The equality rank (A) — rank (A , 0) always holds.) 
b) If the homogeneous system has the non-trivial solutions @ — (01,02,...,05) and B. — (Ei, Bo, . .., 
Ên), ie., œ AOand B 7 0,thenx — s +t @ is also a solution with arbitrary constants s and t, 
i.e., any linear combination of the solutions is a solution as well. 
Suppose, the system has exactly l non-trivial linearly independent solutions oc, o2... . , ou. Then these 
solutions form a so-called fundamental system of solutions (see 9.1.2.3, 2., p. 553), and the general 
solution of the homogeneous system of equations has the form 
x = kya, + koe +--+ + ἴμαι (kı, k2, . . . , kı arbitrary constants). (4.179b) 
If the rank r of the coefficient matrix A of the homogeneous system of equations is less than the number 
of unknowns n, i.e., r « n, then the system of equations has | — n — r linearly independent non-trivial 
solutions. If r — n, then the solution is unique, i.e., the homogeneous system has only the trivial 
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solution. 


To determine a fundamental system in the case r < n one chooses n — r unknowns as free parameters, 
and expresses the remaining unknowns in terms of them. If reordering the equations and the unknowns 
so that the subdeterminant of order r in the left upper corner is not equal to zero, then one gets for 
instance: 


vi = 01 (Xp41,@ry2,---;2n) 
X9 — Ta(Tr41, Ur42,--- (Bn) (4 180) 
gy m grita, cc Wn): 


Then one can get a fundamental system of solutions choosing the free parameters, for instance in the 
following way: 








Tre] Ur42 Ur43 01+ Tn 
1. fundamental solution: 1 0 0.0 
2. fundamental solution: 0 1 0.0. (4.181) 
(n —r)-th fundamental solution: 0 0 0 >> 1 
HE: The rank of the matrix A is equal to 2. The system can be 
πω μα ο «ει olved fo and Iting to 
solved for x; and x9 resulting to: zi — —zza — T4, T2 = 
a+ a — 225 - 3:4 — 0 τ 2 8 l 23 a 
321 — zo 4 823 - 4,0 . . 
@ + 3x29 — 9x3 + 724 0 23 — 2x4 (x3, 74 arbitrary). Fundamental solutions are a= 
37 
im z 1,0)7 and e — (—1, —2,0, 1)7. 


4.5.2.2 Application of Pivoting 


1. System of Linear Functions Corresponding to a Linear System of Equations 
In order to solve (4.1772), a system of linear functions y — Ax— a is assigned to the system of equations 
Ax = a so the use of pivoting (see 4.5.1.2, p. 307) is possible: 


Ax=a (4.182a) is equivalent to y=Ax-a=0. (4.182b) 


The matrix A is of size (m, n), a is a column vector with m components, i.e., the number of equations 
m must not be equal to the number of unknowns n. After finishing the pivoting one substitutes y = 0. 
The existence and uniqueness of the solution of Ax = a can be seen directly from the last pivoting 
scheme. 
2. Solvability of Linear Systems of Equations 
The linear system of equations (4.182a) has a solution if one of the following two cases holds for the 
corresponding linear functions (4.182b): 
Case 1: All y, (u — 1,2,..., m) can be exchanged for some z,. This means the corresponding system 
of linear functions is linearly independent. 
Case 2: At least one y, cannot be exchanged for any z,, i.e., 

Us — Ai t Aaa too Amin + Ao (4.183) 
holds and also Ag — 0. This means the corresponding system of linear functions is linearly dependent. 
3. Inconsistency of Linear Systems of Equations 


The linear system of equations has no solution if in case 2 above Ag Z 0 holds. In this case the system 
has contradictory equations. 


4.5 Systems of Linear Equations 311 








πι -- 2πο -- 4x43 — 44 = 2 
—9z, t 3919 — 9z3 F Ar, — 3 
2π1 -- ὅπο + ὅσα -- ὅσα ---1 After three pivoting steps 
for instance > t 
n vii ea ο. ( Uu 1 
Tı T2 T3 T4 l Us — T4, Y2 — x2) fol- 
1-2 4-1-2 lows: 





ψα]|- 3-3 4-3 
2-3 5-3 1 
This calculation ends with case 1: y1, y2, y3 and 23 are independent variables. Substituting y; — ya — 
y3 = 0, and 23 = t (—oo < t < oo isa parameter) consequently, the solution is: x1 = 2t+1, x2 = 3t—2, 
te = t, 04 = 3: 





4.5.2.3 Cramer's Rule 


There is the very important special case when the number of equations is equal to the number of un- 
knowns 
Guti Faita H=- +F linfa = âL 
02111 + A2212 + ` ** + AmnTn = ag 





(4.184a) 
ei + an2tT2 +-+ -— = απ 

and the determinant of the coefficients does not vanish, i.e., 
D-detA Z 0. (4.184b) 


In this case the unique solution of the system of equations (4.184a) can be given in an explicit and 
unique form: 

Di D2 D, 

—. Hu — uuo ge 4.184c 
D 2D i= p ( ) 
D, denotes the determinant, which is obtained from D by replacing the elements a of the v-th column 
of D by the constant terms a,,, for instance 


zı = 


Q11 Q1 Q13 *** Q1n 
Q21 12 133 *** Q2n 

D: = -— “ὦν Ἕ : (4.184d) 
Anl An Qn3 *** Qnn 


If D = 0 and there is at least one D, 4 0, then the system (4.184a) has no solution. 
In the case D = 0 and D, — 0 for all v — 1,2,...,n, then it is possible that the system has a solution 
but it is not unique. (see Remark p. 311). 
























































211 T ορ τ 924 = 9 2 13 
α τι - τ) w3 = -2 Ῥ Ξ51 -21 189, 
831 4- 2x9 4- 23 T: 3 22 
9 13 2 93 2.1 9 
D,2|-2-21|25-13, D=]|1-—21|=26, D= |1 -—2 —2| = 39. 
7 22 3 72 2 7 
: s D D D: 
The system has the unique solution x 5 1, £2 n 2, 23 n 3. 


Remark: From practical consideration the Cramer rule is not useful for higher-dimensional problems. 
As the dimension of the problem increases, the number of required operations increases very fast, so, 
for numerical solutions of linear systems of equations one uses the Gauss algorithm or pivoting or an 
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iteration procedure (see 19.1.1, p. 949). 
4.5.2.4 Gauss's Algorithm 


1. Gauss Elimination Method In order to solve the linear system of equations Ax — a (4.177a) of 
m equations with n unknowns one can use the Gauss elimination method. With the help of an equation 
one unknown is to be eliminated from all the other equations. So one gets a system of m — 1 equations 
and n — 1 unknowns. This method will be repeated until the result is a system of equations in row 
echelon form, and from this form one can determine the existence and uniqueness of the solution easily, 
and the solution itself can be found if it exists. 

2. GaussSteps The first Gauss step is to be demonstrated on the augmented matrix of coefficients 
(A, a) (see examples on p. 309): 

Supposing a1; 4 0, otherwise exchanging the first equation for another one. In the matrix 


Q11 G12 *** Gin | a1 
Q21 22 *** Aan | a2 (4 185 ) 
Gm1 Gm2 '** Gmn Gm 


the appropriate multiple of the first row is to be added to the others in order to make the coefficients of 






P P a21 a31 ami 
xı equal to zero, i.e., multiply the first row by — —, — —, ..., — —— then add them to the second, 
ai ai au 
third,..., m-th row. The transformed matrix has the form 
Q11 012 ``’ Q1n |Q1 
0 aby +++ Ahn | a 
22 2n | 42 
aoe a ala (4.185b) 
/ . / 
0 aie Ormin 
After applying this Gauss step (r — 1) times the result is a matrix in row echelon form 
Qil 0012 Q3 ... αι ει :»' in ay 
/ / / / / 
429 ing + Q2r+1 o Gon 95 
a a a n 
0 433 Ü3pyp 5500 03 a3 
" τ ` 4.186) 
(r-1) τ--1 r—1 ( 
0 αγγ --- a(r- D aD 
(r-1) 
0 0 0 0 05.41 
Q4 sic cee 0 0 id 0 al) 


3. Existence and Uniqueness of the Solution The Gauss steps are elementary row operations 
so they do not affect the rank of the matrix (A, a), consequently the existence and uniqueness of the 
solution and the solution itself do not change. Formula (4.186) implies that the following cases may 
occure concerning the solutions of the inhomogeneous linear system of equations: 


Case 1: The system has no solution if any of the numbers at D, al”, ἐν ας differs from zero. 
Case 2: The system has a solution if at? = aly? =... = a!) = (is valid. Then there are two 


Cases: 


a) r =n: The solution is unique. 
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b) r « n: The solution is not unique; n — r unknowns can be chosen as free parameters. 


If the system has a solution, then the unknowns are to be determined in a successive way starting with 
the last row of the system of equations with the matrix in row echelon form (4.186). 
WA: x 2x 3x 4y4 — —2 1 2 3 4.2 

2m, H d T P E da — 9 After three Gauss steps 0—1-2-7 6 

Sz, Az. X. 2r. — 2 the augmented matrix of 0 0.4 4.4 

ο. de L 22s + 314 — .9  cocffcients has the form 0 0 0 40—40 
The solution is unique and from the corresponding system of equations with a triangular matrix follows: 
z4 = —1, z3 = 0, x2 = 1, zı — 0. 





BÓ B: —:; — 32» — 123 — —5 —1 —3 -12|-5 
αι 2x3 5x3 — 2 After two Gauss steps the 0.5 17 7 

545 c l7r3 — 7 augmented matrix of coeffi- 00 0 0 

3x; — 2k 2x3 —  l cients has the form 00 0 0 

7a, — 4%. -  x3— 0 ο ο 00 


There is a solution but it is not unique. Choosing one unknown as a free parameter, for instance 73 = t 


T I 4 9 
(—oo « t « oo), and one gets x4 — t, 33 — 2 — —t, 1 — 2 — -t. 
5- 8 5 5 


4.5.3 Overdetermined Linear Systems of Equations 


4.5.3.1 Overdetermined Linear Systems of Equations and Linear 
Least Squares Problems 


1. Overdetermined System of Equations 
Consider the linear system of equations 

Ax=b (4.187) 
with the rectangular matrix of coefficients A = (aij) (i = 1,2,...,m; j — L,2,...,n; m > n). 
The matrix A and the vector b — (bi, 05, . .., b,,)T on the right-hand side are given, and the vector 
X = (£1, %2,..., En)” is unknown. Because m > n holds this system is called an over-determined 
system. One can tell the existence and uniqueness of the solution and sometimes also the solution, for 
instance by pivoting. 
2. Linear Least Squares Problem 
If (4.187) is the mathematical model representing a practical problem (i.e., A, b, x are reals), then 
because of measuring or other errors it is impossible to find an exact solution of (4.187) such that it sat- 


isfies all of the equations. Substituting any vector x there will be a residual vector £ = (r1, T2,- , f'm)! 
given as 
r=Ax-—b, r#0. (4.188) 


In this case x is to be determined to make the norm of the residual vector r as small as possible. Suppose 
now A, b, x are real. If considering the Euclidean norm, then 


Sor? =r'r = (Ax— b)"(Ax — b) = min (4.189) 
i=l 


must be valid, i.e., the residual sum of squares must be minimal. Gauss already had this idea. The 
formula (4.189) is called a linear least squares problem. The norm ||r|| — vrr of the residual vector r 
is called the residue. 

3. Gauss Transformation 

The vector x is the solution of (4.189) if the residual vector r is orthogonal to every column of A. That 
is: 


ATr = AT(Ax—b) =0 or AT Ax — ATb. (4.190) 





314 4. Linear Algebra 





Equation (4.190) is actually a linear system of equations with a square matrix of coefficients. One refers 
to it as the system of normal equations. It has dimension n. The transition from (4.187) to (4.190) is 
called Gauss transformation. The matrix ATA is symmetric. 

If the matrix A has the rank n (because m > n all columns of A are independent), then the matrix 
ATA is positive definite and also regular, i.e., the system of normal equations has a unique solution if 
the rank of A is equal to the number of unknowns. 


4.5.3.2 Suggestions for Numerical Solutions of Least Squares Problems 
1. Cholesky Method 


Because the matrix ATA is symmetric and positive definite in the case rank (A) = n, in order to solve 
the normal system of equations one can use the Cholesky method (see 19.2.1.2, p. 958). Unfortunately 
this algorithm is numerically fairly unstable although it works fairly well in the cases of a “big” residue 
||r|| and a “small” solution ||[x||. 

2. Householder Method 

Numerically useful procedures in order to solve the least squares problem are the orthogonalization 
methods which are based on the decomposition A = QR. Especially useful is the Householder method, 
where Q is an orthogonal matrix of size (m,m) and R is a triangular matrix of size (m,n) (see 4.1.2, 
11., p. 271). 

3. Regularized Problem 

In the case of rank deficiency, i.e., if rank (A) < n holds, then the normal system of equations no longer 
has a unique solution, and the orthogonalization method gives useless results. Then instead of (4.189) 
the so-called regularized problem is considered: 


rlr + axx = min! (4.191) 
Here a > 0 is a regularization parameter. The normal equations for (4.191) are: 
(ATA + al)x = ATb. (4.192) 


The matrix of coefficients of this linear system of equations is positive definite and regular for a > 0, 
but the appropriate choice of the regularization parameter a is a difficult problem (see [4.7]). 


4.6 Eigenvalue Problems for Matrices 


4.6.1 General Eigenvalue Problem 


Let A and B be two square matrices of size (n,n). Their elements can be real or complex numbers. 
The general eigenvalue problem is to determine the numbers λ and the corresponding vectors x Z 0 
satisfying the equation 

Ax — ABx. (4.193) 
The number 4 is called an eigenvalue, the vector x an eigenvector corresponding to A. An eigenvector 
is determined up to a constant factor, because if x is an eigenvector corresponding to À, so is cx (c = 
constant) as well. In the special case when B = I holds, where I is the unit matrix of order n, i.e., 

Ax = Ax or (A — AI)x = 0, (4.194) 
the problem is called the special eigenvalue problem. It occures very often in practical problems, espe- 
cially with a symmetric matrix A, and so it is to be discussed later in detail. More information about 
the general eigenvalue problem can be found in the literature (see [4.16]). 
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4.6.2 Special Eigenvalue Problem 
4.6.2.1 Characteristic Polynomial 


The eigenvalue equation (4.194) yields a homogeneous system of equations which has non-trivial solu- 
tions x Z 0 only if 


det (A — AT) — 0. (4.195a) 
By the expansion of det (A — AI) one gets 


dij — A aye Q13 *** Gin 
det (A — AI) = a an — À a23 ttt 028 
oc Uu Web rdg e 
= P (A) 2 (C1) a, 4A 1 4 --- E a1À 4 ag — 0. (4.195b) 


So the determination of the eigenvalues is equivalent to the solution of a polynomial equation. This 
equation is called the characteristic equation; the polynomial P,,(A) is the characteristic polynomial. Its 
roots are the eigenvalues of the matrix A. For an arbitrary square matrix A of size (n,n) the following 
statements hold: 

Case 1: The matrix A(n nm) has exactly n eigenvalues Aj, Az, ..., An, because a polynomial of degree n 
has n roots if they are considered with their multiplicity. The eigenvalues of a real symmetric matrix 
are real numbers, in other cases the eigenvalues can also be complex. 

Case 2: If all the n eigenvalues are different, then the matrix A(,,,; has exactly n linearly independent 
eigenvectors x; as the solutions of the equation system (4.194) with À — A;. 

Case 3: If \; has multiplicity n; among the eigenvalues, and the rank of the matrix Ain) — Ait is 
equal to r;, then the number of linearly independent eigenvectors corresponding to A; is equal to the 
so-called nullity n — r; of the matrix of coefficients. The inequality 1 € n — r; € n; holds, i.e., fora 
real or complex quadratic matrix A(nn) there are at least one and at most n real or complex linearly 
independent eigenvectors. 











2-3 1 2—A -3 1 
WB A: 3 1 3], det(A—AI)— 31—A 3|2—33 — A? + 2A=0. 
—5 2-4 —5 2-4-}λ 
The eigenvalues are A; = 0, Az = 1, 4 — —2. The eigenvectors are determined from the corresponding 
homogeneous linear system of equations. 
ολι -- 0: 231 -- 329 + tq = 0 
301 + fat 323 =0 
—b5z1 d 229 = 423 =; 
: I : 3 11 . 
One gets for instance by pivoting: x, arbitrary, x9 — 194» 25 = —27, + 342 = -iotr Choosing 
10 
1; — 10 the eigenvector is xı = C1 3 |, where C4 Æ 0 is an arbitrary constant. 
—11 
e \2 = 1: The corresponding homogeneous system yields: «3 is arbitrary, 72 = 0,271 = 342-413 = — T3. 
=1 
Choosing x3 = 1 the eigenvector is x2 — C5 ( o) , where C2 ¥ 0 is an arbitrary constant. 
το 1 
: : : . 4 
e \3 = —2: The corresponding homogeneous system yields: x2 is arbitrary, 71] = =%2, 73  --4πι + 


3 
7 4 
Όσο = -372 Choosing x2 = 3 the eigenvector is xg = C3 , where C3 ¥ 0 is an arbitrary 
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constant. 
30-1 ὃ-λ 0 -1 
HB: ( 14 1), det (A — AT) = 14-2 1| 2 —23 + 10A? — 321 + 32 = 0. 
Hie 3 -1 03—A 
The eigenvalues are À4 2 2, 4 2 A3 — 4. 
e A, — 2: One obtains x3 is arbitrary, v2 = —23,%1, = x3 and chooses for instance 73 = 1. So the 


1 
corresponding eigenvector is xı = C1 ( —1 |, where Ci ¥ 0 is an arbitrary constant. 


e \y = A3 = 4: One obtains x2, 73 are arbitrary, rx; = —x3. There are two linearly independent 


0 -1 
eigenvectors, e.g., for vg = 1,23 = 0 and a2 — 0, £3 = 1: x2 = C2 (1), X3 — C3 o), where 
1 
C2 z- 0, C3 Æ 0 are arbitrary constants. 


4.6.2.2 Real Symmetric Matrices, Similarity Transformations 


In the case of the special eigenvalue problem (4.194) for a real symmetric matrix A the following state- 
ments hold: 


1. Properties Concerning the Eigenvalue Problem 
1. Number of Eigenvalues The matrix A has exactly n real eigenvalues A; (i — 1,2,...,n), count- 
ing them by their multiplicity. 
2. Orthogonality ofthe Eigenvectors The eigenvectors x; and x; corresponding to different eigen- 
values A; Z Aj are orthogonal to each other, i.e., for the scalar product of x; and x; 

x} xX = (x;, Xj) =0 (4.196) 
is valid. 
3. Matrix with an Eigenvalue of Multiplicity p For an eigenvalue which has multiplicity p (A = 
Ay = Ag =... = Ap), there exist p linearly independent eigenvectors x,,X5,...,X,. Because of (4.194) 
all the non-trivial linear combinations of them are also eigenvectors corresponding to A. Using the 
Gram-Schmidt orthogonalization process one can choose p of them such that they are orthogonal to 


each other. 
Summarizing: The matrix A has exactly n real orthogonal eigenvectors. 


110 


e A; — às — —1: From the corresponding homogenous system of equations one gets: z4 is arbitrary, 
Yq is arbitrary, 73 = —x, — ra. Choosing first x1 = 1, x2 = 0 then x, — 0, za = 1 one gets the linearly 


O11 
E A= (: 0 ] , det (A — ATI) 2 —A? + 3A + 2 — 0. The eigenvalues are À; = Àz = —1 and A3 — 2. 


il 0 
independent eigenvectors x; = C ( o) and xy = Cy ( 1 , where C4 Z 0 and C$ Z 0 are arbitrary 
-1 -1 
constants. 
e \3 = 2: One gets: x, is arbitrary, rg = £1, 53 = x4, and choosing for instance x7, = 1 one gets the 
1 
eigenvector x3 = C3 t) , where C3 4 0 is an arbitrary constant. The matrix A is symmetric, so the 


eigenvectors corresponding to different eigenvalues are orthogonal. 


4. Gram-Schmidt Orthogonalization Process Let V,, be an arbitrary n-dimensional Euclidean 
vector space. Let the vectors X,,X9,.-.,X, E€ Vn be linearly independent. Then there exists an or- 
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thogonal system of vectors y ,, y ...., y, € Va which can be obtained by the vectors x; as follows: 
E (xy y) 
yo—XuyX.—Xp— 9.4 y (ο ο] (4.197) 
icl Yyy) 
Remarks: 
1. Here (Xp, y i) zR y, is the scalar product of the vectors x, and y ,. 
2. Corresponding to the orthogonal system of the vectors y,,y,,..-,y,, one gets the orthonormal 
a ν "ee Yi - Yo - Ya : 
system X,,X5,..., X, with X, — -= , Š = —2.,..., €, — .—"— , where ||ly || = ,/(y,, y,) is 
. lly II lly ol "fly all = ee 


the Euclidean norm of the vector y z 


0 1 1 
Bx,-— [) X= (0) X45 (G) From here it follows: 
1 1 0 


(κο. Υι) | 1 | lod ῃ 
PY3—-X27 $5 = -1/2 and X —|-1]: 
i EN 


1/2 
(Xa. Yi) (Xa. yj) 203 2 1 : 
Y, =X γι γο 2/3 | and &3 = —= 1]. 
(Y, Yj) (Y) zu V3 \_4 
2. Transformation of Principal Axes, Similarity Transformation 
For every real symmetric matrix A, there is an orthogonal matrix U and a diagonal matrix D such that 
A = UDUT. (4.198) 
The diagonal elements of D are the eigenvalues of A, and the columns of U are the corresponding 
normed eigenvectors. From (4.198) it is obvious that 
D = UTAU. (4.199) 
Transformation (4.199) is called the transformation of principal axes. In this way A is reduced to a 
diagonal matrix (see also 4.1.2, 2., p. 270). 
If the square matrix A (not necessarily symmetric) is transformed by a square regular matrix G such 
a way that 


GCIAGHA (4.200) 


5 








then it is called a similarity transformation. The matrices A and A are called similar and they have 
the following properties: 


1. The matrices A and A have the same eigenvalues, i.e., the similarity transformation does not affect 
the eigenvalues. 


2. If A is symmetric and G is orthogonal, then Ais symmetric, too: 
A=GTAG  wih  GTG-I (4.201) 


The relation (4.201) is called an orthogonal-similarity transformation. In this context (4.199) means 
that a real symmetric matrix A can be transformed orthogonally similar to a real diagonal form D. 


4.6.2.3 Transformation of Principal Axes of Quadratic Forms 
1. Real Quadratic Form, Definition 


A real quadratic form Q of variables 21, 13,. .., r4 has the form 


Q=) J agar; =x" Ax, (4.202) 


i=1 j=1 
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where x = (x1, £2, . - . , £n)" is the vector of real variables and the matrix A = (aj) is a real symmetric 
matrix. 

The form Q is called positive definite or negative definite, if it takes only positive or only negative values 
respectively, and it takes the zero value only in the case x; = £2 — ... — r4 — 0. 


The form Q is called positive or negative semidefinite, if it takes non-zero values only with the sign 
according to its name, but it can take the zero value for non-zero vectors, too. 
A real quadratic form is called indefinite if it takes both positive and negative values. According to the 
behavior of Q the associated real symmetric matrix A is called positive or negative definite, semidefinite 
or indefinite. 
2. Real Positive Definite Quadratic Form, Properties 
1. In a real positive definite quadratic form Q all elements of the main diagonal of the corresponding 
real symmetric matrix A are positive, i.e., 

a4 >0 (¢=1,2,...,n) (4.203) 
holds. (4.203) represents a very important property of positive definite matrices. 
2. A real quadratic form Q is positive definite if and only if all eigenvalues of the corresponding matrix 
A are positive. 
3. Suppose the rank of the matrix A corresponding to the real quadratic form Q = x? Ax is equal to 
r. Then the quadratic form can be transformed by a linear transformation 

x= Cx (4.204) 


into a sum of pure quadratic terms, into the so-called normal form 
Q=X"Kx= ype? (4.205) 


where p; — (sign À;)k; and ki, ks, .. . ‚kr are arbitrary, previously given, positive constants. 

Remark: Regardless of the non-singular transformation (4.204) that transforms the real quadratic 
form of rank r into the normal form (4.205), the number p of positive coefficients and the number 
q =r —p of negative coefficients among the p; of the normal form are invariant (the inertia theorem of 
Sylvester). The value p is called the index of inertia of the quadratic form. 

3. Generation of the Normal Form 


A practical method to use the transformation (4.205) follows from the transformation of principal axes 
(4.199). First it is to perform a rotation on the coordinate system by the orthogonal matrix U, whose 
columns are the eigenvectors of A (i.e., the directions of the axes of the new coordinate system are the 
directions of the eigenvectors). This gives the form 


= x"Lx = 2 Ad. (4.206) 

Here L is a η matrix with the eigenvalues of A in the diagonal. Then a dilatation is performed 

by the diagonal matrix D whose diagonal elements are d; — Dr The whole transformation is now 
2 


given by the matrix 
C-UD, (4.207) 
and one gets: 
Q = €" Ax — (UDX) A(UDX) - £'D'U'AUD)X 
— X" D'LDX- xl" Kx. (4.208) 
Remark: The transformation of principal axes of quadratic forms plays an essential role at the classi- 
fication of curves and surfaces of second order (see 3.5.2.11, p. 206 and 3.5.3.14, p. 228). 
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4. Jordan Normal Form 
Let A be an arbitrary real or complex (n, n) matrix. Then there exists a non-singular matrix T such 
that 

TAT =J (4.209) 
holds, where J is called the Jordan matriz or Jordan normal form of A. The Jordan matrix has a block 
diagonal structure of the form (4.210), where the elemnts J; of J are called Jordan blocks: 


Ji ^ 
J2 oO λο ο 
J= a (4.210) J d 
oO doi ο Anci 
Ji An 


(4.211) 


They have the following structure: 
1. If A has only single eigenvalues Aj, then J; = A; and k = n, i.e., J is a diagonal matrix (4.211). 


2. If Aj is an eigenvalue of multiplicity p;, then there are Xd 

one or more blocks of the form (4.212) where the sum of 2 Ad o 

the sizes of all such blocks is equal to p; and Ya pj2n g,— irr (4.212) 
holds. The exact structure of a Jordan block depends on : i X 1 ᾿ 

the structure of the elementary divisors of the character- J 

τ s : Oo A; 

istic matrix A — AI. J 


For further information see [4.15], [19.16] vol. 1. 


4.6.2.4 Suggestions for the Numerical Calculations of Eigenvalues 


1. Eigenvalues can be calculated as the roots of the characteristic equation (4.195b) (see examples on 
p. 315). In order to get them the coefficients a; (i — 0,1,2,..., n — 1) ofthe characteristic polynomial 
of the matrix A must be determined. However, one should avoid this method of calculation, because 
this procedure is extremely unstable, i.e., small changes in the coefficients a; of the polynomial result 
in big changes in the roots A;. 

2. There are many algorithms for the solution of the eigenvalue problem of symmetric matrices. Two 
types can be distinguished (see [4.7]): 

a) Transformation methods, for instance the Jacobi method, Householder tridiagonalization, QR algo- 
rithm. 

b) Iterative methods, for instance vector iteration, the Rayleigh-Ritz algorithm, inverse iteration, the 
Lanczos method, the bisection method. As an example the power method of Mises is discussed here. 
3. The Power Method of Mises Assume that A is real and symmetric and has a unique dominant 
eigenvalue. This iteration method determines this eigenvalue and the associated eigenvector. Let the 
dominant eigenvalue be denoted by Aj, that is, 


[Al > [l > |λ4| 5 --- 5 |λπ|. (4.213) 
Let Χι,Χο,...,Χῃ be the associated linearly independent eigenvectors. Then: 
1. Ax;—- Xx; (6—2,...,n). (4.214) 
2. Each element x € R” can be expressed as a linear combination of these eigenvectors x;: 
ΧΞΟΙΧΙ ΙΓ CXg+-:-+ eX, (cj const; i=1,2,...,n). (4.215) 
Multiplying both sides of (4.215) by A k times, then using (4.214) follows 
k k k k k Aa F Xn å 
A"x — cATX4 o 03A3X3 o d 64A X, — Mex + c2 (3) Xo te: +o s) x]. 


(4.216) 
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From this relation and (4.213) one can see that 
A*x : k k 
pe — xQ,ask oo, thatis A"x& cA;X4. 
Mey 
This is the basis of the following iteration procedure: 
Step 1: Select an arbitrary starting vector x € R”. 
Step 2: Iterative computation of 
xD — Ax (k= 0,1,2,...;x is given). 
From (4.218) and keeping in mind (4.217) follows: 
x) = Ary x οιλξκι. 
Step 3: From (4.218) and (4.219) it follows that 
xt) - Ax) -- Α(Α1κώ}, 
A(A"x 0) z4 A(cMx4) — e AF (Ax), 
c (M Ax) — X(e Mx) z& Mx), therefore 


x FH) ej Maxi), 


(4.217) 


(4.218) 


(4.219) 


(4.220) 


that is, for large values of k the consecutive vectors xt) and x“) differ approximately by a factor Ai. 


Step 4: Relations (4.219) and (4.220) imply for x ; and A: 
(x9 : x) 


X 8/ REED λι 56 


W For example, let 


233 115 177 1 
A- (^5 9.25 EJI x(0) = (o) 























1.77 —2.13 1.56 0 
x| xU x?» x ?| normalization x(4) x)| normalization 
1 3.23 14.89 88.27 1 7.58 67.75 1 
0 |—1.15 —18.12 —208.03 —2.36 |—24.93 —256.85 —3.79 
0 1.77 10.93 82.00 0.93 8.24 79.37 1.17 
à 9.964 10.177 
x(9) x| normalization x9 x] normalization 
9.66 96.40 1 10.09 102.33 l 
— 38.78 —394.09 —4.09 |—41.58 —422.49|| —4.129 | = x, 
11.67 117.78 1.22 | 12.38 125.73]\ 1229 
10.16 10.161 = A4 
Remarks: 


(4.221) 


1. Since eigenvectors are unique only up to a constant multiplier, it is preferable to normalize the 


vectors x) as shown in the example. 


2. The eigenvalue with the smallest absolute value and the associated eigenvector can be obtained by 
using the power method of Mises for A71. If A^! does not exist, then 0 is this eigenvalue and any vector 


from the null-space of A can be selected as an associated eigenvector. 


3. The other eigenvalues and the associated eigenvectors of A can be obtained by repeated application 
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of the following idea. Select a starting vector which is orthogonal to the known vector x,, and in this 
subspace Az becomes the dominant eigenvalue that can be obtained by using the power method. In order 
to obtain As, the starting vector has to be orthogonal to both x, and x5, and so on. This procedure is 
known as matriz deflation. 


4. Based on (4.218) the power method is sometimes called vector iteration. 


4.6.3 Singular Value Decomposition 


1. Singular Values and Singular Vectors Let A be a real matrix of size (m, n) and its rank be 
equal to r. The matrices AAT and ATA have r non-zero eigenvalues Ày, and they are the same for 
both of the matrices. The positive square roots dy = Và, (v = 1,2,...,r) of the eigenvalues A, o 
the matrix ATA are called the singular values of the matrix A. The corresponding eigenvectors u, 
of ATA are called right-singular vectors of A, the corresponding eigenvectors v, of AAT left-singular 
vectors: 


ATAu,— Xu, AATv, = àw, (v=1,2,... sr). (4.222. 
The relations between the right and left-singular vectors are: 

Au,-—d,v, Alv, —d,u,. (4.222b 
A matrix A of size (m, n) with rank r has r positive-singular values d, (v — 1,2,...,r). Thereexist r 
orthonormalized right-singular vectors u, and r orthonormalized left-singular vectors v,. Furthermore, 
there exist to the zero-singular value n — r orthonormalized right-singular vectors u,(v — r 4- 1,...,mn 
and m — r orthonormalized left-singular vectors v, (v — r 4- 1,..., m). Consequently, a matrix o 


size (m, n) has n right-singular vectors and m left-singular vectors, and two orthogonal matrices can 
be made from them (see 4.1.4, 9., p. 275): 


U = (ui, Ue,...,Un), VW = (vi, v2,---,Vm)- (4.223 





2. Singular-Value Decomposition The representation 


T TOWS 


A-—VÁU! (4224) with Â = (4.224b) 


Tn, — T TOWS 





r columns n — r columns 


is called the singular-value decomposition of the matrix A. The matrix A, as the matrix A, is of size 
(m,n) and has only zero elements except the first r diagonal elements ap, = dy, (v =1,2,...,r). The 
values d, are the singular values of A. 

Remark: If substituting A® instead of AT and consider unitary matrices U and V instead of orthogo- 
nals, then all the statements about singular-value decomposition are valid also for matrices with com- 
plex elements. 

3. Application Singular-value decomposition can be used to determine the rank of the matrix A 
of size (m,n) and to calculate an approximate solution of the over-determined system of equations 
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Ax — b (see 4.5.3.1, p. 313) with the so-called regularization method, i.e., to solve the problem 








m n 2 n 

||Ax — b|P? - a||x]|? 2 M7 p αμ — | +a) aj = min!, (4.225) 
i=1 Le=1 k=1 

where a > 0 is a regularization parameter. 


5 Algebraand Discrete Mathematics 


5.1 Logic 


5.1.1 Propositional Calculus 


1. Propositions 

A proposition is the mental reflection of a fact, expressed as a sentence in a natural or artificial language. 
Every proposition is considered to be true or false. This is the principle of two-valuedness (in contrast 
to many-valued or fuzzy logic, see 5.9.1, p. 413). "True" and "false" are called the truth value of the 
proposition and they are denoted by T (or 1) and F (or 0), respectively. The truth values can be 
considered as propositional constants. 


2. Propositional Connectives 

Propositional logic investigates the truth of compositions of propositions depending on the truth of the 
components. Only the extensions of the sentences corresponding to propositions are considered. Thus 
the truth of a composition depends only on that of the components and on the operations applied. So 
in particular, the truth of the result of the propositional operations 





"NOT A" (24), (5.1) “A AND P" (A^ B), (5.2) 

“AOR B” (AV B), (5.3) "IF A, THEN B" (A= B) (5.4) 
and 

“A IF AND ONLY IF B”(A © B) (5.5) 


are determined by the truth of the components. Here “logical OR” always means “inclusive OR”, i.e., 
“AND/OR”. In the case of implication, for A = B also the following verbal forms are in use: 


A implies B, B is necessary for A, A is sufficient for B. 
3. Truth Tables 
In propositional calculus, the propositions A and B are considered as variables (propositional variables) 


which can have only the values F and T. Then the truth tables in Table 5.1 contain the truth functions 
defining the propositional operations. 


Table 5.1 Truth tables of propositional calculus 





Negation Conjunction Disjunction Implication Equivalence 
A || 7A A|B\AAB A|B|AvB A|B|A-—B A|JB[|AeB 

F| T F| F F F| F F F| F T F| F T 

T|F F/T F F/T T F/T T F/T F 

T|F F T|F T T|F F T|F F 

ΤΙΤ T TIT T EE T ET T 


4. Formulas in Propositional Calculus 

Compound expressions (formulas) of propositional calculus can be composed from the propositional 
variables in terms of a unary operation (negation) and binary operations (conjunction, disjunction, 
implication and equivalence). These expressions, i.e., the formulas, are defined in an inductive way: 


1. Propositional variables and the constants T, F are formulas. (5.6) 
2. If A and B are formulas, then (7A) (AAB) (AVB) (A=B) (A€ B) (5.7) 
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are also formulas. 
Το simplify formulas parentheses are omitted after introducing precedence rules. In the following se- 
quence every propositional operation binds more strongly than the next one in the sequence: 


αι, AS OM η, πι 
Often the notation A instead of “~A” is used, and the symbol A is omitted. By these simplifications, 
for instance the formula ((A V (2B)) — ((A^ B) V C)) can be rewritten more briefly in the form: 
Αν B ^ ABVN C. 


5. Truth Functions 

Assigning a truth value to every propositional variable of a formula, the assignment is called an inter- 
pretation of the propositional variables. Using the definitions (truth tables) of propositional operations 
a truth value can be assigned to a formula for 
every possible interpretation of the variables. 








A | H | C | AVB | ABV C | AVB= ABVC Thus for instance the formula given above de- 
FFÍIF T F F termines a truth function of three variables (a 
FIFIT T T T Boolean function see 5.7.5, p. 413). 
FIT!F F F T W In this way, every formula with n proposi- 
FITT F T T tional variables determines an n place (or n 
ΤΙΕΙΕ T. F F ary) truth function, i.e., a function which as- 
ΤΙΕΙΤ Τ Τ Τ signs a truth value to every n tuple of truth 
ΤΙΤΙΕ T T ip values. There are 2?” n ary truth functions, 
TITT T T T in particular these are 16 binary ones. 


6. Elementary Laws in Propositional Calculus 

Two propositional formulas A and B are called logically equivalent or semantically equivalent, denoted 
by A = B, if they determine the same truth function. Consequently, the logical equivalence of propo- 
sitional formulas can be checked in terms of truth tables. So there is , e.g., AVB > ABVC = BVC, 
i.e., the formula A V B > AB V C does not in fact depend on A, as follows from its truth table above. 
In particular, there are the following elementary laws of propositional calculus: 

1. Associative Laws 


(A^B)AC—AA(BAC), (5.84) (AV B)VC=AV(BVC). (5.8b 
2. Commutative Laws 

ANB=BAA, (5.9a) AVB=BVA. (5.9b 
3. Distributive Laws 

(AV B)C = AC V BC, (5.10a) ABVC=(AVC)(BVC). (5.102 
4. Absorption Laws 

A(AV B) =A, (5.11a) AV AB — A. (5.11b 
5. Idempotence Laws 

AA — A, (5.12a) ΑΝ ΑΞ Α. (5.128 
6. Excluded Middle 

AA=F, (5.13a) Αν Α--Τ. (5.198 


7. De Morgan Rules 
AB=AVB, (5.14a) AV B = AB. (5.14b 





5.1 Logic 325 





8. Laws for T and F 


AT — A, (5.15a) ΑΝΕΞΑ, (5.15b) 

AF =F, (5.156) AVT=T, (5.15d) 

T=F, (5.15e) F=T (5.15f) 
9. Double Negation 

A=A. (5.16) 
Using the truth tables for implication and equivalence, gives the identities 

A=>B=AVB (5.17a) and AS B= ABV AB. (5.17b) 


Therefore implication and equivalence can be expressed in terms of other propositional operations. 
Laws (5.17a), (5.17b) are applied to reformulate propositional formulas. 


E The identity A V B > AB V C = B V C can be verified in the following way: A V B > AB V C = 
AVBV ABVC=ABV ABVC=ABV ABVC=(AVA)BVC=TBVC=BVC. 


10. Further Transformations 
A(AV B) = AB, (5.18a) AVAB=AVB, (5.18b) 


(AVC)(BV C)(AV B) = (AV C)(BV OC), (5.189) ACV BCV AB = ACV BC. (5.184) 


11. NAND Function and NOR Function As it is known, every propositional formula determines 
a truth function. Checking the following converse of this statement: Every truth function can be rep- 
resented as a truth table of a suitable formula in propositional logic. Because of (5.17a) and (5.17b) 
implication and equivalence can be eliminated from formulas (see also 5.7, p. 395). This fact and the 
De Morgan rules (5.14a) and (5.14b) imply that one can express every formula, therefore every truth 
function, in terms of negation and disjunction only, or in terms of negation and conjunction. There are 
two further binary truth functions of two variables which are suitable to express all the truth functions. 
They are called the NAND function or Shef- 
.3 NOR function fer function (notation “|”) and the NOR 
ALB function or Peirce function (notation *.|"), 
with the truth tables given in Tables 5.2 
and 5.3. Comparison of the truth tables 
for these operations with the truth tables 
of conjunction and disjunction makes the 
terminologies NAND function (NOT AND) 
and NOR function (NOT OR) clear. 


Table 5.2 NAND function Table 


A|B A 





5 5 
B B 
Ε Aly F T 
T| T T F 
F T F F 
T F T F 


HHH j 
HA τὴ 


7. Tautologies, Inferences in Mathematics 

A formula in propositional calculus is called a tautology if the value of its truth function is identically 
the value T. Consequently, two formulas A and B are called logically equivalent if the formula A <= B 
is a tautology. Laws of propositional calculus often reflect inference methods used in mathematics. As 
an example, consider the law of contraposition, i.e., the tautology 


A—-BeB-A. (5.194) 
This law, which also has the form 
A>B=B=A, (5.19b) 


can be interpreted in this way: To show that B is a consequence of A is the same as showing that A is a 
consequence of B. The Indirect proof (see also 1.1.2.2, p. 5) is based on the following principle: To show 
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that B is a consequence of A, one supposes B to be false, and under the assumption that A is true, one 
derives a contradiction. This principle can be formalized in propositional calculus in several ways: 


ΑΞ Β-- ΑΡ - Α (5.20a) or AS>B=AB>B or (5.20b) 


A=>B=AB=SF. (5.20c) 


5.1.2 Formulas in Predicate Calculus 


For developing the logical foundations of mathematics one needs a logic which has a stronger expressive 
power than propositional calculus. To describe the properties of most of the objects in mathematics 
and the relations between these objects the predicate calculus is needed. 

1. Predicates 

The objects to be investigated are included into a set, i.e., into the domain X of individuals (or uni- 
verse), e.g., this domain could be the set N of the natural numbers. The properties of the individuals, as, 
e.g., “ nis a prime ", and the relations between individuals, e.g., “ m is smaller than n ” , are considered 
as predicates. An n place predicate over the domain X of individual is an assignment P: X^ — {F,W}, 
which assigns a truth value to every n tuple of the individuals. So the predicates introduced above on 
natural numbers are a one-place (or unary) predicate and a two-place (or binary) predicate. 

2. Quantifiers 

A characteristic feature of predicate logic is the use of quantifiers, i.e., that of a universal quantifier or 
“for every” quantifier Y and existential quantifier or “for some” quantifier 4. If P is a unary predicate, 
then the sentence “P() is true for every x in X” is denoted by V x P(x) and the sentence“ There exists 
an zin X for which P(x) is true "is denoted by 3x P(x). Applying a quantifier to the unary predicate 
P, gives a sentence. If for instance IN is the domain of individual of the natural numbers and P denotes 
the (unary) predicate “n is a prime”, then Yn P(n) is a false sentence and 3n P(n) is a true sentence. 
3. Formulas in Predicate Calculus 

The formulas in predicate calculus are defined in an inductive way: 


1. If £1,... , &n are individual variables (variables running over the domain of individual variables) and 
P is an n-place predicate symbol, then 

P(zx1,..., £n) is a formula (elementary formula). (5.21) 
2. If A and B are formulas, then 

(^4), (A^ B), (AV B), (A B), (A & B), (Yz A) and (3z A) (5.22) 


are also formulas. 

Considering a propositional variable to be a null-place predicate, the propositional calculus can be 
considered as a part of predicate calculus. An occurrence of an individual variable x is bound in a 
formula if x is a variable in V x or in dx or the occurrence of x is in the scope of these types of quantifiers; 
otherwise an occurrence of x is free in this formula. A formula of predicate logic which does not contain 
any free occurrences of individual variables is called a closed formula. 

4. Interpretation of Predicate Calculus Formulas 

An interpretation of predicate calculus is a pair of 

e a set (domain of individuals) and 

e an assignment, which assigns an n-place predicate to every n-ary predicate symbol. 

For every prefixed value of free variables the concept of the truth evaluation of a formula is similar to 
the propositional case. The truth value of a closed formula is T or F. In the case of a formula containing 
free variables, one can associate the values of individuals for which the truth evaluation of the formula 
is true; these values constitute a relation (see 5.2.3, 1., p. 331) on the universe (domain of individuals). 
W Let P denote the two-place relation € on the domain IN of individuals, where IN is the set of the 
natural numbers then 
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e P(x,y) characterizes the set of all the pairs (c, y) of natural numbers with x X y (two-place or binary 
relation on IN); here z, y are free variables; 

e Vy P(x,y) characterizes the subset of N (unary relation) consisting of the element 0 only; here x is 
a free variable, y is a bound variable; 

e Jr Vy P(x, y) corresponds to the sentence “ There is a smallest natural number ”; the truth value is 
true; here x and y are bound variables. 


5. Logically Valid Formulas 
A formula is called logically valid (or a tautology) if it is true for every interpretation. The negation o 
formulas is characterized by the identities below: 


-VrP(r)-—-3r-P(x) or —-dzP(r)-—-Vzr-P(x). (5.23 
Using (5.23) the quantifiers Y and 3 can be expressed in terms of each other: 
VrP(r)—-3zrAP(r) or dz P(x)-—-Vzr-P(x). (5.24 


Further identities of the predicate calculus are: 
VrVyP(vr,y)—VyNz P(v,y), ( 
3rdyP(xr,y) —-3y3x P(x, y), ( 

Vx P(r) AVx Q(v) 2 Vr (P(r) ^ Q(x)), (5.27 
3r P(x) V 3x Q(x) 2 3x (P(x) V Q(x)). ( 


The following implications are also valid: 


Va P(x) V Vx Q(x) > Va (P(x) V Ω(α)), (5.29 
Jz (P(x) ^ Q(z)) 2: 3x P(x) ^3x Q(x), (5.30 
Va (P(x) > Q(2)) > (Va P(x) 2 Vx Q(x)), (5.31 
Va (P(x) &Q(z)) > (Va P(x) 6 Vz Q(2)), (5.32 
daVy P(x,y) > Vydsu P(a,y). (5.33 





The converses of these implications are not valid, in particular, one has to be careful with the fact tha: 
the quantifiers V and 3 do not commute (the converse of the last implication is false). 

6. Restricted Quantification 

Often it is useful to restrict quantification to a subset of a given set. So, there is considered 





Va eX P(x) asashortnotation of. Vr(rc X-— P(x)) and (5.34 
3r c X P(r) asashort notation of. 3x (x € X ^ P(x)). (5.35 


5.2 Set Theory 


5.2.1 Concept of Set, Special Sets 


The founder of set theory is Georg Cantor (1845-1918). The importance of the notion introduced by 
him became well known only later. Set theory has a decisive role in all branches of mathematics, and 
today it is an essential tool of mathematics and its applications. 


1. Membership Relation 

1. Setsandtheir Elements The fundamental notion of set theory is the membership relation. A 
set A is a collection of certain different things a (objects, ideas, etc.) that belong together for certain 
reasons. These objects are called the elements of the set. One writes “a € A” or “a ¢ A” to denote “a 
is an element of A” or “a is not an element of A”, respectively. Sets can be given by enumerating their 
elements in braces, e.g., M = {a,b,c} or U = {1,3,5,...}, or by a defining property possessed exactly 
by the elements of the set. For instance the set U of the odd natural numbers is defined and denoted by 
U = {x | xis an odd natural number}. For number domains the following notation is generally used: 
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N = {0,1,2,...} set of the natural numbers, 

Z ={0,1,-1,2,—-2,...} set of the integers, 

Q= E | pqceZ^qz o} set of the rational numbers, 
q 

R set of the real numbers, 

C set of the complex numbers. 


2. Principle of Extensionality for Sets Two sets A and B are identical if and only if they have 
exactly the same elements, i.e., 
A=BeVa(teEASsre B). (5.36) 
W The sets (3,1,3, 7, 2) and (1,2,3, 7) are the same. 
A set contains every element only “once”, even if it is enumerated several times. 
2. Subsets 
1. Subset If A and B are sets and 
Vr (r€A— vc B) (5.37) 
holds, then A is called a subset of B, and this is denoted by A C B. In other words: A is a subset of D, 
if all elements of A also belong to B. If for A C B there are some further elements in B such that they 
are not in A, then A is called a proper subset of B, and it is denoted by A C B (Fig. 5.1). Obviously, 
every set is a subset of itself A C A. 
@ Suppose A = {2,4,6,8,10} is a set of even numbers and B = {1,2,3,...,10} is a set of natural 
numbers. Since the set A does not contain odd numbers, A is a proper subset of B. 
2. Empty Set or Void Set It is important and useful to introduce the notion of empty set or void 
set, 0, which has no element. Because of the principle of extensionality, there exists only one empty set. 
B A: The set (z|z € IRA 2? -- 2x - 2 = 0} is empty. 
W B: () C M for every set M, i.e., the empty set is a subset of every set M. 
For a set A the empty set and A itself are called the trivial subsets of A. 
3. Equality of Sets Two sets are equal if and only if both are subsets of each other: 
A=BSACBABCA. (5.38) 
This fact is very often used to prove that two sets are identical. 
4. Power Set The set of all subsets A of a set M is called the power set of M and it is denoted by 
P(M),ie,P(M)- {A| AC M}. 
Wi For the set M = {a,b,c} the power set is 
P(M) — (0, (a), {b}, {c}, {a, b}, {a, ch, {b,c}, {a, b, ch}. 
It is true that: 
a) If aset M has m elements, its power set P(M) has 2” elements. 
b) For every set M there are M, Ø € P(M), i.e., M itself and the empty set are elements of the power 
set of M. 
5. Cardinal number The number of elements of a finite set M is called the cardinal number of M 


and it is denoted by card M or sometimes by |M |. 
For the the cardinal number of sets with infinitely many elements see 5.2.5, p. 335. 


5.2.2 Operations with Sets 


1. Venn diagram 

The graphical representations of sets and set operations are the so-called Venn diagrams, when repre- 
senting sets by plane figures. So, Fig. 5.1, represents the subset relation A C B. 

2. Union, Intersection, Complement 

By set operations new sets can be formed from the given sets in different ways: 
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Figure 5.1 Figure 5.2 Figure 5.3 





1. Union Let A and B be two sets. The union set or the union (denoted by AU B) is defined by 
AUB={r|zreAVzEe B}, (5.39) 
in words “A union B” or “A cup B”. If A and B are given by the properties E, and Ex respectively, the 


union set AU B has the elements possessing at least one of these properties, i.e., the elements belonging 
to at least one of the sets. In Fig. 5.2 the union set is represented by the shaded region. 
E {1,2,3} U {2,3,5,6} = {1, 2,3,5,6}. 
2. Intersection Let Aand B be two sets. The intersection set, intersection, cutor cut set (denoted 
by AN B) is defined by 
AnB-íz|rcA^zrcB] (5.40) 
in words “A intersected by B” or “A cap B”. If A and B are given by the properties E, and E> respec- 
tively, the intersection AN B has the elements possessing both properties Æ, and E3, i.e., the elements 
belonging to both sets. In Fig. 5.3 the intersection is represented by the shaded region. 
W With the intersection of the sets of divisors T (a) and T (b) of two numbers a and b one can define the 
greatest common divisor (see 5.4.1.4, p. 373). Fora — 12 and b — 18 holds T (a) = {1, 2, 3, 4, 6, 12} and 
T(b) = {1,2,3,6,9, 18}, so T(12) N T(18) contains the common divisors, and the greatest common 
divisor is g.c.d. (12,18) = 6. 
3. Disjoint Sets Two sets A and B are called disjoint if they have no common element; for them 
ANB=0 (5.41) 


holds, i.e., their intersection is the empty set. 





E The set of odd numbers and the set of even numbers are disjoint; their intersection is the empty set, 
i.e., 
{odd numbers} N {even numbers} = 0. 


4. Complement Considering only the subsets of a given set M, then the complementary set or the 
complement Cy (A) of A with respect to M contains all the elements of M not belonging to A: 


Cyu(A) 2 (r| ve M^x 4 A), (5.42) 
in words “complement of A with respect to M”, and M is called the fundamental set or sometimes the 
universal set. Ifthe fundamental set M is obvious from the considered problem, then the notation A 


is also used for the complementary set. In Fig. 5.4 the complement A is represented by the shaded 
region. 





M 


J] >) EDD 


Figure 5.4 Figure 5.5 Figure 5.6 











3. Fundamental Laws of Set Algebra 
These set operations have analoguous properties to the operations in logic. The fundamental laws of 
set algebra are: 
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1. Associative Laws 














(An B)n C — An (Bn C), 5.43) (AU B)UC — AU(BUC). (5.44) 
2. Commutative Laws 

ANB=BNA, 5.45) AUB=BUA. (5.46) 
3. Distributive Laws 

(AU B)nC —(AnC)U(BnC), (5.47) (An B)JUC — (AUC)n (BUC). (5.48) 
4. Absorption Laws 

AN(AUB) =A, 5.49) AU(ANB)=A. (5.50) 
5. Idempotence Laws 

ANA=A, 5.51) AUA=A. (5.52 
6. De Morgan Laws 

AnB-AUB, 5.53) AUB=ANB. (5.54 
7. Some Further Laws 

An4A - f, 5.55) AUA-M (M fundamentalset), (5.56 

ANM=A, 5.57) AU — A, (5.58 

An — 0, 5.59) AUM=M, (5.60 

M - $, 5.61) 0 — M. (5.62 

A=A. 5.63) 
This table can also be obtained from the fundamental laws of propositional calculus (see 5.1.1, p. 323 





using the following substitutions: A by N, V by U, T by M, and F by Ø. This coincidence is not acci- 
dental; it will be discussed in 5.7, p. 395. 
4. Further Set Operations 
In addition to the operations defined above there are defined some further operations between two sets 
A and B: the difference set or difference A \ B, the symmetric difference AAB and the Cartesian 
product A x B. 
1. Difference of Two Sets The set of the elements of A, not belonging to B is the difference set or 
difference of A and B: 

A\ B={x|ceE Adz ¢ B}. (5.644) 
If A is defined by the property E; and B by the property E», then AX B contains the elements having 
the property £ but not having property E». 
In Fig. 5.5 the difference is represented by the shaded region. 
m {1,2,3,4} \ {3,4,5} = {1,2}. 
2. Symmetric Difference of Two Sets The symmetric difference AAB is the set of all elements 
belonging to exactly one of the sets A and B: 

AAB={x|(cE AAc€ B)V(teE BAxc¢ A)}. (5.64b) 
It follows from the definition that 

AAB = (A\ B)U(B\ A) = (AUB) \ (ANB), (5.64c) 
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i.e., the symmetric difference contains the elements which have exactly one of the defining properties 


E; (for A) and Es (for B). 

In Fig. 5.6 the symmetric difference is represented by the shaded region. 

E {1, 2,3, 4}A{3, 4,5} = {1,2,5}. 

3. Cartesian Product of Two Sets The Cartesian product of two sets A x B is defined by 


Ax B= {(a,b)|ae ANbE B}. (5.65a) 
The elements (a, b) of A x B are called ordered pairs and they are characterized by 

(a,b) = (c,d) & a =c^b=d. (5.65b) 
'The number of the elements of a Cartesian product of two finite sets is equal to 


card (A x B) — (card A)(cardB). (5.65c 
WB A: For A = {1,2,3} and B = {2,3} one gets A x B = {(1,2), (1,3), (2, 2), (2, 3), (3, 2), (3, 3) 
and B x A = {(2,1), (2,2), (2,3), (3, 1), (3, 2), (3, 3)} with cardA = 3, cardB = 2, card(A x B) 
card(B x A) = 6. 
ΒΒ: Every point of the x, y plane can be defined with the Cartesian product R x R, (R is the set of 
real numbers). The set of the coordinates x, y is represented by R x R, so: 

R? =R x R= {(xz,y)| x € R,y € R}. 
4. Cartesian Product of n Sets 


vwu 


From n elements, by fixing an order of sequence (first element, second element, ..., n-th element) an 
ordered n tuple is defined. If a; € A; (i — 1,2,...,n) are the elements, the n tuple is denoted by 
(a1, a2,..., an), Where a; is called the i-th component. 
For n = 3, 4, 5 these n tuples are called triples, quadruples, and quintuples. 
The Cartesian product of n terms A; X Az X --- X An is the set of all ordered n tuples (a1, a2, . . . , an) 
with a; € A; : 

Ay x... An = {ίαι,... απ) | a; E€ Ai (69 1,...,m)). (5.66a) 
If every A; is a finite set, the number of ordered n tuples is 

card(A, x Ag X +- X An) = cardA; card Ag --+cardAn. (5.66b) 


Remark: The n times Cartesian product of a set A with itself is denoted by A". 
5.2.3 Relations and Mappings 


1. nary Relations 
Relations define correspondences between the elements of one or different sets. An n ary relation or 
n-place relation R between the sets A1,..., A, is a subset of the Cartesian product of these sets, i.e., 
RC A x... X Απ. If the sets Aj, i = 1,...,n, are all the same set A, then R C A” holds and it is 
called an n ary relation in the set A. 
2. Binary Relations 
1. Notion of Binary Relations of a Set The two-place (binary) relations in a set have special 
importance. 
In the case of a binary relation the notation a.b is also very common instead of (a,b) € R. 
Β Asan example, the divisibility relation in the set A = {1,2,3,4} is considered, i.e., the binary 
relation 

T — {(a,b) |a,b€ AA aisa divisor of b} (5.67a) 

= {(1,1), (1, 2), (1, 3), (1, 4), (2, 2), (2, 4), (3, 3), (4, 4)}. (5.67b) 


2. Arrow Diagram or Mapping Function Finite binary relations R in aset A can be represented 
by arrow functions or arrow diagrams or by relation matrices. The elements of A are represented as 
points of the plane and an arrow goes from a to b if aRb holds. 

Fig. 5.7 shows the arrow diagram of the relation T in A = {1, 2,3, 4}. 
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3. Relation Matrix The elements of A are used as row and column entries of a matrix (see 4.1.1, 
1., p. 269). At the intersection point of the row of a € A with the column of b € B there is an entry 
1 if aRb holds, otherwise there is an entry 0. The above scheme shows the relation matrix for T' in 
A = {1,2,3,4}. 
3. Relation Product, Inverse Relation 
Relations are special sets, so the usual set operations (see 5.2.2, p. 328) can be performed between 
relations. Besides them, for binary relations, the relation product and the inverse relation also have 
special importance. 
Let R C A x B and S C B x C be two binary relations. The product Ro S of the relations R, S is 
defined by 

RoS — ((a,c) | 3b(b€ B^aRb^bsSc)). (5.68) 
The relation product is associative, but not commutative. 


The inverse relation R7! of a relation R is defined by 


R™ = {(b,a) | (a,b) € R}. (5.69) 
For binary relations in a set A the following relations are valid: 

(RUS)oT —(RoT)U(SoT), (5.70a) (RnS)oT C(RoT)n(SoT), (5.70b) 

(RUS) =R TUS! (5.70c) (RnS)!-mR!nsS-, (5.70d) 

(RoS)!—SoR. (5.700) 


4. Properties of Binary Relations 


A binary relation in a set A can have special important properties: 
Ris called 


reflexive, ifVa € AaRa, (5.71a) 
irreflexive, if Va € A-aRa, (5.71b) 
symmetric, if Va,b € A (aRb = bRa), (5.71c) 
antisymmetric, if Va,b € A(aRb A bRa => a= bd), (5.71d) 
transitive, if Va,b,c € A(aRbA bRc = aRc), (5.71e) 
linear, if Va,b € A (a Rb V bRa). (5.71f) 


These relations can also be described by the relation product. For instance: a binary relation is transi- 
tive if Ro R C R holds. Especially interesting is the transitive closure tra( R) of a relation R. It is the 
smallest (with respect to the subset relation) transitive relation which contains R. In fact 
tra(R) = U R” = FURURU., (5.72) 
n21 
where R” is the n times relation product of R with itself. 
E Let a binary relation R on the set {1, 2,3, 4,5} be given by its relation matrix M: 
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M?|12 34 5 
1:1 10 1 1 
21061001 
510 1101 
4 JO 1010 
5|000 10 





Calculating M? by matrix multiplication where the values 0 and 1 are treated as truth values and 
instead of multiplication and addition one performs the logical operations conjunction and disjunction, 
then, M? is the relation matrix belonging to R?. The relation matrices of R?, R* etc. can be calculated 
similarly. 
* The relation matrix of RU R? U R? (the matrix on the left) can 

MVM?VM?|1 2 4 5 be get by calculating the disjunction elementwise of the matri- 
ces M, M? and M?. Since the higher powers of M contains no 
new 1-s, this matrix already coincides with the relation matrix 
of tra(R). 
The relation matrix and relation product have important ap- 
plications in search of path length in graph theory (see 5.8.2.1, 
p. 404). 
In the case of finite binary relations, one can easily recognize the above properties from the arrow 
diagrams or from the relation matrices. For instance one can recognize the reflexivity from “self-loops” 
in the arrow diagram, and from the main diagonal elements 1 in the relation matrix. Symmetry is 
obvious in the arrow diagram if to every arrow there belongs another one in the opposite direction, or 
if the relation matrix is a symmetric matrix (see 5.2.3, 2., p. 331). Easy to see from the arrow diagram 
or from the relation matrix that the divisibility T is a reflexive but not symmetric relation. 





5. Mappings 

A mapping or function f (see 2.1.1.1, p. 48) from a set A to a set B with the notation f: A > Bisa 
rule to assign to every element a € A exactly one element b € B, which is called f (a). 

A mapping f can be considered as a subset of A x B and so as a binary relation: 


f —((a.f(a))a e A} C Ax B. (5.73) 


a) f is called a injective or one to one mapping, if to every b € B at most one a € A with f(a) = b 
exists. 

b) f is called a surjective mapping from A to B, if to every b € B at least one a € A with f(a) = b 
exists. 

c) f is called bijective, if f is both injective and surjective. 

If A and B are finite sets, between which exists a bijective mapping, then A and B possess the same 
number of elements (see also 5.2.5, p. 335). 

For a bijective mapping f: A — B exists the inverse relation f^: B — A, the so-called inverse 
mapping of f. 

The relation product of mappings is used for the one after the other composition of mappings: If f: 
A — B and g: B — C are mappings, then f o g is also a mapping from A to C, and is defined by 


(f o g)(a) — g(f(a)) - (5.74) 


Remark: Becareful with the order of f and g in this equation (it is treated differently in the literature!). 


5.2.4 Equivalence and Order Relations 


The most important classes of binary relations with respect to a set A are the equivalence and order 
relations. 
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1. Equivalence Relations 

A binary relation R with respect to a set A is called an equivalence relation if R is reflexive, symmetric, 

and transitive. For aRb also the notations a ~p b or a ~ b are used, if the equivalence relation R is 

already known, in words a is equivalent to b (with respect to R). 

Examples of Equivalence Relations: 

E A: A= Z, m €N \ {0}. a ~p b holds exactly if a and b have the same remainder when divided by 

m (they are congruent modulo m). 

W B: Equality relation in different domains, e.g., in the set Q of rational numbers: n E = © MQ = 
1 2 

poqdi (pa, po, di, q2 integer; qı, q2 # 0), where the first equality sign defines an equality in Q, while the 

second one denotes an equality in Z. 

W C: Similarity or congruence of geometric figures. 

W D: Logical equivalence of expressions of propositional calculus (see 5.1.1, 6., p. 324). 

2. Equivalence Classes, Partitions 

1. Equivalence Classes An equivalence relation in a set A defines a partition of A into non-empty 

pairwise disjoint subsets, into equivalence classes. 


(a]g :— (b | b€ A^a gb) (5.75) 
is called an equivalence class of a with respect to R. For equivalence classes the following is valid: 

(a]g Z 0, agb [a]g—|b]]g, and a-mgb- α]ηΏ [η -- 0. (5.76) 
These equivalence classes form a new set, the quotient set A/R: 

A/R = {{alr | a € A}. (5.77) 
A subset Z C P(A) of the power set P(A) is called a partition of A if 

04 Z, X, YeZ^Xg2YxnY-0, UXH=A. (5.78) 

ΧεΖ 


2. Decomposition Theorem Every equivalence relation R in a set A defines a partition Z of A, 

namely Z = A/R. Conversely, every partition Z of a set A defines an equivalence relation R in A: 
angb&3XeceZļlaceX^abex). (5.79) 

An equivalence relation in a set A can be considered as a generalization of the equality, where “ in- 

significant ” properties of the elements of A are neglected, and the elements, which do not differ with 

respect to a certain property, belong to the same equivalence class. 

3. Ordering Relations 

A binary relation R in aset A is called a partial ordering if R is reflexive, antisymmetric, and transitive. 

If in addition R is linear, then R is called a linear ordering or a chain. The set A is called ordered or 

linearly ordered by R. In a linearly ordered set any two elements are comparable. Instead of aRb also 

the notation a <p b or a < b is used, if the ordering relation R is known from the problem. 

Examples of Ordering Relations: 

W A: The sets of numbers N, Z, Q, R are completely ordered by the usual < relation. 

E B: The subset relation is also an ordering, but only a partial ordering. 

W C: The lexicographical order of the English words is a chain. 

Remark: If Z — (A, B) is a partition of Q with the property a € AA b € B > a < b, then (A, B) is 

called a Dedekind cut. If neither A has a greatest element nor B has a smallest element, so an irrational 

number is uniquely determined by this cut. Besides the nest of intervals (see 1.1.1.2, p. 2) the notion 

of Dedekind cuts is another way to introduce irrational numbers. 

4. Hasse Diagram 

Finite ordered sets can be represented by the Hasse diagram: Let an ordering relation < be given on a 

finite set A. The elements of A are represented as points of the plane, where the point b € A is placed 
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above the point a € Aifa « b holds. If there is no c € A for which a « c < b, one says a and b are 
neighbors or consecutive members. Then one connects a and b by a line segment. 


A Hasse diagram is a “simplified” arrow diagram, where all the loops, arrow- 4ο 
heads, and the arrows following from the transitivity of the relation are elimi- 
nated. The arrow diagram of the divisibility relation T of the set A = {1, 2, 3, 4} 
is given in Fig. 5.7. T also denotes an ordering relation, which is represented 2. 03 
by the Hasse diagram in Fig. 5.8. 


5.2.5 Cardinality of Sets 


In 5.2.1, p. 327 the number of elements of a finite set was called the cardinality 
of the set. This notion of cardinality can be extended to infinite sets. Figure 5.8 








le 


1. Cardinal Numbers 

Two sets A and B are called equinumerous if there is a bijective mapping between them. To every set A 
a cardinal number |A| or card A is assigned, so that equinumerous sets have the same cardinal number. 
A set and its power set are never equinumerous, so no “ greatest " cardinal number exists. 

2. Infinite Sets 

Infinite sets can be characterized by the property that they have proper subsets equinumerous to the 
set itself. The "smallest" infinite cardinal number is the cardinal number of the set IN of the natural 
numbers. This is denoted by No (aleph 0). 





A set is called enumerable or countable if it is equinumerous to N. This means that its elements can be 
enumerated or written as an infinite sequence a1, d2, . . .. 

A set is called non-countable if it is infinite but it is not equinumerous to IN. Consequently every infinite 
set which is not enumerable is non-countable. 

W A: The set Z of integers and the set Q of the rational numbers are countable sets. 

E B: The set R of the real numbers and the set C of the complex numbers are non-countable sets. 


These sets are equinumerous to P(N), the power set of the natural numbers, and their cardinality is 
called the continuum. 


5.3 Classical Algebraic Structures 
5.3.1 Operations 


1. nary Operations 

The notion of structure has a central role in mathematics and its applications. Next to investigate are 
algebraic structures, i.e., sets on which operations are defined. An n ary operation p on a set A is a 
mapping y: A” — A, which assigns an element of A to every n tuple of elements of A. 

2. Properties of Binary Operations 

Especially important is the case n = 2, which is called a binary operation, e.g., addition and multipli- 
cation of numbers or matrices, or union and intersection of sets. A binary operation can be considered 
as a mapping * : A x A —> A, where instead of the notation **(a, b)" in this chapter mostly the infir 
form “ax b” will be used. A binary operation x in A is called associative if 


(a b) « c — as (bx c), (5.80) 
and commutative if 
axb=bx*a (5.81) 


holds for every a,b,c € A. 
An element e € A is called a neutral element with respect to a binary operation x in A if 


a*e-—exa-a holds for every a€ A. (5.82) 
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3. Exterior Operations 
Sometimes exterior operations are to be considered. That are the mappings from K x A to K, where 
K is an “exterior” and mostly already structured set (see 5.3.8, p. 365). 


5.3.2 Semigroups 

The most frequently occurring algebraic structures have their own names. A set H having one associa- 
tive binary operation * , is called a semigroup. The notation: is H — (H,x). 

Examples of Semigroups: 

W A: Number domains with respect to addition or multiplication. 

E B: Power sets with respect to union or intersection. 

W C: Matrices with respect to addition or multiplication. 

WB D: The set A* of all * words " (strings) over an * alphabet " A with respect to concatenation (free 
semigroup). 

Remark: Except for multiplication of matrices and concatenation of words, all operations in these 
examples are also commutative; in this case one talks about a commutative semigroup. 


5.3.3 Groups 
5.3.3.1 Definition and Basic Properties 


1. Definition, Abelian Group 
A set G with a binary operation x is called a group if 
e x is associative, 
e + has a neutral element e, and for every element a € G there exists an inverse element a such that 

axa! =a xa = e. (5.83) 
A group is a special semigroup. 
The neutral element of a group is unique, i.e., there exists only one. Furthermore, every element of the 
group has exactly one inverse. If the operation * is commutative, then the group is called an Abelian 
group. If the group operation is written as addition, +, then the neutral element is denoted by 0 and 
the inverse of an element a by —a. 
The number of elements of a finite group is called the order of the group (see 5.3.3.2,3., p. 338). 
Examples of Groups: 
W A: The number of domains (except IN) with respect to addition. 
B B: QV(0), RV (0), and CV (0) with respect to multiplication. 
MC: Sv :={f:M— MA fbijective} with respect to composition of mappings. This group is called 
symmetric. If M is finite having n elements, then Sp is written instead of Sm. Sn has n! elements. 
The symmetric group 5S, and its subgroups are called permutation groups. So, the dieder groups Dn 
are permutation groups and subgroups of Sp. 
W D: The set D, of all covering transformations of a regular n-gon in the plane is considered. Here a 
covering transformation is the transition between two symmetric positions of the n-gon, i.e., the moving 
of the n-gon into a superposable position. Denoting by d a rotation by the angle 27/n and by o the 
reflection with respect to an axis, then D, has 2n elements: 

D, — fed, d?,..., d"-!, o, do, ... , d" 1g). 
With respect to the composition of mappings D, is a group, the dihedral group. Here the equalities 
d” = o? = e and od = dto hold. 
E E: All the regular matrices (see 4.1.4, p. 272) over the real or complex numbers with respect to 
multiplication. 
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Remark: Matrices have a very important role in applications, especially in representation of linear 
transformations. Linear transformations can be classified by matrix groups. 

2. Group Tables or Cayley’s Tables 

For the representation of finite groups Cayley’s tables or group tables are used: The elements of the 
group are denoted at the row and column headings. The element a x b is the intersection of the row of 
the element a and the column of the element b. 

W if MV = {1,2,3}, then the symmetric group Sm is also denoted by S3. S3 consists of all the bijective 
mappings (permutations) of the set {1, 2, 3} and consequently it has 3! = 6 elements (see 16.1.1, p. 805). 
Permutations are mostly represented in two rows, where in the first row there are the elements of M 
and under each of them there is its image. So one gets the six elements of S3 as follows: 


Fira g | f1238 23 
*-1123/» 9711325)» ?7132 1; 
| /12 3 | [i238 | [i123 
πι hasl an hal 


With the successive application of these mappings (binary operations) the following group table is 
obtained for δη: 


(5.84) 


e From the group table it can be seen that the identity per- 


ο ε P 
ο ος mutation e is the neutral element of the group. 





€ pi pa ps pa ps e In the group table every element appears exactly once in 
Pı E Ps P4 P3 p2 (5.85) every row and in every column. 

papat Ps Pi Ps : e It is easy to recognize the inverse of any group element in 
Pa Ps pa E pop the table, i.e., the inverse of p4 in 53 is the permutation ps, 
P € P 2 P3 Pı Ps € because at the intersection of the row of p4 with the column 
Ps P3 Pı P2 € pa of ps is the neutral element e. 


e If the group operation is commutative (Abelian group), then the table is symmetric with respect to 
the “main diagonal”; S3 is not commutative, since, e.g., p1 o po Æ p2 o pi. 
e The associative property cannot be easily recognized from the table. 


5.3.3.2 Subgroups and Direct Products 


1. Subgroups 
Let G — (G, x) bea group and U C G. If U is also a group with respect to *, then U = (U, x) is called 
a subgroup of G. 
A non-empty subset U of a group (G, *) is a subgroup of G if and only if for every a, b € U, the elements 
a* banda! are also in U (subgroup criterion). 
1. Cyclic Subgroups The group G itself and E = {e} are subgroups of G, the so-called trivial 
subgroups. Furthermore, a subgroup corresponds to every element a € G, the so-called cyclic subgroup 
generated by a: 

«a»-(..,a?,a,ea,d),...). (5.86) 
If the group operation is addition, then one writes the integer multiple ka as a shorthand notation of 
the k times addition of a with itself instead of the power a^, i.e., as a shorthand notation of the k times 
operation of a by itself, 

«a » — (...,(-2)a, —a,0,a,2a,...). (5.87) 
Here < a > is the smallest subgroup of G containing a. If < a > = G holds for an element a of G, 
then G is called cyclic. 
There are infinite cyclic groups, e.g., Z with respect to addition, and finite cyclic groups, e.g., the set 
Zm the residue class modulo m with residue class addition (see 5.4.3, 3., p. 377). 
W if the number of elements of a finite G group is a prime, then G is always cyclic. 
2. Generalization The notion of cyclic groups can be generalized as follows: If M is a non-empty 
subset of a group G, then the subgroup of G whose elements can be written in the form of a product 
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of finitely many elements of M and their inverses, is denoted by < M >. The subset M is called the 
system of generators of < M >. If M contains only one element, then < M > is cyclic. 
3. Order ofa Group, Left and Right Cosets In group theory the number of elements of a finite 
group is denoted by ord G. If the cyclic subgroup < a > generated by one element a is finite, then this 
order is also called the order of the element a, i.e., ord < a >= orda. 
If U is a subgroup of a group (G, *) and a € G, then the subsets 

aU :— (as u|u € U) and. Ua:— (us*a|u € U) (5.88) 
of G are called left co-sets and right co-sets of U in G. The left or right co-sets form a partition of G, 
respectively (see 5.2.4, 2., p. 334). 
ΑΙ the left or right co-sets of a subgroup U in a group G have the same number of elements, namely 
ordU. From this it follows that the number of left co-sets is equal to the number of right co-sets. This 
number is called the index of U in G. The Lagrange theorem follows from these facts. 
4. Lagrange Theorem The order of a subgroup is a divisor of the order of the group. 
In general it is difficult to determine all the subgroups of a group. In the case of finite groups the 
Lagrange theorem as a necessary condition for the existence of a subgroup is useful. 
2. Normal Subgroup or Invariant Subgroup 
For a subgroup U, in general, aU is different from Ua (however |aU| = |Ua| is valid). If aU = Ua for all 
à € G holds, then U is called a normal subgroup or invariant subgroup of G. These special subgroups 
are the basis of forming factor groups (see 5.3.3.3, 3., p. 339). 
In Abelian groups, obviously, every subgroup is a normal subgroup. 
Examples of Subgroups and Normal Subgroups: 
B A: RV (0), QV (0) form subgroups of C \ {0} with respect to multiplication. 
W B: The even integers form a subgroup of Z with respect to addition. 
W C: Subgroups of 53: According to the Lagrange theorem the group 55 having six elements can have 
subgroups only with two or three elements (besides the trivial subgroups). In fact, the group δα has 
the following subgroups: E — (e), Ui — (e, pi), Us — (e, pa), Us — (6: pa]; Ua — (8 Pa, Ps}, S3- 
The non-trivial subgroups U;, U2, U3, and U4 are cyclic, since the numbers of their elements are primes. 
But the group S3 is not cyclic. The group S3 has only U4 as a normal subgroup, except the trivial 
normal subgroups. 
Anyway, every subgroup U of a group G with |U| = |G|/2 is a normal subgroup of G. 
Every symmetric group Sy and their subgroups are called permutation groups. 
E D: Special subgroups of the group GL(n) of all regular matrices of type (n, n) with respect to matrix 
multiplication: 
SL(n) group of all matrices A with determinant 1, 
O(n) group of all orthogonal matrices, 
SO(n) group of all orthogonal matrices with determinant 1. 
The group SL(n) is anormal subgroup of GL(n) (see 5.3.3.3, 3., p. 339) and SO(n) is anormal subgroup 
of O(n). 
W E: Assubgroups of all complex matrices of type (n, n) (see 4.1.4, p. 272): 
U(n) group of all unitary matrices, 
SU(n) group of all unitary matrices with determinant 1. 
3. Direct Product 
1. Definition Suppose A and B are groups, whose group operation (e.g., addition or multiplication) 
is denoted by -. In the Cartesian product (see 5.2.2, 4., p. 331) A x B (5.65a) an operation * can be 
introduced in the following way: 


(a3, 03) * (a2, b2) = (a1 * az, bı - b2). (5.89a) 
A x B becomes a group with this operation and it is called the direct product of A and B. 
(e, e) denotes the unit element of A x B, (a7™}, b7}) is the inverse element of (a, b). 
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For finite groups A, B 

ord (A x B) = ord A- ord B (5.89b) 
holds. The groups A’ := {(a,e)|a € A} and B' := ((e,0)|b € B} are normal subsets of Ax B isomorphic 
to A and B, respectively. 
The direct product of Abelian groups is again an Abelian group. 
The direct product of two cyclic groups A, B is cyclic if and only if the greatest common divisor of the 
orders of the groups is equal to 1. 
E A: With Z = {e, a} and Z3 = {e, b, b?}, the direct product Z2 x Zz; = {(e, e), (e, b), (e, b?), (a, e), (a, 
b), (a, b?) }, is a group isomorphic to Ze (see 5.3.3.3, 2., p. 339) generated by (a, b). 
E B: On the other hand Z x Z2 = {(e, e), (e, b), (a, e), (a, b)} is not cyclic. This group has order 4 
and it is also called Klein’s four group, and it describes the covering operations of a rectangle. 
2. Fundamental Theorem of Abelian Groups Because the direct product is a construction 
which enables to make “larger” groups from “smaller” groups, the question can be reversed: When is 
it possible to consider a larger group G as a direct product of smaller groups A, B, i.e., when will G be 
isomorphic to A x B? For Abelian groups, there exists the so-called fundamental theorem: 
Every finite Abelian group can be represented as a direct product of cyclic groups with orders of prime 
powers. 


5.3.3.3 Mappings Between Groups 


1. Homomorphism and Isomorphism 
1. Group Homomorphism Between algebraic structures, not arbitrary mappings, but only “struc- 
ture keeping” mappings are considered: 
Let Gy = (G1, *) and Gy = (G2,0) are two groups. A mapping h: G4 — G3 is called a group homo- 
morphism, if for all a,b € G, holds: 
h(a * b) = h(a)oh(b) (“image of product = product of images” ) (5.90) 
W A^s an example, consider the multiplication law for determinants (see 4.2.2, 7., p. 279): 
det(AB) = (det A)(det B). (5.91) 
Here on the right-hand side there is the product of non-zero numbers, on the left-hand side there is the 
product of regular matrices. 
If h: G4 — Ga» is a group homomorphism, then the set of elements of G1, whose image is the neutral 
element of G3, is called the kernel of h, and it is denoted by ker h. The kernel of h is a normal subgroup 
of Gi. 
2. Group Isomorphism Ifa group homomorphism h is also bijective, then h is called a group iso- 
morphism, and the groups G'; and G2 are called isomorphic to each other (notation: G; = G2). Then 
ker h = E is valid. 
Isomorphic groups have the same structure, i.e., they differ only by the notation of their elements. 
W The symmetric group 53 and the dihedral group D3 are isomorphic groups of order 6 and describe 
the covering mappings of an equilateral triangle. 
2. Cayley's Theorem 
The Cayley theorem says that every group can be interpreted as a permutation group (see 5.3.3.2, 2., 
p. 338): 
Every group is isomorphic to a permutation group. 
The permutation group P, whose elements are the permutations 7, (g € G) mapping a to G, xg, isa 
subgroup of Sg isomorphic to (G, *). 
3. Homomorphism Theorem for Groups 
The set of co-sets of a normal subgroup N in a group G is also a group with respect to the operation 
aN obN =abN. (5.92) 
It is called the factor group of G with respect to N, and it is denoted by G/N. 
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The following theorem gives the correspondence between homomorphic images and factor groups of a 
group, because of what it is called the homomorphism theorem for groups: 

A group homomorphism h: G1 — G» defines a normal subgroup of G4, namely ker h — (a € Gy|h(a) = 
€). The factor group G;/ ker h is isomorphic to the homomorphic image h(G1) = {h(a)|a € G1). Con- 
versely, every normal subgroup N of G; defines a homomorphic mapping natw: G4 — G/N with 
natn(a) = aN. This mapping naty is called a natural homomorphism. 

E Since the determinant construction det: GL(n) — RV (0) is a group homomorphism with ker- 
nel SL(n), SL(n) is a normal subgroup of GL(n) and (according to the homomorphism theorem): 
GL(n)/SL(n) is isomorphic to the multiplicative group R\ {0} of real numbers (for notation see 5.3.3.2, 
2., p. 338). 


5.8.4 Group Representations 
5.3.4.1 Definitions 


1. Representation 
A representation D(G) of the group G is a map (homomorphism) of G onto the group of non-singular 
linear transformations D on an n-dimensional (real or complex) vector space Vp: 

D(G):a> D(a), aeG. (5.93) 
The vector space V, is called the representation space; n is the dimension of the representation (see 
also 12.1.3, 2., p. 657). Introducing the basis (ej). (i — 1,2,...,n) in V, every vector x can be written 
as a linear combination of the basis vectors: 


x=) ve, x€ Vy. (5.94) 
ici 


The action of the linear transformation D(a), a € G, on x can be defined by the quadratic matrix 
D(a) = (Dix(a)) (i,k — 1,2,...,n), which provides the coordinates of the transformed vector x’ 
within the basis {e;}: 
n n 
x’ = D(a)x = Y wie, x; = X Da(a)my. (5.95) 
i=l k=l 
This transformation may also be considered as a transformation of the basis {e;} > {ej}: 


e; = e,D(a) = S Drila)ep- (5.96) 
πι 


Thus, every element a of the group is assigned to the representation matrix D = (Dix(a)): 
D(G):a>D=(Dyla)) (i,k =1,2,...,n),a€G. (5.97) 

The representation matrix depends on the choice of basis. 

@ A: Abelian Point Group C,,. A regular polygon (see 3.1.5, p. 138) with n sides has a symmetry 


such that rotating it around an axis, which is perpendicular to the y 

plane of the figure and goes through its center M (Fig.5.9) by an an- 

gle ey — 2zk/n,k —0,1,...,n—1 the resulted polygon is identical 

to the original one (invariance of the system under certain rotations). 

The rotations R;(y,) form the Abelian group of points Cn. C, isa 

cyclic group (see 5.3.3.2, p. 337), i.e. every element of the group can 

be represented as a power of a single element 7, whose n-th power 

is the unit element e = Ro: x 
Cr = {e, Ri, Ri,...,RP 1}, RP =e. (5.98a) 

Let the center of an equilateral triangle (n = 3) be the origin (see 

Fig.5.9), then the angles of rotations and the rotations are in accor- Figure 5.9 


A 
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dance with (5.98b). 


k — 0, qo — 0 or 2m, 
k —1, 1 — 20/3, (5.98b) 
k — 2, Q3 — 4n[3. 


Ro: A> A,B > B,C >C, 
Rı: A> B,B > C,C >A, (5.98c) 
Ry: ASC,BOA,COB. 


The rotations (5.98c) satisfy the relations 


Ro = È, Ri - Ro = R? = Ro =e. (5.98d) 


They form the cyclic group C3. 
The matrix of rotation (see (3.432), p. 230) 


_ (cosy —siny 
Ry) = (See end (5.980) 


of a geometric transformation of this triangle (for rotation of this figure in a fixed coordinate system see 
3.5.3.3,3., p. 213) gives the representation of group C3 if y is substituted by the angles given in (5.98b): 


Ῥ(ο) -- R(0) = lf d , D(R;) = R(27/3) = (a 2 (5.988 
D(R2) = R(47r/3) = ( ἕν o (5.98g) 


The same relations hold for the matrices of this representation given in (5.98f) and (5.98g) as for the 
group elements Ry (5.98d): 


D(R2) = D(RıRı) = D(R1)D(Rı), D(Rı)D(R2) = D(e). (5.98h) 


W B: Dihedral Group Ds. The equilateral triangle is invariant with 
respect to rotations by angle π about its bisectors (see Fig.5.10). 
These rotations correspond to reflections S4, Sp, Sc with respect to 
a plane being perpendicular to the plane of the triangle and containing 
one of the rotation axes. 

Sa: Rotations A— A, B C,C > B; 

Sp: Rotations A— C, B > B,C > A; (5.99a) 

So: Rotations A > B, B > A,C > C. 
For the reflections there is: 

SoSo =e (o= A,B,C). (5.99b) 





Figure 5.10 


The product SS- (o # T) results in one of the rotations R,, Ro, e.g. using S4 Sp for the triangle 
AABC: 


Sa Ss(AABC) = 54(ACBA) = ACAB = Ry(AABC), (5.990) 


consequently S4 Sp — R4. Here $4, Sp, Sc correspond to the outcomes on Fig.5.10. 
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€ Ry Rə SA Sp So 





The cyclic group C3 and the reflections S4, Sg, Sc together ele Ri Ra Sa Sp Sc 

form the dihedral group D3. The reflections do not forma — mm Rs e Sc S4 Sp 

subgroup because of (5.99c). A summary of relations is rep- Rə| Rə e Rı Sg So Sa (5.99d) 
resented in group-table (5.99d). S4 S4 Sp Sc e Ry Ro 

Only the signs of the z-coordinates of points B and C are Sp|Sp Sc Sa Ro e Ri 

changed at reflection 54 (see Fig.5.9). This coordinate ^ SolSo S, Sg Ri Ra c 
transformation is given by the matrix 


D(S4) = Ti i} I (5.996) 


The matrices representing reflections Sg and Sg can be found in the group-table (5.99d) and from the 
matrices of representation in (5.98f) and (5.98g) 


Ῥ(δο) - DG)D(S9 - (1452 V2) o 1) - (uia 12) (5.990) 


D(5) - DG)b(89) - (7352 V2) o Ὁ) = (Nee te) (5995) 


Matrices (5.98f) and (5.98g) together with matrices (5.99f) and (5.99g) form a representation of the 
dihedral group D3. 
2. Faithful Representation 
A representation is called faithful if G — D(G) is an isomorphism, i.e., the assignment of the element 
of the group to the representation matrix is a one-to-one mapping. 
3. Properties of the Representations 
A representation with the representation matrices D(a) has the following properties (a,b € G,I unit 
matrix): 

D(a*b) =D(a)-D(b), D(a ')=D (a), D(e) =I. (5.100) 
5.3.4.2 Particular Representations 


1. Identity Representation 
Any group G has a trivial one-dimensional representation (identity representation), for which every 
element of the group is mapped to the unit matrix I: a — I for all a € G. 
2. Adjoint Representation 
The representation D*(G) is called adjoint to D(G) if the corresponding representation matrices are 
related by complex conjugation and reflection in the main diagonal: 

D+t(G) = D*(G). (5.101) 
3. Unitary Representation 
For a unitary representation all representation matrices are unitary matrices: 

D(G)-D*(G) =], (5.102) 
where E is the unit matrix. 
4. Equivalent Representations 
Two representations D(G) and D'(G) are called equivalent if for each element a of the group the cor- 
responding representation matrices are related by the same similarity transformation with the non- 
singular matrix T — (7; ;): 


D'(a) 2 T!-D(a). T, D&(a)— Y; T5: Di(a) Tir, (5.103) 
jl=1 
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where Τη denotes the elements of the inverse matrix T^! of T. If such a relation does not hold two 
representations are called non-equivalent. The transition from D(G) to D'(G) corresponds to the trans- 
formation T' : fe1,e5,...,e4] — (ei, e5,..., el M of the basis in the representation space V4: 


e—eT, e —M Tue, (i =1,2,...,n). (5.104) 


Any representation of a finite group is equivalent to a unitary representation. 
5. Character of a Group Element 


In the representation D(G) the character x(a) of the group element a is defined as the trace of the 
representation matrix D(a) (sum of the main diagonal elements of the matrix): 


x(a) = Tr(D) = Y Du) (a) (5.105) 


The character of the unit τη e is given by the dimension n of the representation: x(e) — n. Since 
the trace of a matrix is invariant under similarity transformations, the group element a has the same 
character for equivalent representations. 
W Within the shell model of atomic or nuclear physics two out of three particles with space coordinates 
T; (i = 1,2,3) can be described by the wave function ya(r) while the third particle has the wave 
function vg(r) (configuration a?8(r)). The wave function v of the system is a product of the three one- 
particle wave functions: 7 = Yeas. In accordance with the possible distributions of the particles 
1, 2,3 to the wave functions one gets the three functions 

Vi = Palti)Palt2)~a(ts) v2 — ea(ri)ee(Ya)wa (rs) , 3 = valli) Palt2)Pa(Fs) ; (5.106a) 
which, when realizing permutations, transform among one another according to 5.3.3.1, 2., p. 337. This 
way one gets for the functions i23 a three dimensional representation of the symmetric group 53. 
According to (5.93) the matrix elements of the representation matrices can be found by investigating the 
action of the group elements (5.84) on the coordinate subscripts in the basis elements e;. For example: 

pii — pigo(Yi)ea (Y2)pg (Ya) = PalF1) Pa (F2) Pals) ^ Dm(pi)U», 

Pils = pigo(Yi)eg(Y2)ga (Ya) — eo(ri)eo(r2)ea(rs) ^ Dis(pi)vi, 

pis — pieg(ri)ea(ro)eo (rs) — ea(ri)ga(r2)go(rs) — Dss(pi)vss. (5.106b) 
Altogether one finds: 


100 010 001 
b se[orol, by [100], bt ται. 

001 001 100 

100 010 001 (5.1060) 
D(ps) - (81). D(z) -(01). D(ps) (200). 

010 100 010 


For the characters one has: 

x(e) = 3, x(p1) = x(P2) = x(p3) = 1, x(p4) — x(ps) = 0. (5.106d) 
5.3.4.3. Direct Sum of Representations 
The representations DO (G), DO (G) of dimension n1 and n2 can be composed to create a new repre- 


sentation D(G) of dimension n = nı + na by forming the direct sum of the representation matrices: 


(1) 
— pO Gy [Ὁ. ία) 0 E 
D(a) = DY (a) p D” (a) ( 0 DO()]: (5.107) 
The block-diagonal form of the representation matrix implies that the representation space V, is the 
direct sum of two invariant subspaces V, , V5: 


η ξ πι ϐ πο, N= +n. (5.108) 
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A subspace Vm (m < n) of Vn is called an invariant subspace if for any linear transformation D(a) 
G, every vector x € V,, is mapped onto an element of Vm again: 
x —D(a)x with x,x'€ V,. (5.109) 
The character of the representation (5.107) is the sum of the characters of the single representations: 
x(a) = x(a) + x(a). (5.110) 


5.3.4.4 Direct Product of Representations 
Ife; (i= 1,2,...,n1) and e'j (k — 1,2,...,n2) are the basis vectors of the representation spaces V, 
and V,,,, respectively, then the tensor product 

ex, = {ee,} (¢=1,2,...,m1; k=1,2,...,n2) (5.111) 


ας 


forms a basis in the product space V4, & V4, of dimension n - ng. With the representations DO (G) 
and D) (G) in Vn, and V,,, respectively an n; - r-dimensional representation D(G) in the product 
space can be constructed by forming the direct or (inner) Kronecker product (see 4.1.5,9., p. 276) of 
the representation matrices: 
1 2 
D(G) = D?(G)eDO(G), (D(G); = Die (a) - Dy? (a) 
with i,k =1,2,...,m1; 7, 1,2,... πο. (5.112) 


The character of the Kronecker product of two representations is equal to the product of the characters 
of the factors 


x99 (a) 2 xO(a) . X9 (a). (5.113) 


5.3.4.5 Reducible and Irreducible Representations 
If the representation space V,, possesses a subspace Vm (m « n) invariant under the group operations 
the representation matrices can be decomposed according to 
x Di(a A (m TOWS 
1. pe 1 
T- - D(a): T ( 0 Da(a) ) ( n — m rows 


by a suitable transformation T of the basis in V4. Di(a) and D(a) themselves are matrix representa- 
tions of a € G of dimension m and n — m, respectively. 


(5.114) 


A representation D(G) is called irreducible if there is no proper (non-trivial) invariant subspace in Vp. 
The number of non-equivalent irreducible representations of a finite group is finite. If a transformation 
T of a basis can be found which makes V, to a direct sum of invariant subspaces, i.e., 


Van = V19- 9 Vs, (5.115) 
then for every a € G the representation matrix D(a) can be transformed into the block-diagonal form 
(A = 0 in (5.114)): 

DO (a) 0 

T?.D(a)-T =D% (a) @---@ D(a) = E I (5.116) 

0 D(a) 
by a similarity transformation with T. Such a representation is called completely reducible. 
Remark: For the application of group theory in natural sciences a fundamental task consists in the 
classification of all non-equivalent irreducible representations of a given group. 
W The representation of the symmetric group 53 given in (5.106c), p. 343, is reducible. For example, 
in the basis transformation (e,,e5, e) —9 (e; — e; -- e3 +63, e = €, €} = ez} one obtains for the 
representation matrix of the permutation ps (with V; — ej, V — es, ia — e): 


100 
ποσα ω 





010 
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with A = (ο) , Di(p3) = 1 as the identity representation of S3 and Do(p3) = ( 0): 


5.3.4.6 Schur's Lemma 1 


If C is an operator commuting with all transformations of an irreducible representation D of a group 
(C. D(a)] = C- D(a) —D(a)-C =0, a € G, and the representation space V, is an invariant subspace 
of C, then C is a multiple of the unit operator, i.e., a matrix (cj) which commutates with all matrices 
of an irreducible representation is a multiple of the matrix I, C = A-I, AEC. 

5.3.4.7 Clebsch-Gordan Series 

In general, the Kronecker product of two irreducible representations D® (G), D® (G) is reducible. By 
a suitable basis transformation in the product space D® (G) & D®) (G) can be decomposed into the 


direct sum of its irreducible parts D? (a — 1,2,...,n) (Clebsch-Gordan theorem). "This expansion 
is called the Clebsch-Gordan series: 
DO(G) & DO (a) = $^ 6 m,D(?(G). (5.118) 
a=1 


Here, mq is the multiplicity with which the irreducible representation D (G) occurs in the Clebsch— 
Gordan series. 

The matrix elements of the basis transformation in the product space causing the reduction of the 
Kronecker product into its irreducible components are called Clebsch-Gordan coefficients. 


5.3.4.8 Irreducible Representations of the Symmetric Group Sj 
1. Symmetric Group Sy 


The non-equivalent irreducible representations of the symmetric group Sj are characterized uniquely 
by the partitions of M, i.e., by the splitting of M into integers according to 

1] --[λι,λ2,...,λµ], Artà + AÀW—M, M22322:e2A4 20 (5.119) 
The graphic representation of the partitions is done by arranging boxes in Young diagrams. 


W For the group 54 one obtains five Young diagrams 4 
as shown in the figure. [3,1] 221 [21,1 [1] 
The dimension of the representation [A] is given by CELER H-- 
JL Oe e 3-9 
nl 2 MIEL —— —. 5.120 
s IA; tki)! ἘΠ 


The Young diagram [A] conjugated to [A] is constructed by the interchange of rows and columns. 
In general, the irreducible representation of Sj, is reducible if one restricts to one of the subgroups 
Sy-i S2 

W In quantum mechanics for a system of identical particles the Pauli principle demands the construc- 
tion of many-body wave functions that are antisymmetric with respect to the interchange of all coordi- 
nates of two arbitrary particles. Often, the wave function is given as the product of a function in space 
coordinates and a function in spin variables. If for such a case due to particle permutations the spatial 
part of the wave function transforms according to the irreducible representation [A] of the symmetric 
group, then it has to be combined with a spin function transforming according to 1] in order to get a 
total wave function which is antisymmetric if two particles are interchanged. 


5.3.5 Applications of Groups 


In chemistry and in physics, groups are applied to describe the "symmetry" of the corresponding 
objects. Such objects are, for instance, molecules, crystals, solid structures or quantum mechanical 
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systems. The basic idea of these applications is the von Neumann principle: 
If a system has a certain group of symmetry operations, then every physical observational quantity of 
this system must have the same symmetry. 


5.3.5.1 Symmetry Operations, Symmetry Elements 


A symmetry operation s of a space object is a mapping of the space into itself such that the length of 
line segments remains unchanged and the object goes into a covering position to itself. The set of fixed 
points of the symmetry operation s is denoted by Fix s, i.e., the set of all points of space which remain 
unchanged for s. The set Fix s is called the symmetry element of s. 'The Schoenflies symbolism is used 
to denote the symmetry operation. 

Two types of symmetry operations are distinguished: Operations without a fixed point and operations 
with at least one fixed point. 

1. Symmetry Operations without a Fixed Point, for which no point of the space stays un- 
changed, cannot occur for bounded space objects, but now only such objects are considered. A sym- 
metry operation without a fixed point is for instance a parallel translation. 

2. Symmetry Operations with at least One Fixed Point are for instance rotations and reflec- 
tions. The following operations belong to them. 

a) Rotations Around an Axis by an Angle y: The axis of rotation and also the rotation itself is 
denoted by C, for y = 27 /n. The axis of rotation is then called of n-th order. 

b) Reflection with Respect to a Plane: Both the plane of reflection and the reflection itself are 
denoted by c. If additionally there is a principal rotation axis, then one draws it perpendicularly and 
denote the planes of reflections which are perpendicular to this axis by o; (h from horizontal) and the 
planes of reflections passing through the rotational axis are denoted by c; (v from vertical) or o4 (d 
means dihedral, if certain angles are halved). 

c) Improper Orthogonal Mappings: An operation such that after a rotation Cj, a reflection op 
follows, is called an improper orthogonal mapping and it is denoted by S,. Rotation and reflection 
commute. The axis of rotation is then called an improper rotational axis of n-th order and it is also 
denoted by Sn. This axis is called the corresponding symmetry element, although only the symme- 
try center stays fixed under the application of the operation S,. For n = 2, an improper orthogonal 
mapping is also called a point reflection or inversion (see 4.3.5.1, p. 287) and it is denoted by i. 


5.3.5.2 Symmetry Groups or Point Groups 
For every symmetry operation S, there is an inverse operation S~!, which reverses S “back”, ie., 
SS = SIS se. (5.121) 
Here e denotes the identity operation, which leaves the whole space unchanged. The family of symmetry 
operations of a space object forms a group with respect to the successive application, which is in general 
a non-commutative symmetry group of the objects. The following relations hold: 
a) Every rotation is the product of two reflections. The intersection line of the two reflection planes is 
the rotation axis. 
b) For two reflections c and σ’ 
σσ' -- σ'σ (5.122) 
if and only if the corresponding reflection planes are identical or they are perpendicular to each other. 
In the first case the product is the identity e, in the second one the rotation C5. 
c) The product of two rotations with intersecting rotational axes is again a rotation whose axis goes 
through the intersection point of the given rotational axes. 
d) For two rotations Cz and C% around the same axis or around axes perpendicular to each other: 


EU! e CLOS. (5.123) 
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The product is again a rotation. In the first case the corresponding rotational axis is the given one, in 
the second one the rotational axis is perpendicular to the given ones. 

5.3.5.3 Symmetry Operations with Molecules 

It requires a lot of work to recognize every symmetry element of an object. In the literature, for instance 
in [5.10], [5.13], it is discussed in detail how to find the symmetry groups of molecules if all the symmetry 
elements are known. The following notation is used for the interpretation of a molecule in space: The 
symbols above C in Fig. 5.11 mean that the OH group lies above the plane of the drawing, the symbol 


to the right-hand side of C means that the group OC2Hs is under C. 
The determination of the symmetry group can be made by the following method. 


1. No Rotational Axis 


a) If no symmetry element exists, then G = {e} holds, i.e., the molecule does not have any symmetry 
operation but the identity e. 

W The molecule hemiacetal (Fig.5.11) is not planar and it has four different atom groups. 

b) Ifo is a reflection or i is an inversion, then G — (e,0) =: C, or G = {e,i} = C; hold, and with this 
it is isomorphic to Za. 

W The molecule of tartaric acid (Fig.5.12) can be reflected in the center P (inversion). 


H 


OH 
Y HO —— CI: CO;E 


P 
CH, —— Cl o 
A | CO,HB= C— OH 





me 


H 
Figure 5.11 Figure 5.12 Figure 5.13 


2. There is Exactly One Rotational Axis C 

a) If the rotation can have any angle, i.e., C = Cx, then the molecule is linear, and the symmetry 
group is infinite. 

W A: For the molecule of sodium chloride (common salt) NaCl there is no horizontal reflection. The 
corresponding symmetry group of all the rotations around C is denoted by Cov- 

E B: The molecule Ox has one horizontal reflection. The corresponding symmetry group is generated 
by the rotations and by this reflection, and it is denoted by Doon. 

b) The rotation axis is of n-th order, C — C, , but it is not an improper rotational axis of order 2n. 

If there is no further symmetry element, then G is generated by a rotation d by an angle m/n around 
Cn, i.e., G =< d > Zn. In this case G is also denoted by Ch. 

If there is a further vertical reflection o,, then G =< d,o, >= Dy, holds (see 5.3.3.1, p. 336), and G is 
denoted by C. 

If there exists an additional horizontal reflection op, then G = < d, 6, » € Z, x Za holds. G is denoted 
by Cna and it is cyclic for odd n (see 5.3.3.2, p. 337). 

W A: For hydrogen peroxide (Fig.5.13) these three cases occur in the order given above for 0 < 6 < 
1/2,6 — 0 and ó — 7/2. 

E B: The molecule of water H3O has a rotational axis of second order and a vertical plane of reflection, 
as symmetry elements. Consequently, the symmetry group of water is isomorphic to the group D2, 
which is isomorphic to the Klein four-group V4 (see 5.3.3.2, 3., p. 338). 

c) The rotational axis is of order n and at the same time it is also an improper rotational axis of order 
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2n. We have to distinguish two cases. 
a) There is no further vertical reflection, so G = Zn holds, and G is denoted also by S2n. 


WB An example is the molecule of tetrahydroxy allene with formula C3(OH), (Fig.5.14). 

B) If there is a vertical reflection, then G is a group of order 4n, which is denoted by Don. 

Hn = 2 gives G = Du, ie., the dihedral group of order eight. An example is the allene molecule 
(Fig.5.15). 





Figure 5.14 Figure 5.15 Figure 5.16 


3. Several Rotational Axes If there are several rotational axes, then one has to distinguish further 
cases. In particular, if several rotational axes have an order n > 3, then the following groups are the 
corresponding symmetry groups. 

a) Tetrahedral group Ty: Isomorphic to S4, ordTy = 24. 

b) Octahedral group Op: Isomorphic to S4 x Z2, ordOp = 48. 

c) Icosahedral group 1j: ord; — 120. 

These groups are the symmetry groups of the regular polyhedron discussed in 3.3.3, Table 3.7, p. 155, 
(Fig.3.63). 

W The methane molecule (Fig.5.16) has the tetrahedral group Ty as a symmetry group. 


5.3.5.4 Symmetry Groups in Crystallography 


1. Lattice Structures 

In crystallography the parallelepiped represents, independently of 
the arrangement of specific atoms or ions, the elementary (unit) cell 
of the crystal lattice. It is determined by three non-coplanar ba- 
sis vectors à; starting from one lattice point (Fig. 5.17). The infi- 
nite geometric lattice structure is created by performing all primitive 
translations tu 

th = may + nod + ngd3, n= (ni no, na) ni E€ Z. (5.124) 
Here, the coefficients n; (i — 1,2,...) are integers. 





Figure 5.17 


All the translations t, fixing the space points of the lattice L = {t,,} in terms of lattice vectors form 
the translation group T with the group element T(t,), the inverse element T-!(£;) = T(—t,), and 
the composition law T(t) * T(tm) = (tn + tm). The application of the group element T(t,) to the 
position vector r is described by: 


T(E) =T + ta. (5.125) 
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2. Bravais Lattices 


Taking into account the possible combinations of the relative lengths of the basis vectors aj and the pair- 
wise related angles between them (particularly angles 90° and 120°) one obtains seven different types 
of elementary cells with the corresponding lattices, the Bravais lattices (see Fig. 5.17, and Table 5.4). 
This classification can be extended by seven non-primitive elementary cells and their corresponding 
lattices by adding additional lattice points at the intersection points of the face or body diagonals, pre- 








serving the symmetry of the elementary cell. In this way one may distinguish one-side face-centered 
lattices, body-centered lattices, and all-face centered lattices. 
Tabelle 5.4 Primitive Bravais lattice 
Elementary cell | Relative lengths | Angles between 
of basis vectors | basis vectors 
triclinic a, # dy F ag o z Bz-z90? 
monoclinic a, F# a2 F a3 a= 7=90° 8B 
rhombic a, F a2 F a3 α-β-η-90 
trigonal ay = a2 = a3 a=B=7< 120°(F 90°) 
hexagonal a, = a2 £ a3 a = ß = 90°, y = 120° 
tetragonal ay, = d3 £ a3 a=B=7=90° 
cubic a, = dz = ag a = ß = y = 90° 











3. Symmetry Operations in Crystal Lattice Structures 

Among the symmetry operations transforming the space lattice to equivalent positions there are point 
group operations such as certain rotations, improper rotations, and reflections in planes or points. But 
not all point groups are also crystallographic point groups. The requirement that the application of a 





group element to a lattice vector ty leads to a lattice vector t/, € L (L is the set of all lattice points) 
again restricts the allowed point groups P with the group elements P(R) according to: 


P-(R:Rt,€L), ΕΙ. (5.126) 
Here, R denotes a proper (R € ,O(3)) or improper rotation operator (R = IR’ € O(3),R’ € 
SO(3),I is the inversion operator with /r — —r, T is a position vector). For example, only n-fold 


rotation axes with n = 1, 2,3,4 or 6 are compatible with a lattice structure. Altogether, there are 32 
crystallographic point groups P. 

The symmetry group of a space lattice may also contain operators representing simultaneous applica- 
tions of rotations and primitive translations. In this way one gets gliding reflections, i.e., reflections in 
a plane and translations parallel to the plane, and screws, i.e., rotations through 27 /n and translations 
by m&/n (m — 1,2,...,n — 1, & are basis translations). Such operations are called non-primitive 
translations V(R), because they correspond to “fractional” translations. For a gliding reflection R is 
a reflection and for a screw FR is a proper rotation. 

The elements of the space group G, for which the crystal lattice is invariant are composed of elements 
P of the crystallographic point group P, primitive translations T (t) and non-primitive translations 


V(R): 
G={{R|V(R)+ta: RE P, tn €L}. (5.127) 


The unit element of the space group is {e|0} where e is the unit element of R. The element (e|t5) 
means a primitive translation, {R|0} represents a rotation or reflection. Applying the group element 
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{R\t,,} to the position vector f one obtains: 

{Rty}F = RF+ ty. (5.128) 
4. Crystal Systems (Holohedry) 
From the 14 Bravais lattices, L = {t,,} , the 32 crystallographic point groups P = { R} and the allowed 
non-primitive translations V (R) one can construct 230 space groups G = {R|V(R) + tn}. The point 
groups correspond to 32 crystallographic classes. Among the point groups there are seven groups that 
are not a subgroup of another point group but contain further point groups as a subgroup. Each of 
these seven point groups form a crystal system (holohedry). The symmetry of the seven crystal systems 
is reflected in the symmetry of the seven Bravais lattices. The relation of the 32 crystallographic classes 
to the seven crystal systems is given in Table 5.5 using the notation of Schoenflies. 
Remark: The space group G (5.127) is the symmetry group of the “empty” lattice. The real crystal 
is obtained by arranging certain atoms or ions at the lattice sites. The arrangement of these crystal 
constituents exhibits its own symmetry. Therefore, the symmetry group Go of the real crystal possesses 
a lower symmetry than G (G D Go), in general. 


Table 5.5 Bravais lattice, crystal systems, and crystallographic classes 
Notation: Cn — rotation about an n-fold rotation axis, D,, — dihedral group, T; — tetrahedral group, 
On — octahedral group, Sn — mirror rotations with an n-fold axis. 








Lattice type | Crystal system | Crystallographic class 
(holohedry) 

triclinic Ci Ci, Cj 

monoclinic Con C2, Ch, Con 

rhombic Don Ca, Da, Don 

tetragonal Dan C4, 84, Can, Da; Cav, Doa, Dan 

hexagonal Den C's. Can, Con, Do, Cov, Dan, Den 

trigonal Dza C3, S6, Ds, Cav, Dsa 

cubic On T; Th, aO, Oh 

















5.3.5.5 Symmetry Groups in Quantum Mechanics 
Linear coordinate transformations that leave the Hamiltonian H of a quantum mechanical system (see 
9.2.4, 2., p. 593) invariant represent a symmetry group G, whose elements g commute with H: 

(9, |] - gf - lg - 0, geG. (5.129) 
'The commutation property of g and A implies that in the application of the product of the operators 
g and H to a state qw the sequence of the action of the operators is arbitrary: 

g(Hq) — H(go). (5.130) 


Hence, one has: If ego. (o — 1,2,...,n) are the eigenstates of H with energy eigenvalue E of degener- 
acy n, i.e., 


Hone = Evra (a=1,2,...,n); (5.131) 
then the transformed states gyzq are also eigenstates belonging to the same eigenvalue E: 


gĤpra = gyra = Egpga- (5.132) 
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The transformed states gogo can be written as a linear combination of the eigenstates yga: 


n 
σφπα -- Σ) Ὀρα(σ)Φπβ- (5.133) 
B=1 

Hence, the eigenstates Yq form the basis of an n-dimensional representation space for the representa- 
tion D(G) of the symmetry group G of the Hamiltonian H with the representation matrices (Dag(g)) . 
This representation is irreducible if there are no “hidden” symmetries. One can state that the energy 
eigenstates of a quantum mechanical system can be labeled by the signatures of the irreducible repre- 
sentations of the symmetry group of the Hamiltonian. 
Thus, the representation theory of groups allows for qualitative statements on such patterns of the en- 
ergy spectrum of a quantum mechanical system which are established by the outer or inner symmetries 
of the system only. Also the splitting of degenerate energy levels under the influence of a perturbation 
which breaks the symmetry or the selection rules for the matrix elements of transitions between energy 
eigenstates follows from the investigation of representations according to which the participating states 
and operators transform under group operations. 


The application of group theory in quantum mechanics is presented extensively in the literature (see, 
e.g., [5.6], [5.7], [5.8], [5.10], [5.11]). 

5.3.5.6 Further Applications of Group Theory in Physics 

Further examples of the application of particular continuous groups in physics can only be mentioned 
here (see, e.g., [5.6], [5.10]). 

U(1): Gauge transformations in electrodynamics. 

SU(2): Spin and isospin multiplets in particle physics. 

SU(3): Classification of the baryons and mesons in particle physics. Many-body problem in nuclear 
physics. 

SO(3): Angular momentum algebra in quantum mechanics. Atomic and nuclear many-body problems. 
SO(4): Degeneracy of the hydrogen spectrum. 

SU(4): Wigner super-multiplets in the nuclear shell model due to the unification of spin and isospin 


degrees of freedom. Description of flavor multiplets in the quark model including the charm degree of 
freedom. 


SU(6): Multiplets in the quark model due to the combination of flavor and spin degrees of freedom. 
Nuclear structure models. 


U 
SU(n), SO(n): Many-body problems in nuclear physics. 

SU (2) & U (1): Standard model of the electro weak interaction. 

SU(5) 5 SU(3) & SU(2) ® U(1): Unification of fundamental interactions (GUT). 

Remark: The groups SU(n) and SO(n) are Lie groups, i.e. continuous groups (see, 5.3.6, p. 351 and 
e.g., [5.6]). 

5.3.6 Lie Groups and Lie Algebras 


5.3.6.1 Introduction 

Lie groups and Lie algebras are named after the Norwegian mathematician Sophus Lie (1842-1899). In 
this chapter only Lie groups of matrices are considered since they are most important in applications. 
Main examples of matrix-Lie groups are: 


~ 


n): Shell models in atomic and nuclear physics. 





e the group O(n) of orthogonal matrices, 


e the subgroup SO(n) of orthogonal matrices of determinants +1, i.e. the orthogonal matrices de- 
scribing rotations in R”, 
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e the Euclidean group S E(n), which describes rigid-body motions. 

These groups have many applications in computer graphics and in robotics. 

The most important relation between a Lie group and the corresponding Lie algebra will be described 
by the exponential mapping. This relation is explained by the following example. 


W The solution of initial value problems of first order differential equations or of a system of differential 
equations can be determined with the help of the exponential function. 
The initial value problem (5.134a) for y = y(t) has the following solution (5.134b): 


dy 

dt 
Similarly, for the system of first order differential equations with unknown vector y = y(t) and with 
the constant coefficient matrix X the initial value problem (5.135a) 


dy (dy di dy, Y. 
T = (= τ s te) — Xy (matrix X const) with y(0) — yo, (5.135a) 


=xy (aconst) with y(0) = yo, (5.134a) πο = tyo. (5.134b) 


has the solution (5.135b) with the matrix-exponential function εἰ 
γα) ορ, οκ.- Σ gix = = Inxn + xx qx. (5.135b) 


The special matrix-exponential function e’* for a given quadratic n x n matrix X has the following 
properties: 


X .r 


ec nxn , Where Inxn denotes the unit matrix. 


e eX is invertible, because det eX = etSPuX 4 0, 


o eHXetX — eltitt)X — et2Xet1X for every tı, ta € R, but in general is eX1eX2 Æ eX2eX1 4 eX1+X2 


-tX tX _ etXe—tX — 


ο In particular e = Inxn- 


d 
αχ 





=Xe%| =X. 


t=0 
t=0 





Consequently, the elements e'™ (for a fixed X) form a multiplicative group with respect to matrix 
multiplication. Since t € R, the matrices e” form a one dimensional group. At the same time it is one 
of the simplest examples of Lie groups. It will be shown that matrices X and tX are elements of the Lie 
algebra belonging to this Lie group (see 5.3.6.4, p. 356). In this way the exponential function generates 
the Lie group from the elements of the Lie algebra. 
5.3.6.2 Matrix-Lie Groups 
For matrix-Lie groups it is not necessary to define Lie groups in general. For general Lie groups there 
should be introduced the notion of differentiable manifolds, which is not needed here. For matrix-Lie 
groups the following definitions are important, while in further discussions the main topic will be the 
general linear group. 
1. General Linear Group 
1. Group A group (see 5.3.3, p. 336) is a set G with a map 

GxGoG, (g,h)egxh, (5.1362) 
which is the so called group operation or group multiplication with the following properties: 
e Associativity: for every g, h, k € G 

g*(h*k) — (gs h)sk, (5.136b) 
e Existence of identity: There is an element e € G , such that for every g € G 

g*xe=exg=gQ, (5.136c) 
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e Existence of an inverse: For every g € G there is an element h € G such that 

πα (5.136d) 
Remark 1: If g * h = h* g for everyg,h € G, then the group is called commutative. The matrix 
groups considered here are not commutative. It follows obviously from the definition, that the product 
of two elements of the group also belongs to the group, so the group is closed with respect to group 
multiplication. 


Remark 2: Let M, (IR) the vector space of all n x n matrices with real entries. M; (IR) is obviously 
not a group with respect to matrix multiplication, since not every n x n matrix is invertible. 

2. Definition of the General Linear Group The set of all real, invertible, n x n matrices, which 
obviously form a group with respect to matrix multiplication, is called the general linear group and is 


denoted by GL(n, R). 


2. Matrix-Lie Groups 


1. Convergence of Matrices A sequence {A,,}°°_, of matrices Am = (a δει where Am € 


M,(R) converges to the n x n matrix A, if every sequence of entries (ates . converges to the 
corresponding matrix entry ax; in the sense of convergence of real numbers. 
2. Definition of the Matrix-Lie Groups A matrix-Lie group is a subgroup G of GL(n, R) with 
the property: Let {A,,}°_, be an arbitrary sequence of matrices from G converging to a matrix A € 
M, (R) in the sense of convergence in M, (IR). Then either A € G or A is not invertible. 
This definition can be also formulated in the following way: A matrix-Lie group is a subgroup which is 
also a closed subset of GL(n, R). (It does not mean, that G must be closed in M,(R)). 
3. Dimension of the Matrix-Lie Group The dimension of a matrix-Lie group is defined as the 
dimension of the corresponding Lie algebra (see 5.3.6.4, p. 356). The matrix-Lie group GL(n, R) has 
dimension n?. 
3. Continuous Groups 
Matrix-Lie groups can be introduced also with the help of continuous groups (see [22.22], [5.9], [5.7]). 
1. Definition A continuous group is a special infinite group whose elements are given uniquely by 
a continuous parameter vector Y = (Y1, P2,- --, Pn): 

a — a(q). (5.137) 


Wl Group of rotation matrices in IR? (see (3.432), p. 230): 
_ [cosy -sing \ _ i E 
He ee E —a(p) mit 0€ o X 2n. (5.138) 
The group elements depend only on one real parameter «ο. 
2. Product The product of two elements a, = a((p,), @2 = a(p,) ofa continuous group with elements 
a = a() is given by 
aı * a2 = a3 = a(p3) with (5.139a) 
gs — f(v1,92); (5.139b) 
where the components of f (i1, 2) are continuously differentiable functions. 
W The product of two rotation matrices a = a(y1) and a — a(q2) with 0 € 1,9» € 2m (a(q) as in 


(5.138), is a3 — a(q1) * a(q2) — a(qs) with ys — f(v1.92) — q1 + Y2. Using the Falk’s scheme (see 
4.1.4, 5., p. 273) and addition theorems one gets: 
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αφ») 


letailed 
alpı)jalps) = alpi Fpa) “ο 








COS p2 — sin p2 

sin p2 COS (3 
cosy, | —singi|cosq1cosqs —singisinQga — COS Y1 SİN Y2 — SİN Y1 COS P2 ` 
sin qi cos p1 | sin p1 COS p2 + cos pı SiN Y2 — sin y1 SİN Y2 + COS Y1 COS P2 





3. Dimension The parameter vectors are elements of a vector space which is called parameter 
space. In this parameter space there is a domain which is given as the domain of the continuous group, 
and it is called the group space. The dimension of this group space is considered as the dimension of 
the continuous group. 

Wl A: The group of the real quadratic n x n invertible matrices has the dimension n?, since every entry 
can be considered as a parameter. 

E B: The group of the rotation matrices (with respect to matrix multiplication) D in (5.138) has 
dimension 1. The rotation matrices are of type 2x2, but their four entries depend only on one parameter 
e (0€ o x27). 

4. Lie Groups 


1. Definition of the Lie Group A Lie group is a continuous group where all elements of the group 
are given as continuous functions of the parameters. 


2. Special Matrix-Lie Groups and their Dimension 
B A Group SO(n) of Rotations R: The group SO(n) of rotations R acts on the elements X c R^ 
with matrix multiplication as X' - R X € R". SO(n) is an n(n — 1)/2-dimensional Lie group. 
B D Special Euclidean Group SE(n): The special Euclidian group SE(n) consists of elements 
g — (R, b) with R € SO(n) and b € R” and with group multiplication g1 o go — (R4 Ra, Rib; + bi). 
It acts on the elements of Euclidean spaces IR" as 
X'— RX4 b. (5.140) 
S E(n) isthe group of rigid-body motions of n-dimensional Euclidean space, it is an n(n--1)/2-dimensional 
Lie group. Discrete subgroups of S E(n) are e.g. the crystallographic space groups, i.e. the symmetry 
group of a regular crystal-lattice. 
E C Scaled Euclidean Group SIM (n): The scaled Euclidian group SIM (n) consists of all pairs 
(eR, b) with a € R, R € SO(n), be R^, with group multiplication g1 o g2 — (e^ *?R, R3, Ribs + 
b). It acts on the elements of IR" by translation, rotation and dilatation (=stretching or shrinking): 
x’ =e"RX+b. (5.141) 
The scaled Euclidean group has the dimension 1 + n(n + 1)/2. 
B D Real Special Linear Group SL(n, IR): The real special linear group consists of all (real) nx n 
matrices with determinant 4-1. It acts on the elements of R" with X' — Lx by rotation, distortion and 
shearing so that the volume remains the same and parallel lines remain parallel. The dimension is 
2 
N^ d 
E E Special Affine Group: The special affine groups of R”, which consists of all pairs (e° L, b) 
with L € SL(n) and bem , acts on the objects in IR" as rotation, translation, shearing, distortion 
and dilatation. This Lie group is the most general group of deformations in Euclidean spaces mapping 
parallel lines into parallel lines; it has dimension n(n + 1). 


B F Group SO(2): The group SO(2) describes all rotations about the origin in R?: 
50) -- ls 2 pe R} (5.142) 


siny cosy 
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E G Group SL (2): Every element of SZ(2) can be represented as 


cos@ —sin@\ (et 0 1€ 
ín cos Ü JU 2) G i : (5.143) 


B H Group SE(2): The elements of the group S E(2) can be represented as 3 x 3 matrices: 


cos? —sin@ x, 2 
sinf cos mr» with 0 c R and Xx — i επ. (5.144) 
0 0 1 P2 


Remark: Beside real matrix-Lie groups complex matrix-Lie groups also can be considered. So, e.g. 
SL(n,C) is the Lie group of all complex n x n matrices with determinant +1. Similarly there are 
matrix-Lie groups whose entries are quaternions. 

5.3.6.3 Important Applications 


1. Rigid Body Movement 
1. The group SE(3) is the group of rigid-body motions in the Euclidean space R*. That is why it is so 
often applied in control of robots. The 6 independent transformations are defined usually as follows: 


1. Translation in x-direction, 4. Rotation about the z-axis, 
2. Translation in y-direction, 5. Rotation about the y-axis, 
3. Translation in z-direction, 6. Rotation about the z-axis. 


These transformations can be represented by 4 x 4 matrices applied to homogeneous coordinates (see 
3.5.4.2, p. 231) in 3 dimensions, i.e. (a, y, z)™ € R? is represented as a vector (a, y, 2, 1) with four 
coordinates (see 3.5.4.2, p. 231). 

Matrices corresponding to the transformations 1 until 6 are: 


100a 1000 1000 
0100 0100 0100 

Mi-|ggasp|* Me |ao10]* Me—|on1e (5.145a) 
0001 0001 0001 
1 0 0 0 cosB 0 sinB 0 cosy —siny 0 0 

_ | 0 cosa —sina 0 . 0 1.00 _ | siny cosy 00 7 

Mi=] sina cosa 0 |? Ms= “dao bone$ | Me 0 0 10 .(5.145b) 

0 0 0 1 0 0.0 1 0 0 01 


The matrices My, Ms, Mg describe the rotations in RË, consequently SO(3) is a subgroup of SE(3). 
'The group S E(3) acts on X — (v, y, z)! € R? with homogeneous coordinates (X, 1)" as follows: 


x' Rv) (Χ RxX+Vv 
G)-G)00)- 07 αν 
where R € SO(3) is a rotation, and V — (a, b, c)T is a translation vector. 
2. Affine Transformations of 2-Dimensional Space 
The group GA(2) of affine transformations of the 2-dimensional space is a 6-dimensional matrix Lie 
group with the following 6 dimensions: 


1. Translation in x-direction, 4. Stretching or shrinking with respect to the origin, 
2. Translation in y-direction, 5. Shearing (stretching with resp. to y, with resp. to x), 
3. Rotation about the origin, 6. 45°-shearing with respect to 5. 


Also these transformations are described by matrices in homogeneous coordinates (a, y, 1) for (x, y)? € 


R?: 
10a 100 cosa — sina 0 
Μι -- (° 1 o) , M= (° 1 ' , Ma- (9^ cosa o) ; (5.1472) 
001 001 0 0 1 
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e 00 e 0 0 coshy sinhy 0 
My, = ι e o) < M;= ( 0 e™ o) , M= (v cosh v o) : (5.147b) 





001 0 01 0 0 1 
This group has as essential subgroups the translation group, given by M, and Mg, the Euclidean group 
S'E(2), given by My, Mz and Ms, the similarity group, given by My, Ma, M3, Mg. 
Application: The group G'A(2) can be applied to describe all transformations of a planar object 
which is recorded under slight angle modifications by a camera moving in the 3 dimensional space. 
If also large changes in angles of perspectivity can occur, then group P(2) the group of all transforma- 
tions of projective spaces can be used. The matrix-Lie group is generated by the matrices M; until Mg 
and by the two further matrices 


100 100 
M;- (^ n). M; = (° 10). (5.147c) 
801 051 


These two additional matrices correspond to a change of the horizon or vanishing of an edge of the plane 
picture. 
5.3.6.4 Lie Algebra 


1. RealLie algebra 
A real Lie algebra A is a real vector space with an operation 
[,:-]:AxKxA A, (5.148) 
which is called the Lie bracket and for which the following properties are valid for all a,b,c € A: 
e |., .] is bilinear, 
e [a, b] — —[b, a], i.e. the operation is skew-symmetric or anticommutative, 
e the so called Jacobi identity is valid (as a replacement of the missing associativity) 
[a, [b, cl] +[e, la, b]] + [b, [ο, a]] — 0. (5.149) 
Obviously [a , a] — 0 holds. 
2. Lie Bracket 
For (real) n x n matrices X and Y a Lie bracket is given by the commutator, i.e. 
(X, Y]:2 XY - YX. (5.150) 
3. Special Lie-Algebras 
There are associated Lie algebras to matrix-Lie groups. 
1. A function g: R > GL(n) is a one-parameter subgroup of GL(n), if 
e gis continuous, 
96 g(0) = Inxn, 
e g(t + s) = g(t)g(s) for every t, s € R. 
In particular: 
2. If g isa one-parameter subgroup of GL(n), then there exists a uniquely defined matrix X such that 
g(t) — ο (see 5.3.6.1, p. 351). (5.151) 
3. For every n x n matrix A the logarithm log A is defined by 
E 
log A = xc 2———-(A - I)" (5.152) 
m=1 
if this series is convergent. In particular, the series converges if ||A — I|| < 1. 
4. Correspondence between Lie Group and Lie Algebra 
The correspondence between a matrix-Lie group and the associated Lie algebra is as follows. 
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1. Let G be a matrix-Lie group. The Lie algebra of G, which is denoted by g, is the set of all matrices 
X such that e"* € G holds for all real numbers t. 

In a given matrix-Lie group the elements close to the unit matrix can be represented as g(t) — e**X with 
X € g, and f close to zero. If the exponential map is surjective, as in the case of SO(n) and S E(n), 
then the elements of the group can be parameterized with the help of the matrix-exponential function 


by elements of the corresponding Lie algebra. The matrices i gt and pa respectively are called 
tangent vectors or tangent elements to g € G. Calculating these elements for t = 0, one gets X itself, 
i.e. g is the tangent space TjG at the identity matrix I. 

2. It can be shown that the Lie algebra assigned to a Lie group in this way is a Lie algebra also in the 
abstract sense. 

Let G be a matrix-Lie group with the associated matrix-Lie algebra g and X and Y elements of g. 
Then: 

e sX € g for any real numbers s, 

eX+Ye€g, 

e [X, Y}=XY-YX eg. 

Wi A: Tho Lie algebra so(2) associated to the Lie group SO(2) is calculated from the representation 
cos — sin 0 


of the elements g(0) — ( sind. tos 


) by SO(2) with the help of the tangential elements 





dg _4 — sin0 — cos cos@ sind ο ή 
do! B ( cos 0 1. sn 0 ) (5.1532) 
6—0 =0 
Consequently 


so(2) = {s B a) , s€ R} . (5.153b) 


Conversely, from 


X= a m comes e” = coss Pm o +sins Hu a = n ο : (5.153c) 


WB B: The following matrices form a basis for the Lie algebra so(3): 


00 0 0 01 0-10 
Χι-[ο0οθ-1],. X= 000), x,=/10 0). (5.154) 
01 0 -100 000 


Remark: The surjectivity of the exponential mappings so(3) + SO(3) and se(3) — SE(3) implies 
the existence of a (many-valued) logarithmic function. Nevertheless this logarithm function can be 
applied to interpolation. 

E.g. if rigid-body motions B4, B» € SE(3) are given, then log Bi, log B9 can be calculated which are 
elements of the Lie algebra so(3). Then between these logarithms linear interpolation (1 — t) log By + 
tlog B3 can be taken and then the exponential map can be applied in order to get an interpolation 
between the rigid-body motions B; and B5 by 


exp ((1 — t)log Bi 4- tlog B5). (5.155) 


B C: The matrix-Lie algebra se(3) associated to the matrix-Lie group S E(3) is generated by the 
matrices: 





00 
E -|00 (5.1562) 
00 
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0000 0 010 0-100 
00-10 0 000 1000 

Es=| 01 00| Es7|-1000|: £*7|00 00 (5.156b) 
00 0 0 0 000 0 0 00 


5. Inner Product 


For a given finite dimensional matrix-Lie group it is always possible to find an orthonormal basis for 
the associated Lie algebra if a suitable inner product (scalar product) is defined. In this case from any 
basis of the Lie algebra an orthonormal basis can be obtained by the Gram-Schmidt orthogonalization 
process (sce 4.6.2.2, 4. p. 316). 

In the case of a real matrix-Lie group the Lie algebra consists of real matrices and so an inner product 
is given by 


(X, Y)- Spur (XWY7) (5.157) 


with a positive definite real symmetric matrix W. 
@ A: The group of rigid-body motions S (2) can be parametrized as 


cos@ —sin@ x, 
g(z1, T2, 0) = eX +42X2 ef X3 (sse cos 0 2) with (5.158a) 
0 0 1 
001 000 0-10 
Χι-|000|, Χ;-|001)]), X3;={1 0 0]. (5.158b) 
000 000 000 


Here X4, X2, X; form an orthonormal basis of Lie algebra se(2) with respect to an inner product given 
by the weight matrix 


100 
w=(010 (5.158c) 
002 
W B: A basis of Lie algebra sl(2, IR) is 
0-1 10 01 
xi η. x, - (t 5) and x - (10) (5.159) 


These elements form an orthonormal basis with respect to the weight matrix W = Igy. = ( a D 


5.3.6.5 Applications in Robotics 
1. Rigid Body Motion 
The special Euclidean group SE(3), which describes the rigid-body motions in R?, is the semidirect 
product of group SO(3) (rotation about the origin) and R? (translations): 
SE(3) = SO(3) x RÀ. (5.160) 


In a direct product the factors have no interaction, but this is a semidirect product since rotations act 
on translations as it is clear from matrix multiplication: 


R t) [Rit \_ [RR Rt +t ο 
a oC I 0 1 ). (5-161) 


i.e. the first translation vector is rotated before the second translation vector is added. 
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2. Theorem of Chasles 
This theorem tells that every rigid-body motion which is not a pure translation can be described as a 
(finite) screw motion. A (finite) screwing motion along an axis through the origin has the form 


Op. 
A(6) = B η) : (5.1624) 
0 1 


where X is a unit vector in the direction of the axis of rotation, 0 is the angle of rotation and p is the 
angular coefficient. Since X is the axis of rotation RX — X, i.e. Xis an eigenvector of matrix R belonging 
to unit eigenvalue 1. 

When the axis of rotation does not go through the origin, then a point u of the axis of rotation is chosen 
which is shifted into the origin, then after the screwing it is shifted back: 


(o 1) |. η) (5 E - b ας) (5.162b) 


0 1 0 1 
The theorem of Chasles tells that an arbitrary rigid-body motion can be given in the above form, i.e. 
A Op. E 
Rt| [R πχτ- Βα (5.163) 
0 1 0 1 


for given R, t and appropriate p and i. Assuming that the angle of rotation 0 and the axis of rotation 
X are already known from R 
Op > 
egint 5.164 
I (5.164) 


is valid, so the angular coefficient p can be calculated. Then the solution of a linear system of equations 
gives U: 
+ Op. > e 

(I-R)i— 3,X-t- (5.165) 
This is a singular system of equations, where X is in its kernel. Therefore the solution ü is unique except 
to a manyfold of x. In order to determine u it is reasonable to require that u is perpendicular to X. 
When the rigid body motion is a pure rotation, then it is not possible to determine an appropriate 
vector ü. 


3. Mechanical Joints 
Joints with one degree of freedom can be represented by a one-parameter subgroup of the group S E(3). 
For the general case of screw joints the corresponding subgroup is 


0p. E 
A(0) — E a+ d (5.166) 


where X is the axis of rotation, 0 is the angle of rotation, p gives the angular coefficient and d is an 
arbitrary point on the axis of rotation. 
The most often occurring types of joints are the rotational joints which can be described by the following 
subgroup: 
R (I-R)u 
A() = ( a ) 


The subgroup corresponding the shift joints is 


A(6) = E T (5.168) 


(5.167) 
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where t describes the direction of the shifting. 

4. Forward Kinematics 

The goal in the case of industrial robots is the moving and control of the end effectors, which is done 
by joints in a kinematic chain. If all joints are of one parameter and the robot consists e.g. of 6 joints, 
then every position of the robot can be described by the joint-variables gt = (03, 05, 03, 04, 95, Og). 
The output state of the robot is described by the null vector. Then the motions of the robot can be 
described so that first the farest joint together the end effector are moved and this motion is given by 
the matrix A(05). Then the 5-th joint is moved. Since the axis of this joint should not be influenced 
by the motion of the last joint, this motion is given by the matrix A(05). In this way all the joints are 
moved, and the complete motion of the end effector is given by 


KÖ = Aı(01)A2(02)A3(03)A4(04)As5(05)A6s(06)- (5.169) 
5. Vector Product and Lie Algebra 


A screw is given by 
0p =x 
A(0) — (5 a pom (5.170) 
0 1 


and it represents rigid body motions parameterized by the angle 0. Obviously, 0 — 0 gives the identity 
transformation. If the derivative is calculated at 0 — 0, i.e. the derivative at the identity, then the 
general element of the Lie algebra is the following: 


dR p, dR, Ps —Qü 
—-| 49 2x7" d$" - 27 , (5.171a) 
6-0 0 0 0 0 








6-0 
dR. ; ] : : 
where Q = de X) is a skew symmetric matrix. It can be shown that R is an orthogonal matrix, so 


RR? = I and RRT = I holds and therefore 











d dR dRT dI 
RR? R™+R : 5.171} 
qp RR 7 ag dé dé (5.171b) 
Since R = I for 0 = 0 
dR dRT 
de 9 T ο ΞΟ. (5.171c) 
So every skew symmetric matrix 
0 -ω, Wy 
Q= ( w, ϐ | (5.171d) 
—Wy We 0 


can be identified with a vector W" = (We, Wy, Wz). In this way the multiplication of any three dimen- 
sional vector p by matrix Q corresponds to the vector product with vector @ : 

Qp-u xp. (5.171) 
Consequently QJ is the angular velocity of the rigid body with an amplitude w. 
Hence a general element of the Lie algebra se(3) has the form 


2 lon (5.171f) 


These matrices form a 6-dimensional vector space which is often identified with the 6-dimensional vec- 
tors of the form 


d- (e (5.172) 


ν 
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5.3.7 Rings and Fields 
In this section, there are discussed algebraic structures with two binary operations. 
5.3.7.1 Definitions 
1. Rings 
A set R with two binary operations + and * is called a ring (notation: (R,+,*)), if 
e (R, +) is an Abelian group, 
e (R,*) is a semigroup, and 
e the distributive laws hold: 

a*(b--c) — (ax*b)--(axc), (b+c)*a= (bxa)+ (c*a). (5.173) 
If (R, *) is commutative or if (R, *) has a neutral element, then (R, +, *) is called a commutative ring 
or a ring with identity (ring with unit element), respectively. 
A commutative ring with a unit element and without zero divisor is called the domain of integrity. 
A nonzero element of a ring is called zero divisor or singular element if there is a nonzero element of the 
ring such that their product is equal to zero. 
In a ring with zero divisor the following implication is generally false: a b — 0 —5 (a — 0 V b — 0). 
If Ris a ring with a unit element, then the characteristic of the ring R is the smallest natural number k 
such that kl = 1+ 1+... +1 = 0 (k times 1 equals to zero), and it is denoted by char R = k. If such 
a k does not exist, then char R = 0. 
char R = k means that the cyclic subgroup (1) of the additive group (R, +) generated by 1 has order 
k, so the order of every element is a divisor of k. 
If char R = k and for allr € R, then r 4- r 4- ... 4- r (k times)is equal to zero. The characteristic of a 
domain of integrity is zero or a prime. 
2. Division Ring, Field 
A ring is called division ring or skew field if (R \ {0}, *) is a group . 
If (R\ {0}, *) is commutative, then Risa field. So, every field is a domain of integrity and also a division 
ring. Reversed, every finite domain of integrity and every finite division ring is a field. This statement 
is a theorem of Wedderburn. 
Examples of rings and fields 
E A: The number domains Z, Q, R, and C are commutative rings with identity with respect to ad- 
dition and multiplication; Q, R, and C are also fields. The set of even integers is an example of a ring 
without identity. 
E B: Theset M, ofall square matrices of order n with real (or complex) elements is a non-commutative 
ring with respect to matrix addition and multiplication. It has a unit element which is the identity ma- 


trix. M, has zero divisors, e.g. for n = 2, G d m i} = (a nh ie. both matrices (i 7 and 


a $ are zero divisors in Mə. 


HC: The set of real polynomials p(x) = apn£” + an-1%"7! + - -- + ax + ao forms a ring with respect 
to the usual addition and multiplication of polynomials, the polynomial ring IR[r]. 

More generally, instead of polynomials over IR, polynomial rings over arbitrary commutative rings with 
identity element can be considered. 

W D: Examples of finite rings are the residue class rings Z, modulo n. Zp, consists of all the classes 
[a], of integers having the same residue on division by n. ([a]; is the equivalence class defined by the 
natural number a with respect to the relation ^g introduced in 5.2.4, 1., p. 334.) The ring operations 
®, © on Z, are defined by 


[a], € [b], — [a -- b]; and [a], O [b] — [a - b]. (5.174) 
If the natural number n is a prime, then (Z, , 9, C) is a field. Otherwise Z, has zero divisors, e.g. in 
Ze (numbers modulo 6) [3]; - [2]g — [0]g. Usually Z;, is considered as Z, = {0,1,...,n — 1}, ie. the 
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residue classes are replaced by representatives(see 5.4.3,3., S. 377). 

3. Field Extensions 

If K and L are fields and K C L, then L is an extension field or an over-field of K. In this case L can 
be considered as a vector space over A. 

If L is a finite dimensional space over K, then L is called a finite extension field. If this dimension is n, 
then L is called also an extension of degree n of K (Notation: [L : K| — n). 

E.g. C is a finite extension of R. C is two-dimensional over R, and (1, i) is a basis. IR is an infinite- 
dimensional space over Q. 

For a set M C L, K(M) denotes the smallest field (an over-field of &) which contains the field K and 
the set M. 

Especially important are the simple algebraic extensions (a), where o € L is a root of a polynomial 
from K [a]. The polynomial of lowest degree with a leading coefficient 1 having a as a root is called the 
minimal polynomial ofa over K. If the degree of the minimal polynomials of a € L is n, then K (a) is 
an extension of degree n, i.e. the degree of the minimal polynomials is equal to the dimension of L as 
a vector space over K. 

E.g. C — R(i) andi € C is the root of the polynomial z? - 1 € R[z], ie. C is a simple algebraic 
extension and [C : IR] — 2. 

A field, which does not have any proper subfield, is called a prime field. 

Every field K contains a smallest subfield, the prime field of K. 

Out of isomorphism, Q) (for fields of characteristic 0) and Z, (p prime, for fields of characteristic p) are 
the single prime fields. 


5.3.7.2 Subrings, Ideals 


1. Subring 

Suppose R = (R,+,*)isaringandU C R. IfU with respect to + and * is also aring, then U = (U, +, *) 
is called a subring of R. 

A non-empty subset U of a ring (R, +, *) forms a subring of F if and only if for all a,b € U also a + (—b) 
and a * b are in U (subring criterion). 

2. Ideal 

A subring J is called an ideal if for allr € R anda € I also r * a and a * r are in J. These special 
subrings are the basis for the formation of factor rings (see 5.3.7.3, p. 363). 

The trivial subrings (0) and R are always ideals of R. Fields have only trivial ideals. 

3. Principal Ideal 

If all the elements of an ideal can be generated by one element according to the subring criterion, then 
it is called a principal ideal. All ideals of Z are principal ideals. They can be written in the form 
mZ = (mg|g € Z) and they are denoted by (m). 


5.3.7.3 Homomorphism, Isomorphism, Homomorphism Theorem 


1. Ring Homomorphism and Ring Isomorphism 
1. Ring Homomorphism: Let Rı = (R1, +, *) and Rg = (R2, 0+, 04) be two rings. A mapping 
h: Rı > Rz is called a ring homomorphism if for all a,b € Rı 

h(a +b) = h(a) o4 h(b) and h(a xb) = h(a) o, h(b) (5.175) 
hold. 
2. Kernel: The kernelof h is the set of elements of Rı whose image by h is the neutral element 0 of 
(R2, +), and it is denoted by ker h: 

kerh = {a € Rı|hk(a) = 0}. (5.176) 
Here ker ^ is an ideal of R4. 
3. Ringlsomorphism: If h is also bijective, then A is called a ring isomorphism, and the rings Rı 
and Rz are called isomorphic. 
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4. Factor Ring: If J is an ideal of a ring (R, +, *), then the sets of co-sets {a + I|a € R} of I in the 

additive group (R, +) of the ring R (see 5.3.3, 1., p. 337) form a ring with respect to the operations 
(a4- I)o4 (b4- I) 2 (a--b) --I and. (a4 I) o, (b 4- I) — (a b) 4- I. (5.177) 

This ring is called the factor ring of R by I, and it is denoted by R/T. 

The factor ring of Z by a principal ideal (m) is the residue class ring Z, = Z/(m) (see examples of rings 

and fields on p. 361). 

2. Homomorphism Theorem for Rings 

If the notion of a normal subgroup is replaced by the notion of an ideal in the homomorphism theorem 

for groups, then the homomorphism theorem for rings is obtained: A ring homomorphism h: Ry > R2 

defines an ideal of Ft; , namely ker h = {a € Ri|h(a) = 0}. The factor ring R/ ker h is isomorphic to the 

homomorphic image h(R,) = {h(a)|a € Ri}. Conversely, every ideal I of R, defines a homomorphic 

mapping nat;: Ry + R2/I with nat;(a) = a+J. This mapping nat; is called a natural homomorphism. 





5.3.7.4 Finite Fields and Shift Registers 
1. Finite Fields 


The following statements give an overview of the structure of finite fields. 
1. Galois Field GF For every power of primes p" there exits a unique field with p" elements (out 
of an isomorphism), and every finite field has p” elements. The fields with p" elements are denoted by 
GF(p") (Galois field). 
Note: For n > 1 GF(p") and Zp» are different. 
In constructing finite fields with p” elements (p is prime, n > 1), the ring of polynomials over Zp (see 
5.3.7,2., p. 361, IE C) and irreducible polynomials are needed: Z, [v] consists of all polynomials with 
coefficients from Z, . The cocfficients are calculated modulo p. 
2. Algorithm of Division and Euclidean Algorithm In a ring of polynomials K [x] the division 
algorithm is applicable (dividing polynomials with a remainder), i.e. for f(x), g(x) € K[x], degf(x) € 
degg(x) there exist g(x), r(x) € K[2] such that 

g(a) = q(x) - f(x) + r(x) and deg r(x) < deg f(x). (5.178) 
This relation is denoted by r(r) = g(x)(modf(«)). Repeatedly performed division with remainders is 
known as the Euclidean algorithm for rings of polynomials and the last nonzero remainder gives the 
greatest common divisor of f(x) and g(x). 
3. Irreducible Polynomials A polynomial f(x) € K[|zx] is irreducible if it can not be represented 
as a product of polynomials of lower degrees, i.e. (analogously to the prime numbers in Z) f(x) is a 
prime in K[z]. E.g. for polynomials of second or third degree irreducibility means, that they do not 
have roots in ΚΚ. 
It can be shown that there are irreducible polynomials of arbitrary degree in K[x]. If f(x) € K{z] is 
an irreducible polynomial, then 

K|x]/ f(x) :— fp(x) € K [a] | deg p(x) < deg f (x)} (5.179) 
is a field, where addition and multiplication are performed modulo f(x), ie. g(x) * h(x) = g(x) - 
h(x) (mod f ()). 
If K — Z, and deg f(x) — n, then K [z]/ f (x) has p" elements, i.e. GF(p") — Z,[x]/ f (z), where f(x) is 
an irreducible polynomial of degree n. 
4. Calculation Rule in GF(p”) In GF(p") the following useful rule is valid: 

(a+b) =a" +P" reN. (5.180) 
So, in GF(p") =Z,[2]/ f(x) there is an element a = x, a root of the polynomial f(x) irreducible in 
Z,(x), and GF(p") = Z,[x]/f(x) = Z,(a). It can be proven that Z, (o) is the splitting field of f(x). 
The splitting field of a polynomial from Z,[:] is the smallest extension field of Z, which contains all 
roots of f (x). 
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5. Algebraic Closure, Fundamental Theorem of Algebra A field K is algebraically closed if 
all roots of the polynomials from K |x] arein K. The fundamental theorem of algebra tells that the field 
C of complex numbers is algebraically closed. An algebraic extension L of K is called the algebraic 
closure of K if L is algebraically closed. The algebraic closure of a finite field is not finite. So there are 
infinite fields with characteristic p. 


6. Cyclic and Multiplicative Group The multiplicative group K* = K \ {0} of a finite field 
K is cyclic, i.e. there is an element a € K such that every element of K* is a power of a: K* = 
{1,a,a7,...,a7-*}, if K has q elements. 

An irreducible polynomial f(x) € K[x] is called primitive, if the powers of a represents all nonzero 
elements of L := K[2|/ f(a), ie. the multiplicative group L* of L can be generated by x. 

With a primitive polynomial f(x) of degree n it is possible to construct a ,,Table of logarithm” for 
GF(p") from GF(p)[z], which makes calculations easier. 

Wi Construction of field GF(23) and its table of logarithm. 

f(x) 9 1-4 x 4 2? is irreducible over Z[r], since neither 0 nor 1 are roots of it: 


GF(25) — Za|v]/ f(x) = {ao + ax + azz? | ag,a1,a2 € Z3 ^ 32 — 14 x). (5.181) 
f(x) is primitive, so a table of logarithm can be created for GF(2?): 
Two expressions are assigned to every polynomial ag 4- ax -- a3? from Za[v]/ f(z). The coefficient 


vector ag, a, a? and the so called logarithm which is a natural number i such that zê = ao + a£ + agx? 
modulo 1 4- z + 2°. The table of logarithm is: 








e Addition of the field elements (KE) in GF(8): 
e Addition of the coordinate vectors (KV) componentwise mod 2 
(in general mod p). 
e Multiplication of the field elements (KE) in GF(8): 
e Addition of logarithms (Log) mod 7 (in general mod (p^ — 1)). 
za T 5 9 
Example: E cUm . 76 =r°=2 














Remark: Finite fields are extremely important in coding theory as linear codes, where vector spaces 
in form (GF(q))" are considered. A subspace of such a vector space is called linear code (see 5.4.6.2,3., 
p. 385). The elements (code words) of a linear code are also n-tuples with elements from a finite field 
GF(q"). In applications in code theory it is important to know the divisors of X" — 1. 

The splitting field of X" — 1 € K[X] is called the n-th cyclotomic field over K. 

If the characteristic of ΜΚ is not a divisor of n and a is a primitive n-th unit root, then: 

a) The extension field K (a) is the splitting field of X” — 1 over K. 

b) In K (a), the field X" — 1 has exactly n pairwise different roots which form a cyclic group, and among 
them there are y(n) primitive n-th unit roots, where y(n) denotes the Euler function (5.4.4,1., p. 381). 
By the k-th powers (k < n, g.c.d.(k,n)=1) of a primitive n-th unit root a all unit roots can be got. 


2. Applications of Shift Registers 


Calculations with polynomials can be performed well by a linear feedback shift register (see Fig.5.18). 
With a linear feedback shift register based on the feedback polynomial f(x) = fo+fiw+---+fp127 t+ 
1^ and from the state polynomial s(r) — so 4- sz -- --- 4- 5, 127^! one gets the state polynomial 
s(x): x — Sr-1: f(x) = s(x) - x (mod f(x)). 

Especially, if s(z) — 1, after i steps (i-times applications) the state polynomial is z? (mod f(x)). 

Wl Demonstration with the example from page 364: The primitive polynomial f(z) 2 1--z4-z? € Za[v] 
is chosen as feedback polynomial. Then the shift register with lengh 3 has the following sequence of 
states: 
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Figure 5.18 

From the initial state: 10021 (mod f(x)) 
the states follow as: 01052 (mod f(x)) 
00122? (mod f (x)) 

11053? =l+a2 (mod f (z)) 

01125z* zr-4z? (mod f (z)) 

111525? Έ 1 γα γα (mod f(z)) 

1012385 z1-42? (mod f (z)) 

100257 z1 (mod f (z)) 


The states are considered as coefficient vectors of a state polynomial sp + sy 4- sx?. 
In general: A linear feedback shift register with length r gives a sequence of states of maximal length 
with period 2” — 1 if and only if the feedback polynomial is a primitive polynomial of degree r. 


5.3.8 Vector Spaces * 
5.3.8.1 Definition 


A vector space over a field F' consists of an Abelian group V = (V, +) of “ vectors” written in additive 
form, of a field F = (F,+,*) of “ scalars” and an exterior multiplication F x V — V, which assigns 
to every ordered pair (k, v) for k € F and v € V a vector kv € V. These operations have the following 
properties: 


(V1) (u+v)+w =u + (v+ w) foral u,v, w € V. (5.182) 
(V2) Thereisa vector 0 € V such that v +0 = v for every v € V. (5.183) 
(V3) To every vector v there is a vector — v such that v + (—v) = 0. (5.184) 
(V4) v+w=w +v for every v,w E V. (5.185) 
(V5) 1v-— v for every v € V, 1 denotes the unit element of F. (5.186) 
(V6) r(sv) — (rs)v for every r,s € F and every v € V. (5.187) 
(VT) (r4 s)v — rv 4- sv for every r,s € F and every v € V. (5.188) 
(V8) r(v-4- w) — rv c rw forevery r € F and every v,w € V. (5.189) 


If F = R holds, then it is called a real vector space. 


Examples of vector spaces: 

W A: Single-column or single-row real matrices of type (n, 1) and (1, n), respectively, with respect to 
matrix addition and exterior multiplication with real numbers form real vector spaces IR" (the vector 
space of column or row vectors; see also 4.1.3, p. 271). 


W D: All real matrices of type (m,n) form a real vector space. 
E C: All real functions continuous on an interval [a, b] with the operations 


(F +9)(2) = f(x) + g(a) and. (kf)(x) = k- f(x) (5.190) 





*In this paragraph, generally, vectors are not printed in bold face. 
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form a real vector space. 
Function spaces have a fundamental role in functional analysis (see Ch. 12, p. 654). For further exam- 
ples see 12.1.2, p. 655. 


5.3.8.2 Linear Dependence 

Let V be a vector space over F. The vectors v1, v, ..., v4 € V are called linearly dependent if there 
are ky, ko,..., km E K not all of them equal to zero such that 0 = kyu, + kave + +++ + kmUm holds. 
Otherwise they are linearly independent. Linear dependence of at least two vectors means that one of 
them is a multiple of the other. 

If there is a maximal number n of linearly independent vectors in a vector space V, then the vector 
space V is called n dimensional. This number n is uniquely defined and it is called the dimension. 
Every n linearly independent vectors of V form a basis. If such a maximal number does not exist, then 
the vector space is called infinite dimensional. The vector spaces in the above examples are n, m - n, 
and infinite dimensional. 

In the vector space R”, n vectors are independent if and only if the determinant of the matrix, whose 
columns or rows are these vectors, is not equal to zero. 

If {v1, v2,..-, Un} form a basis of an n-dimensional vector space over F’, then every vector v € V has a 
unique representation v = kyvy + kov + -+ + knUn with ki, k2... , kn E F. 

Every set of linearly independent vectors can be completed into a basis of the vector space. 


5.3.8.3 Linear Operators 


1. Definition of Linear Operators 
Let V and W be two real vector spaces. A mapping f : V —» W from V into W is called a linear 
mapping or linear transformation or linear operator (see also 12.1.5.2, p. 658) from V into W if 


f(u--v) 2 fu-- fv forall u,v eV, (5.191) 
f(Au) =Afu forall u € V and all real A. (5.192) 

B 
WB A: The mapping fu :— f u(t) dt, which transforms the space C[a, 8] of continuous real functions 


Q 
into the space of real numbers is linear. 
In the special case when W = R}, as in the previous example, linear transformations are called linear 
functionals. 
W B: Let V = R” and let W be the space of all real polynomials of degree at most n — 1. Then the 
mapping f (ao, @1,---,@n—1) := ag -- a1 + aga” +--+ + a,_12"71 is linear. In this case each n-element 
vector corresponds to a polynomial of degree € n — 1. 
B C: If V = R” and W = R”, then all linear operators f from V into W (f: R” — R”) can 
be characterized by a real matrix A = (aix) of type (m,n). The relation Ax = y corresponds to the 
system of linear equations (4.174a) 


Q11 412 *** Qin Ti yı 
021 022 :** Ω9ῃ T2 Y2 
Am1 Am2 ``: Amn Yn Um 


2. Sum and Product of Two Linear Operators 
Let f: V — W, g: V — W and h: W — U be linear operators. Then the 


sum f +g: V — W isdefinedas (f -- g)u — fu-- gu forall u € V and the (5.193) 
product hf: V —> U is defined as (hf)u — h(fu) for all u € V. (5.194) 
Remarks: 


1. If f, g and h are linear, then f + g and fh are also linear operators. 
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2. The product (5.194) of two linear operators represents the consecutive application of these operators 
fandh. 


3. The product of two linear operators is usually non-commutative even if the products exist: 


hf z fh. (5.195a) 
Commutability exists, if 
hf — fh=0 (5.195b) 


holds. In quantum mechanics the left-hand side of this equation hf — fh is called the commutator. In 
the case (5.195a) the operators f and h do not commutate, therefore we have to be very careful about 
the order. 


W Asa particular example of sums and products of linear operators one may think of sums and products 
of the corresponding real matrices. 
5.3.8.4 Subspaces, Dimension Formula 
1. Subspace: Let V be a vector space and U a subset of V. If U is also a vector space with respect to 
the operations of V, then U is called a subspace of V. 
A non-empty subset U of V is a subspace if and only if for every u1, uz E€ U and every k € F also u1 +u2 
and k - uy are in U (subspace criterion). 
2. Kernel, Image: Let Vi, V? be vector spaces over F. If f: Vi — V3 is a linear mapping, then the 
linear subspaces kernel (notation: ker f) and image (notation: im f) are defined in the following way: 
ker f = {v € V|f(v) =0}, im f={f(v)|v eV}. (5.196) 
So, for example, the solution set of a homogeneous linear equation system Ax = 0 is the kernel of the 
linear mapping defined by the coefficient matrix A. 
3. Dimension: The dimension dim ker f and dim im f are called the defect f and rank f, respectively. 
For these dimensions the equality 
defect f + rank f = dim V, (5.197) 
is valid and is called the dimension formula. In particular, if the defect f = 0, i.e., ker f = {0}, then 
the linear mapping f is injective, and conversely. Injective linear mappings are called regular. 
5.3.8.5 Euclidean Vector Spaces, Euclidean Norm 
In order to be able to use notions such as length, angle, orthogonality in abstract vector spaces we 
introduce Euclidean vector spaces. 
1. Euclidean Vector Space 
Let V be a real vector space. If y: V x V + R is a mapping with the following properties (instead of 
tp(v, w) one writes v - w) for every u,v,w € V and for every r c R 


(S1) v-w=w-v, 5.198) 


5.199) 
5.200) 
5.201) 


(S2) (ucv):w-—u-:w-v-:w, 
(S3) r(v.w) — (rv) -w — v- (rw), 
(S4) v-v 0ifand only if v Z 0, 


then y is called a scalar product on V. If there is a scalar product defined on V, then V is called a 
Euclidean vector space. 


( 
( 
( 
( 


These properties are used to define a scalar product with similar properties on more general spaces, too 
(see 12.4.1.1, p. 673). 


2. Euclidean Norm 
The value 


lell = vv -v (5.202) 
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denotes the Euclidean norm (length) of v. The angle a between v, w from V is defined by the formula 
er (5.203) 

TOTAL 5. 

[οἱ] τω] 

If v - w = 0 holds, then v and w are called orthogonal to each other. 


E Orthogonality of Trigonometric Functions: In the theory of Fourier series (see 7.4.1.1, p. 474), 
there are functions of the form sin kx and cos kx. Theses fubctions can be considered as elements of 
C0, 27]. In the function space Ca, b] the formula 


1:27 [ fot ds (5.204) 


defines a scalar product. Since 


cosa = 


2π 2π 
J sinkz.sinlrdr — 0 (kl), (5.905) / coskx-coslrdr=0 (k#1), (5.906) 
0 0 


2m 
[ sin ke - cosla dx = 0, (5.207) 
“Ὁ 


the functions sin kx € C[0, 27] and coslax € C[0, 27] for every k,1 € IN are pairwise orthogonal to each 
other. This orthogonality of trigonometric functions is used in the calculation of Fourier coefficients in 
harmonic analysis (see 7.4.1.1, p. 474). 


5.3.8.6 Bilinear Mappings, Bilinear Forms 
Bilinear mappings can be considered as generalizations of different products between vectors. In that 
case bilinearity uses the distributivity of the corresponding product with respect to vector addition. 
1. Definition 
Let U, V, W be vector spaces over the same field A. A mapping f: U x V —> W is called bilinear if 
for every u€ U the mapping v+> f(u, v) is a linear mapping of V into W and 
for every v € V the mapping u > f(u, v) is a linear mapping of U into W. (5.208) 
It means that a mapping f: U x V —35 W is bilinear, if for every k € K, u,u' € U, and v,' € V holds: 
f(utu',v) = f(u,v) 4 f(u' v), f(ku,v) — kf(u,v) and 
f(u,v v) 2 f(u,v) t- f(u v), f(u, kv) 2 kf(u,v). (5.209) 
If f is replaced by the dot product or vector product or by a multiplication in a field, these relations 
describe the left sided and right sided distributivity of this multiplication with respect to vector addi- 
tion. 
Especially, if U — V, and W — K which is the underlying field, then f is called a bilinear form. In this 
book only the real (K = R) or complex (K = C) cases are considered. 
Examles of Bilinearforms 
E A: U = V = R”, W =R, f is the dot product in R”: f(u, v) = uTv — Y? 4, u;v; , where u; and vj 
(i = 1,2,...,n) denote the Cartesian coordinates of u and v. 
E B: U =V =W =R, f isthe cross product in R: 
f(u, v) 2 u x v 2 (usua — v9u3 , vyua — uyUa , ujUo — viua)? . 
2. Special Bilinear Forms 
A bilinear form f: V x V — R is called 
e symmetric, if f(v, v^) — f(v',v) forevery v, v € V , 
e skew-symmetric, if f(v, v) — —f(v', v) for every v, v' € V and 
e positive definite, if f(v, v) > 0 for every v E€ V v Z 0. 
So an Euclidean dot product in V (see 5.3.8.5, p. 367) can be characterized as a symmetric, positive 
definite bilinear form. The canonical Euclidean dot product in R” is defined as f(u, v) — uTv. 
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In finite dimensional spaces V a bilinear form can be represented by a matrix: If f:= V x V —> Ris 
a bilinear form, and B = (by, be,..., bn) is a basis of V, then the matrix 


Ag(f) — (f(bi b;)i;) (5.210) 
is the representation matrix of f with respect to basis B. The bilinear form then can be written in 
matrix product form: 

f(v,v') = vT AB(f)v', (5.211) 

where v and v’ are given with respect to basis B. 
The representation matrix is symmetric, if the bilinear form is symmetric. In complex vector spaces 
(because 2? can be a negative number) symmetric, positive definite bilinear forms do not have much 
sense. To define an unitary dot product and also distances and angles with it instead of bilinear form 
the concept of the so called sesquilinear form is used [5.6], [5.12]. 


3. Sesquilinear Form 
A mapping f: V x V — C is called sesquilinear form if for every v, v' € V and k € C: 

flu+ u,v) = fluo) + f(u,v), f(ku,v) =kf(u,v) and 

f(u,v v) 2 f(u,v) t- fluo’), flu, kv) = k* f(u,v). (5.212) 
where k* denotes the complex conjugate of k. The function is linear in the first argument and ,,semi- 
linear” in the second argument. Analogously to the real case ,,symmetry” is defined in the following 
way: 
A ecu form f: V x V —> Cis called hermitianif f (v, v') — f (v', v)* for every v,v' € V. 
In this way a (unitary) dot product is characterized by an hermitian, positive definite sesquilinear form. 
The canonical unitary dot product in C” is defined as f(u,v) = uTv*. 
If V is finite dimensional, then a sesquilinear form can be represented by a matrix (like in the real 
case): 
If f V x V — Cis a sesquilinear form, and B = (bi, b2,...,bn) is a basis of V, then the matrix 
Apg(f) — (f (bi, b;))i; is the representation matrix of f with respect to basis B. The sesquilinear form 
can be written in matrix product form: 

f(v,v) 2 vT Ag(f)v , (5.213) 
where v and v' are given with respect to basis B. A representation matrix is hermitian if and only if 
the sesquilinear form is hermitian. 
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5.4 Elementary Number Theory 


Elementary number theory investigates divisibility properties of integers. 
5.4.1 Divisibility 
5.4.1.1 Divisibility and Elementary Divisibility Rules 


1. Divisor 
An integer b € Z is divisible by an integer a without remainder iff * there is an integer q such that 

qa — b (5.214) 
holds. Here a is a divisor of b in Z, and q is the complementary divisor with respect to a; b is a multiple 
of a. For “a divides b” we write also alb. For “a does not divide b” we can write a/b. The divisibility 
relation (5.214) is a binary relation in Z (see 5.2.3, 2., p. 331). Analogously, divisibility is defined in 
the set of natural numbers. 


2. Elementary Divisibility Rules 








(DR1)  Forevery a € Z we have l|a, a|a and a|0. (5.215 

(DR2)  Ifa]|b, then (—a)|b and a|(—0^). (5.216 

(DR3)  a|b and b|a implies a — b or a — —b. (5.217 

(DR4) — afl implies a= 1 or a= —1. (5.218 

(DR5) — alb and b £0 imply |a| < |b]. (5.219 

(DR6)  a|b implies a|zb for every z € Z. (5.220 

(DR?)  a|bimplies az|bz for every z € Z. (5.221 

(DRS)  az|bz and z z 0 implies a|b for every z € Z. (5.222 

(DR9)  a|b and b|c imply a|c. (5.223 

(DR10) a|b and c|d imply ac|bd. (5.224) 
(DRA11) a|b and a|c imply a|(z1b 4- z2c) for arbitrary z1, z2 € Z. (5.225 

(DR12) a|b and a|(b 4- c) imply a|c. (5.226 

5.4.1.2 Prime Numbers 

1. Definition and Properties of Prime Numbers 

A positive integer p (p > 1) is called a prime number iff 1 and p are its only divisors in the set IN o 


positive integers. Positive integers which are not prime numbers are called composite numbers. 

For every integer, the smallest positive divisor different from 1 is a prime number. There are infinitely 
many prime numbers. 

A positive integer p (p > 1) isa prime number iff for arbitrary positive integers a,b, p|(ab) implies pla 
or p|b. 

2. Sieve of Eratosthenes 

By the method of the "Seve of Eratosthenes" , every prime number smaller than a given positive integer 
n can be determined: 

a) Write down the list of all positive integers from 2 to n. 

b) Underline 2 and delete every subsequent multiple of 2. 

c) If p is the first non-deleted and non-underlined number, then underline p and delete every p-th 
number (beginning with 2p and counting the numbers of the original list). 

d) Repeat step c) for every p (p < vnr) and stop the algorithm. 





*if and only if 
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Every underlined and non-deleted number is a prime number. In this way, all prime numbers € n are 
obtained. 

The prime numbers are called prime elements of the set of integers. 

3. Prime Pairs 

Prime numbers with a difference of 2 form prime pairs (twin primes). 

W (3,5), (5, 7), (11, 13), (17, 19), (29, 31), (41, 43), (59, 61), (71, 73), (101, 103) are prime pairs. 

4. Prime Triplets 

Prime triplets consist of three prime numbers occuring among four consecutive odd numbers. 

W (5,7,11), (7, 11, 13), (11, 13, 17), (13, 17, 19), (17, 19, 23), (37, 41, 43) are prime triplets. 

5. Prime Quadruplets 

If the first two and the last two of five consecutive odd numbers are prime pairs, then they are called a 
prime quadruplet. 

E (5,7,11,13), (11, 13, 17, 19), (101, 103, 107, 109), (191, 193, 197, 199) are prime quadruplets. 

The conjecture that there exist infinitely many prime pairs, prime triplets, and prime quadruplets, is 
not proved still. 

6. Mersenne Primes 

If2F — 1, k € N, is a prime number, then Kk is also a prime number. The numbers 2? — 1 (p prime) 
are called Mersenne numbers. A Mersenne prime is a Mersenne number 2? — 1 which is itself a prime 
number. 

W 2? — 1isa prime number for the first ten values of p: 2, 3, 5, 7, 13,17, 19, 31, 61, 89, 107, etc. 
Remark: Since a few years the largest known prime is always a Mersenne prime, e.g. 21912609 — ] jy 
2008, 257555161 — 1 1 2013. In contrary to other natural numbers the numbers of the form 2* — 1 can 
be tested in a relatively simple way whether they are primes: Let p > 2 be a prime and a sequence of 
natural numbers is defined by s, — 4, s;44 :— 8? — 2 (i > 1). The number 2? — 1 is a prime if and only 
if the term of the sequence s; is divisible by 2? — 1. 

The prime test based on this statement is called Lucas-Lehmer test . 

7. Fermat Primes 

Ifanumber 2*+1, k € N, isan odd prime number, then k is a power of 2. The numbers 2*+1, k €N, 
are called Fermat numbers. If a Fermat number is a prime number, then it is called a Fermat prime. 
W For k — 0,1,2,3,4 the corresponding Fermat numbers 3, 5, 17, 257, 65537 are prime numbers. It is 
conjectured that there are no further Fermat primes. 

8. Fundamental Theorem of Elementary Number Theory 

Every positive integer n > 1 can be represented as a product of primes. This representation is unique 
except for the order of the factors. Therefore n is called to have exactly one prime factorization. 

M 360 =2-2-2-3-3-5=23-3?-5, 

Remark: Analogously, the integers (except —1,0, 1) can be represented as products of prime elements, 
unique apart from the order and the sign of the factors. 

9. Canonical Prime Factorization 

It is usual to arrange the factors of the prime factorization of a positive integer according to their size, 
and to combine equal factors to powers. If every non-occurring prime is assigned exponent 0, then every 
positive integer is uniquely determined by the sequence of the exponents of its prime factorization. 

WB 101533312 — 27 - 3? - 11? belongs the sequence of exponents (7, 2, 0,0, 3, 0, 0, . . .). 

For a positive integer n, let pı, po,...Dm be the pairwise distinct primes divisors of n, and let œp denote 
the exponent of a prime number p; in the prime factorization of n. 'Then 


n= II». (5.2272) 
ki 
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and this representation is called the canonical prime factorization of n. It is often denoted by 
n 2 [[»^, (5.227b) 
P 


where the product applies to all prime numbers p, and where vp(n) is the multiplicity of p as a divisor 
of n. It always means a finite product because only finitely many of the exponents vp(n) differ from 0. 
10. Positive Divisors 

If a positive integer n > 1 is given by its canonical prime factorization (5.227a), then every positive 
divisor t of n can be written in the form 


t= [[ pë with € {0,1,2,..., ap} for k = 1,2,...,m. (5.228a) 
k=1 


The number 7 (n) of all positive divisors of n is 


7(n) — [] (ax +1). (5.228b) 
k=1 
E A: 7(5040) = 7(24- 3? - 5-7) = (4+ 1)(2+1)(1 + 1)(1 + 1) = 60. 
W D: 7(pipo-::p,) — 2^, if pi, pa, ..., p, are pairwise distinct prime numbers. 


The product P(n) of all positive divisors of n is given by 
P(n) = n7., (5.228c) 
B A: P(20) — 20? — 8000. B B: P(j?) — pP, if pis a prime number. 
Bl C: P(pq) — p?q?, if p and q are different prime numbers. 
The sum c (n) of all positive divisors of n is 


m pakti _ 1 


c(n) — TI Pr 


κ Pol 
E A: o(120) = o(2? - 3-5) = 15-4- 6 = 360. E B: o(p) =p + 1, ifp is a prime number. 


(5.228d) 


5.4.1.3 Criteria for Divisibility 
1. Notation 


Consider a positive integer given in decimal form: 


n — (aga: a20109)10 — a&10* 4- aj 41057 4 - -- E a910? -- a410 + ao. (5.229a 
Then 

Qi(n) = do + ay + ag +--+ + aK (5.229b 
and 

Qi(n) = ao — ay + ag — +++ + (—1)* ay (5.229c 


are called the sum of the digits (of first order) and the alternating sum of the digits (of first order) of n, 
respectively. Furthermore, 








Q»(n) — (a1a9)10 4- (a3a2)10 + (a5a4)10 4- --- and (5.229d 

Qs(n) — (a1ao)1o — (a3a2)10 -- (a5a4)10 — ^: -- (5.229e 
are called the sum of the digits and the alternating sum of the digits, respectively, of second order and 

Qs(n) — (aga1a9)10 4- (a5a403)10 - (agazag)10 4- --- (5.229f 
and 

Qs(n) — (aoa100)10 — (a5a4a3)10 -- (agazac)1o στι (5.229g 





are called the sum of the digits and alternating sum of the digits, respectively, of third order . 
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WB The number 123 456 789 has the following sum of the digits: Q1 = 9+8+7+6+5+4+3+2+1 = 45, 
Q1 — 9—84-7—64-5—44-3—2-4-1 — 5, Q9 — 894-67--454-234-1 — 225, Q5 — 89—67-4-45—23--1 — 45, 
Q3 = 789 + 456 + 123 = 1368 and Q = 789 — 456 + 123 = 456. 

2. Criteria for Divisibility 

There are the following criteria for divisibility: 

















DC-1: 3|n e 3|Qi (n), (5.230a) DC-2: "|n e T|Q5(n), (5.230b) 
DC-3: 9|n & 9|Q; (n), (5.230c) DC-4: ll|n e 11|Q! (n), (5.230d) 
DC-5: 13|n e 13|Q5(n) (5.230e) DC-6: 37|n e 3T|Qs(n), (5.230f) 
DC-7: 101|n e 101|Q5(n), (5.230g) DC-8: 2|n € 2]|ao, (5.230h) 
DC-9: 5|n & 5|ao, (5.2301) DC-10: 2*|n & 2*|(ay 1a& 3---a1a9)1o, (5.230) 


DC-11: 5*|n & 5*|(ay 1a& 3---a1a0)1o.. (5.230k) 

W A: a — 123456 189 is divisible by 9 since Qi(a) = 45 and 9|45, but it is not divisible by 7 since 
Q3(a) = 456 and 7/456. 

W B: 91619 is divisible by 11 since Q{ (91619) = 22 and 11]22. 

Wi C: 99994 096 is divisible by 24 since 24/4096. 

5.4.1.4 Greatest Common Divisor and Least Common Multiple 


1. Greatest Common Divisor 


For integers a), @9,...,@p, which are not all equal to zero, the largest number in the set of common 
divisors of a1, d2,...,@n is called the greatest common divisor of ay, a2,...,@n, and it is denoted by 
gcd(a, a2, ... a5). If ged(a1,a2,..., a4) — 1, then the numbers a4, a2, ..., à, are called coprimes. 


To determine the greatest common divisor, it is sufficient to consider the positive common divisors. If 
the canonical prime factorizations 


aj 5 [[»^? (5.231a) 
p 


0f à1, 02, ..., a4 are given, then 


gcd(ai, aa, ..., as) — Πο min [p(a;)] ) 


Ῥ 


(5.231b) 


Bb For the numbers a, = 15 400 = 23 . 52 . 7 - 11, a2 = 7 875 = 3? . 53 . 7, ag = 3850 = 2- 52. T- 11, the 
greatest common divisor is ged(a1, a2, a3) = 5° - 7 = 175. 
2. Euclidean Algorithm 
The greatest common divisor of two integers a, b can be determined by the Euclidean algorithm without 
using their prime factorization. To do this, a sequence of divisions with remainder, according to the 
following scheme, is performed. For a » blet ag — a,a, = b. Then: 

ao = Gia, + Q2, 0 < a < a, 

a, = Q202 + 03, 0 < a3 < a2, 


: (5.232a) 
An—2 = Qn-10n-1 t Qn; 0« An < ün—1; 


απ. = Qnan. 
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The division algorithm stops after a finite number of steps, since the sequence a2, a3,... is a strictly 
monotone decreasing sequence of positive integers. The last remainder a; different from 0 is the great- 
est common divisor of ag and a4. 


W gcd(38, 105) = 1, as can be seen by the help of the table to the right. 





By the recursion formula pe = : ` a τ A 
gcd(ai, a2, ..., a5) — gcd(ged(ai, ao, . .., a5 1), a5); (5.232b) 9)- 3.9.9 
the greatest common divisor of n positive integers with n > 2 can be de- 9=4-241 
termined by repeated use of the Euclidean algorithm. 2=2-1 
W gcd(150, 105,56) = ged(ged(150, 105), 56) = gcd(15, 56) = 1. 
WB The Euclidean algorithm to determine the ged (see also 1.1.1.4, 1., 55 =1-34421 
p. 3) of two numbers has especially many steps, if the numbers are adja- 34=1-21413 
cent numbers in the sequence of Fibonacci numbers (see 5.4.1.5, p. 375). 21=1-13+8 
The annexed calculation shows an example where all quotients are al- 13 =1-84+5 
ways equal to 1. 8=1-54+3 
3. Theorem for the Euclidean Algorithm 5=1-3+4+2 
For two natural numbers a,b with a > b > 0, let A(a, b) denote the 3=1-2+1 
number of divisions with remainder in the Euclidean algorithm, and let 2=1-1+1 
&(b) denote the number of digits of b in the decimal system. Then 121.1 
A(a,b) € 5- (b). (5.233) 


4. Greatest Common Divisor as a Linear Combination 
It follows from the Euclidean algorithm that 

az = Ag — NA, = Codo + doar, 

a3 = A1 — G2d2 = 6ιαρ + diay, 





(5.234a) 
an = n-2 — Qn-1ün-1 — Cn-200 t di 501. 
Here c49 and d, 3 are integers. Thus the gcd(ao, a1) can be represented as a linear combination of ag 
and a, with integer coefficients: 


gcd(ag, a1) = Cn-240 4- ds 3a. (5.234b) 
Moreover gcd(ai, a2, . .., a4) can be represented as a linear combination of a1, a2, . . . , ἄπ, since: 
gcd(a1, a2, ..., a4) — ged(gced(a1,a2,..., a5 1),a5) — c: ged(a1,a9,...,a5 1) + dan. — (5.234c) 


E gcd(150, 105,56) — gcd(gcd(150,105),56) — gcd(15,56) — 1 with 15 — (—2) - 150 4- 3 105 and 
1 = 15-15 + (—4) - 56), thus ged(150, 105, 56) = (—30) - 150 + 45 - 105 + (—4) - 56. 


5. Least Common Multiple 


For integers a1, d2,...,@n, among which there is no zero, the smallest number in the set of positive 
common multiples of a1, a2,..., Qn is called the least common multiple of a1, a5, .. . , a4, and itis denoted 
by lem(a;, a2, . . . απ). 


If the canonical prime factorizations (5.231a) of a4, a3, . .., a4 are given, then: 


I [vp(ai)| } 

Ιαπ(αι;αχ....,αῃ)--]]ρ! j (5.235) 
p 

E For the numbers a = 15400 = 2? - 52. 7-11, a, = 7875 = 3? - 5° - 7,43 = 3850 = 2-5?-7-11 the 

least common multiple is lem(a;, a5, a3) — 23 - 32-55. 7- 11 — 693000. 
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6. Relation between gcd and lcm 
For arbitrary integers a, b: 

|ab| = gcd(a, b) - lem(a, 0). (5.236) 
"Therefore, the lcm(a, b) can be determined with the help of the Euclidean algorithm without using the 
prime factorizations of a and b. 


5.4.1.5 Fibonacci Numbers 


1. Recursion Formula 
The sequence 

(Fui)asN with F,— F?—1 and F413 — Fy t Fay (5.237) 
is called Fibonacci sequence. It starts with the elements 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 
233, 377, ... 
E The consideration of this sequence goes back to the question posed by Fibonacci in 1202: How many 
pairs of descendants has a pair of rabbits at the end of a year, if every pair in every month produces a 
new pair, which beginning with the second month itself produces new descended pairs? The answer is 
Fy4 = 377. 
2. Explicit Formula 
Besides the recursive definition (5.237) there is an explicit formula for the Fibonacci numbers: 


QST ES) » 











V5 2 2 

Some important properties of Fibonacci numbers are the followings. For m,n € N: 

(1) Enin = Fm-ifa t FinFati (m> 1). (5.239a) (2) Fin| Finn: (5.239b 

(3) ged(m, n) = dimplies ged(Fin, Fn) = Fu. —(5.239c) (4) ged( Fs, F541) — 1. (5.239d 

(5) Fm| Fk holds iff m|k holds. (5.239e) (6) XO F? = FarPay. (5.239f 
il 

(7) gcd(m,n) = 1 implies F,, F, | Fi. (5.239g) (8 35 E 9 Fus - 1. (5.239h 
il 

(9) F,F,io — F2,, 5 (-1)**t. (5.2391) (10) F2 4- F2, = Fm. (5.239j 

(11) Fi, — Fi — Fanga- (5.239k) 


5.4.2 Linear Diophantine Equations 
1. Diophantine Equations 


An equation f (21, %2,...,%) = b is called a Diophantine equation in n unknowns iff f (zi, 2, ..., 24) 
is a polynomial in 21, %2,..., 2p» With coefficients in the set Z of integers, b is an integer constant and 


only integer solutions are of interest. The name “Diophantine” reminds of the Greek mathematician 
Diophantus, who lived around 250 AD. 

In practice, Diophantine equations occur for instance, if relations between quantities are described. 
Until now, only general solutions of Diophantine equations of at most second degree with two variables 
are known. Solutions of Diophantine equations of higher degrees are only known in special cases. 
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2. Linear Diophantine Equations in n Unknowns 
A linear Diophantine equation in n unknowns is an equation of the form 

α121 d- d333 c :::a434 — b. (a; € Z, b € Z), (5.240) 
where only integer solutions are searched for. A solution method is described in the following. 
3. Conditions of Solvability 
If not all the coefficients a; are equal to zero, then the Diophantine equation (5.240) is solvable iff 
gcd(a1, a2, . . - , an) is a divisor of b. 
E 114x +315y = 3 is solvable, since ged(114, 315) = 3. 
If a linear Diophantine equation in n unknowns (n > 1) has a solution and Z is the domain of variables, 
then the equation has infinitely many solutions. Then in the set of solutions there are n—1 free variables. 
For subsets of Z, this statement is not true. 
4. Solution Method for n = 2 
Let 

απι +a2z2 =b (αι, ας) σ (0,0) (5.241a) 
be a solvable Diophantine equation, i.e., gcd(a1, a2)|b. To find a special solution of the equation, the 
equation is divided by gcd(a;, a3) and one obtains a2, 4- a525 — U' with ged(a4, a5) = 1. 
As described in 5.4.1, 4., p. 374, gcd(aj, a5) is determined to obtain finally a linear combination of a 
and a5: a1c, 4- a3c — 1. 
Substitution in the given equation demonstrates that the ordered pair (c{b’, c4b’) of integers is a solution 
of the given Diophantine equation. 
W 114r--315y — 6. The equation is divided by 3, since 3 = ged(114, 315). That implies 38r 4-105y — 2 
and 38 - 47 + 105 - (—17) = 1 (see 5.4.1, 4., p. 374). The ordered pair (47 - 2, (—17) - 2) = (94, —34) is 
a special solution of the equation 1142 + 315y = 6. 
The family of solutions of (5.241a) can be obtained as follows: If (29, 19) is an arbitrary special solution, 
which could also be obtained by trial and error, then 

{(a9 + t+ a), 09 —t-a))|t € Z} (5.241b) 
is the set of all solutions. 
Wi The set of solutions of the equation 114x + 315y = 6 is {(94 + 315t, —34 — 114¢)|t € Z}. 
5. Reduction Method for n > 2 
Suppose a solvable Diophantine equation 





421 + dog + +++ +an%n =) (5.242a) 





with (a1, a2, .. . , an) Z- (0,0,...,0) and ged(ai,a2,..., a4) — lis given. If ged(a1, a2, ..., a4) z 1, then 
the equation should be divided by ged(a4, a2, ..., a4). After the transformation 











atı + d313 d- ^: d- Gs 1344 — b — Antn (5.242b) 
Zp» is considered as an integer constant and a linear Diophantine equation in n— 1 unknowns is obtained, 
and it is solvable iff gcd(ay, a2,...,@n—1) is a divisor of b — anXn. 
The condition 





gcd(ai, a2, ..., as 1)|b — asta (5.242c) 
is satisfied iff there are integers c, c,, such that: 

gcd(a1, a2, ..., às 1) : € - an, — b. (5.2424) 
This is a linear Diophantine equation in two unknowns, and it can be solved as shown in 5.4.2,4., p. 376. 
If its solution is determined, then it remains to solve a Diophantine equation in only n — 1 unknowns. 
This procedure can be continued until a Diophantine equation in two unknowns is obtained, which can 
be solved with the method given in 5.4.2, 4., p. 376. 
Finally, the solution of the given equation is constructed from the set of solutions obtained in this way. 
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W Solve the Diophantine equation 

2a + 4y + 3z = 3. (5.243a) 
This is solvable since gcd(2, 4, 3) is a divisor of 3. 
The Diophantine equation 

2z + 4y = 3-32 (5.243b) 
in the unknowns x,y is solvable iff gcd(2, 4) is a divisor of 3 — 3z. The corresponding Diophantine 
equation 22' 4- 3z — 3 has the set of solutions ((—3 4- 35,3 — 2t)|t € Z). This implies, z — 3 — 2t, and 
now the set of solutions of the solvable Diophantine equation 2x 4- 4y — 3 — 3(3 — 2t) or 

z +2y=-—3+3t (5.243c) 
is sought for every t € Z. 
The equation (5.243c) is solvable since gcd(1,2) — 1|(—3 -- 3t). Now 1- (C1) -2- 1 — Land 1- (3— 
3t) +2-(—3+3t) = —3 + 3t. The set of solution is (((3 — 3t) + 2s, (—3+ 3t) — s)|s € Z}. That implies 
x = (3 — 3t) - 2s, y — (—3 + 3t) — s, and {(3 — 3t + 2s, —3 + 3t — s,3 — 2t)|s, t € Z) so obtained is 
the set of solutions of (5.243a). 





5.4.3 Congruences and Residue Classes 


1. Congruences 
Let m be a positive integer m, m > 1. If two integers a and b have the same remainder, when divided 
by m, then a and b are called congruent modulo m, denoted by a = b mod m or a = b(m). 


W3-—13mod5, 38z13mod5, 3-z —2 mod 5. 


Remark: Obviously, a = b mod m holds iff m is a divisor of the difference a — b. Congruence modulo 
m is an equivalence relation (see 5.2.4, 1., p. 334) in the set of integers. Note the following properties: 


a = a mod m for every a € Z, (5.244a) 
a=bmod m = b= a mod m, (5.244b) 
à z b mod m Ab z c mod m — a z c mod m. (5.244c) 
2. Calculating Rules 
a =b mod m Ac = d mod m => a +c = b + d mod m, (5.245a) 
a =b mod m Ac = d mod m => a: c = b- d mod m, (5.245b) 
a:czb-cmod m ^ gced(c,m) —1 — az b mod m, (5.245c) 
m 
a-c=b-cmodmA c#05 a=b nt eae aa. (5.245d) 


3. Residue Classes, Residue Class Ring 
Since congruence modulo m is an equivalence relation in Z, this relation induces a partition of Z into 
residue classes modulo m: 

[a] = (x|x € Z^ x 2 a mod m]. (5.246) 
The residue class * a modulo m ? consists of all integers having equal remainder if divided by m. Now 
[a] = [b]m iff a = b mod m. 
There are exactly m residue classes modulo m, and normally they are represented by their smallest 
non-negative representatives: 

Olm, 1]. - ... [rm — 1]a. (5.247) 
In the set Zm of residue classes modulo m, residue class addition and residue class multiplication are 
defined by 

[a]; € [b]. :— [a 4- b], (5.248) 


[a]. O [b]. :— [a - b]. (5.249) 
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These residue class operations are independent of the chosen representatives, i.e., 
la]m = [@|m and [b]m = [b'|m imply 

la]m ® [b]m = [a]; € [o]; and [a] © [b]m = [alm © [b"]m. (5.250) 
The residue classes modulo m form a ring with unit element, with respect to residue class addition 
and residue class multiplication (see 5.4.3, 1., p. 377), the residue class ring modulo m. If p is a prime 
number, then the residue class ring modulo p is a field (see 5.3.7, 2., p. 361). 
4. Residue Classes Relatively Prime to m 
A residue class [a]m with gcd(a, m) = 1 is called a residue class relatively prime to m. If p is a prime 
number, then all residue classes different from [0], are residue classes relatively prime to p. 
The residue classes relatively prime to m form an Abelian group (5.3.3.1,1., p. 336) with respect to 
residue class multiplication, the so-called group of residue classes relatively prime to m. The order of 
this group is y(m), where ¢ is the Euler function (see 5.4.4, 1., p. 381). 
E A: [1]s, [3]s, [5]s, [7]s are residue classes relatively prime to 8. 
E B: [1];,[2]s, [3]s, [4]s are residue classes relatively prime to 5 
W C: (8) = (5) = 4 is valid. 
5. Primitive Residue Classes 
A residue class [a]m relatively prime to m is called a primitive residue class if it has order y(m) in the 
group of residue classes relatively prime to m. 
ΒΛ: [2]; is a primitive residue class modulo 5, since ([2]5)? — [4]s, ([2]5)? — [3]s, ([2]5)* — [1]s. 
W B: There is no primitive residue class modulo 8, since [1] has order 1, and [3]s, [5]a, [7]a have order 
2 in the group of residue classes relatively prime to m. 
Remark: There is a primitive residue class modulo m, iff m = 2,m = 4,m = p* or m — 2p, where p 
is an odd prime number and k is a positive integer. 
If there is a primitive residue class modulo m, then the group of residue classes relatively prime to m 
forms a cyclic group. 
6. Linear Congruences 
1. Definition If a,b and m > 0 are integers, then 


ax = b(m) (5.251) 
is called a linear congruence pm the unknown x). 
2. Solutions An integer g“ satisfying ax* = b(m) is a solution of this congruence. Every integer, 


which is congruent to 2* modulo m, is also a solution. In finding all solutions of (5.251) it is sufficient 
to find the integers pairwise incongruent modulo m which satisfy the congruence. 

The congruence (5.251) is solvable iff gcd(a, m) is a divisor of b. In this case, the number of solutions 
modulo m is equal to ged(a, m). 

In particular, if gcd(a, m) = 1 holds, the congruence modulo m has a unique solution. 

3. Solution Method There are different solution methods for linear congruences. It is possible to 
transform the congruence ax = b(m) into the Diophantine equation ax + my = b, and to determine 
a special solution (z?, y?) of the Diophantine equation a/z 4- m/y — U' with a^ — a/gced(a, m), m' — 
m/gcd(a, m), t' — b/gcd(a, m) (see 5.4.2, 1., p. 375). 


The congruence a'x = (m) has a unique solution since gcd(a', m") = 1 modulo m’, and 


x = 2°(m’). (5.252a) 
The congruence ax = b(m) has exactly gcd(a, m) solutions modulo m: 
a9, a9 +m,2° + 2m, ... , a? + (ged(a,m) — 1)m. (5.252b) 


W 114r — 6 mod 315 is solvable, since gcd(114, 315) is a divisor of 6; there are three solutions modulo 
315. 

38x = 2 mod 105 has a unique solution: « = 94 mod 105 (see 5.4.2, 4., p. 376). 94, 199, and 304 are 
the solutions of 1142 = 3 mod 315. 
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7. Simultaneous Linear Congruences 
If finitely many congruences 


x = by (m1), x = bo(ma),..., 2 = tym) (5.253) 
are given, then (5.253) is called a system of simultaneous linear congruences. A result on the set of 
solutions is the Chinese remainder theorem: Consider a given system x = bı (m1), £ 2 bo(ma3),..., o — 
b,(m,), where m1, ma, ..., m, are pairwise coprime numbers. If 

m m m 
m —mj,-:m»:::mi,j— —,à3— —,...,Q04— — (5.254a) 
mi m me 
and «; is chosen such that a;x; = b;(m,;) for j = 1,2,...,t, then 
a! = aya, + aot +++? + ary (5.254b) 


is a solution of the system. The system has a unique solution modulo m, i.e., if z' is a solution, then x” 
is a solution, too, iff z" 2 a'(m). 

E Solve the system x = 1(2), r 2 2(3) x = 4(5), where 2,3,5 are pairwise coprime numbers. 
Then m — 30,a; — 15,a2 = 10,a3 = 6. The congruences 15x1 = 1 (2), 10x2 = 2 (3), 6x3 = 4 (5) 
have the special solutions xı = 1, x2 = 2, x3 = 4. The given system has a unique solution modulo m: 
x= 15. 1+ 10-2 +6- 4(30), i.e., x z 29 (30). 

Remark: Systems of simultaneous linear congruences can be used to reduce the problem of solving 
non-linear congruences modulo m to the problem of solving congruences modulo prime number powers 
(see 5.4.3, 9., p. 380). 

8. Quadratic Congruences 


1. Quadratic Residues Modulo m One can solve every congruence ax? + br +c = 0(m) if one 
can solve every congruence 23 = a(m): 


aa? + ba +c = 0(m) & (2ax -- b)? Ξε b — 4ac(m). (5.255) 


First quadratic residues modulo m are considered: Let m € IN,m > 1 and a € Z, gcd(a, m) — 1. The 
number a is called a quadratic residue modulo m iff there is an x € Z with 2? = a(m). 





If the canonical prime factorization of m is given, i.e., 
oo 
m= II». (5.256) 
i=l 
then r is a quadratic residue modulo m iff r is a quadratic residue modulo p; for i = 1,2,3,.... 
. . : : d a NC : 
If ais a quadratic residue modulo a prime number p, then this is denoted by ( :) = 1; ifa is a quadratic 
P 
] ui α 
non-residue modulo p, then it is denoted by (2) — —] (Legendre symbol). 
p 


W The numbers 1,4, 7 are quadratic residues modulo 9. 


2. Properties of Quadratic Congruences 


(E1)  p[ab and az b(p) imply (0) = (*) ; (5.257a) 
P p 


(E2) C) =1. (5.257b) 
p 


(E3) e Str" (5.2576) 
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(E4) (2) = (0) (*) in particular (=) = (0) (5.257d) 


2 pl 
(E5) (=) -(-1 s. (5.2570) 








(E6) Quadratic reciprocity law: If p and q are distinct odd prime numbers, 


then (5) (3) zu iu. (5.2578) 


p 




















τ (wr) ; (s) (ον) H C) CR) B (ὦ Ca^ e (3) 7 
-(- Ἐν -ι. 


In General: A congruence z? & a(2?), gcd(a,2) — 1, is solvable iff a = 1(4) for a = 2 and a = 1(8) 


for a > 3. If these conditions are satisfied, then modulo 2° there is one solution for a = 1, there are 
two solutions for a = 2 and four solutions for a > 3. 
A necessary condition for solvability of congruences of the general form 
r? a(m), m — 2? pt p? --- p?*, σο](α, πι) -- 1, (5.258a) 
is the solvability of the congruences 
α α α 
a = 1(4) fora =2, a= 1(8) for a > 3, (<) =1, (<) L4. (<) zu (5.258b) 
pi pa Pt 
If all these conditions are satisfied, then the number of solutions is equal to 2° for a = 0 and a = 1, 
equal to 2** for a — 2 and equal to 2'*? for a > 3. 


9. Polynomial Congruences 


If m4, m2, ..., m, are pairwise coprime numbers, then the congruence 

f(t) = ant” + dy 1x7 4 pag m O(m4ma --- m4) (5.2592) 
is equivalent to the system 

f(x) 8 0(mi), f(x) & 0(ma), ..., f(x) = O(m). (5.259b) 
If kj is the number of solutions of f(r) 2 0(m;) for j — 1,2,...,t, then kika -- - k; is the number of 
solutions of f(x) 2 O(mqm-- - m4). This means that the solution of the congruence 

f(e) = 0 (p p3 - -- pt"), (5.259c) 
where pi, pa, . .., pr are primes, can be reduced to the solution of congruences f(x) = 0(p*). Moreover, 


these congruences can be reduced to congruences f(x) = 0(p) modulo prime numbers in the following 
way: 

a) A solution of f(x) = 0(p®) is a solution of f(x) = 0(p), too. 

b) A solution z = zi(p) of f(x) 2 0(p) defines a unique solution modulo p? iff f'(z1) is not divisible 
by p: 

Suppose f (x1) 2 O0(p). Let x — x, 4- pt? and determine the unique solution 14 of the linear congruence 


i 
{αν + f'(£ı)tı = 0(p). (5.260a) 
p 
Substitute f, = ¢, + pty into x = 2, 4- pl, then z — x3 + pte is obtained. Now, the solution t} of the 
linear congruence 
f (v2) 


p? 





+ f'(z2)t2 = 0(p) (5.260b) 


5.4 Elementary Number Theory 381 





has to be determined modulo p?. By substitution of t2 = t, + pt3 into £ = r3 4 prt, the result 
x = x3 + pts is obtained. Continuing this process yields the solution of the congruence f(a) = 0 (p°). 
E Solve the congruence f(x) = x4 + 7x +4 = 0(27). f(x) = «4+ 72 +4 = 0(3) implies ¢ = 
1(3), ie, £ = 1 + 3tı. Because of f’(x) = 4x3 + 7 and 3/ f’(1) now the solution of the congruence 
f(1)/3 + f'(1) -tı = 4+ 11t, = 0 (3) is searched for: tı = 1 (3), i.e., tı = 1 + 3tz and x = 4 + Mt. 
Then consider f(4)/9 + f'(4) - t2 = 0 (3) and the solution tz = 2 (3) is obtained, i.e., t2 = 2 + 3t and 
x = 22 + 27tz. Therefore, 22 is the solution of xt + 7x + 4 = 0 (27), uniquely determined modulo 27. 





5.4.4 Theorems of Fermat, Euler, and Wilson 
1. Euler Function 


For every positive integer m with m > 0 one can determine the number of coprimes x with respect to 
m for 1 < x < m. The corresponding function y is called the Euler function. The value of the function 
(m) is the number of residue classes relatively prime to m (s. 5.4.3, 4., p. 378). 

For instance, (1) = 1, (2) = 1, (3) = 2, (4) = 2, 9(5) = 4, (6) = 2, (7) = 6, p(8) = 4, etc. 
In general, y(p) = p—1 holds for every prime number p and ¢(p*) = p* — p* for every prime number 
power p®. If m is an arbitrary positive integer, then y(m) can be determined in the following way: 


y(m) =m] I E - ;) ; (5.261a) 


pim 








where the product applies to all prime divisors p of m. 

Β (300) -- ρ(2}. 52. 5) -- 360-(1-- 4) - (1 — $) - (1 — 4) = 96. 

Furthermore 
X yld)=m (5.261b) 
d|m 

is valid. If ged(m, n) — 1 holds, then we get y(mn) = y(m)y(n). 

Β (300) -- 2(22- 33.5) 2 (25) - (32) - (5) 2 4-6-4 — 96. 

2. Fermat-Euler Theorem 

The Fermat-Euler theorem is one of the most important theorems of elementary number theory. If a 

and m are coprime positive numbers, then 


a) — 1(m). (5.262) 


W Determine the last three digits of 99 in decimal notation. This means, determine z with x = 
9” (1000) and 0 < x < 999. Now y(1000) = 400, and according to Fermats theorem 9*9? = 1 (1000). 
Furthermore 9° = (80 + 1)*-9 = ((¢)80°- 14 + ({)80 - 19) -9 = (1+ 4-80) -9 = —79- 9 = 89 (400). 
From that it follows that 9° = 999 — (10—1)® = (%)10°- (—1)89+ (9) 101 (—1)®8+ (9) 10? (-1)87 = 
—1+ 89-10 — 3916 - 100 = —1 — 110 + 400 = 289(1000). The decimal notation of 9% ends with the 
digits 289. 

Remark: The theorem above for m — p, i.e., e(p) — p— 1 was proved by Fermat; the general form was 
proved by Euler. This theorem forms the basis for encoding schemes (see 5.4.6). It contains a necessary 
criterion for the prime number property of a positive integer: If p is a prime, then a?~+ = 1(p) holds for 
every integer a with p Ja. 

3. Wilson's Theorem 


There is a further prime number criterion, called the Wilson theorem: 
Every prime number p satisfies (p — 1)! = —1(p). 
The inverse proposition is also true; and therefore: 
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'The number p is a prime number iff (p — 1)! 2 —1(p). 


5.4.5 Prime Number Tests 


In the followings two stochastic prime tests will be presented which are useful at large numbers to test 
the prime property with a sufficiently small probability of mistakes. With these tests it is possible to 
show that a number is not a prime, without knowing its prime factors. 

1. Fermat-Prime Number Test 

Let n be an odd natural number and a an integer such that gcd(a, n) = 1 and a^! & 1 (mod n).. Then 
n is called a pseudoprime to base a. 

W A: 341 is a pseudo prime to basis 2; 341 is not a pseudo prime to basis 3. 

Test: Let an odd natural number n > 1 be given. Choose a € Z,, \ {0}. 


e If the gcd (a,n) > 1, then n is not prime. 


a"-z] (mod m) did pass . 
e If the gcd (a, n) — 1 and { 4241 (modn) {? then n { did not ae the test to base a. If n did 


not pass the test, then n is not a prime. If n did pass the test, then it may be a prime, but more tests 
are needed with other base, i.e. tests with further values of a. 

E B: n = 15: The test with a = 4 gives 444 = 1(mod15). The test with a = 7 gives 74 = 44 
1 (mod 15). Hence 15 is not a prime. 

MC: n = 561: The test with arbitrary a € Zs \ {0} with ged(a,561) = 1 results in a = 
1 (mod 561). But 561 = 3-11-17 is not a prime. 

Remark: A composite number n for which a" & 1(modn) for all a € Zn \ {0} with gcd(a, n) = 1 is 
called a Carmichael number. 

If n is not a prime and not a Carmichael number, then one can show that the level of error of the first 
kind to get a false result using k numbers with gcd(a, n) =1 is at most 1/2" . At least for the half of the 
numbers in Z; V (0) with gcd(a, n) —1 the relation αἲ zt 1 (mod n) holds. 


2. Rabin-Miller Prim Number Test 
The Rabin-Miller primality test is based on the following statement (*): 
Let n > 2 be a prime, n — 1 — 2*u (u is odd), g.c.d(a, n) 21. Then: 
a* — 1(modn) or a?" 2 —1 (mod n) for some j € {0,1,...,t— 1}. (*) 
Every odd natural number n > 1 can be tested about prime property in the following way: 
Test: Choose a € Z, \ {0} and find the representation n — 1 — 2*u (u is odd). 
e If g.c.d(a, n) 7 1, then n is not a prime. 
e If g.c.d(a, n) —1, then the sequence a* (mod n), a?" (modn), ..., a? "(mod n) is calculated until a 
value is found which satisfies (x). These elements are calculated by repeated squaring mod n. If there 
is no such value, then n is not a prime. Otherwise n did pass the test to basis a. 
W A: n — 561, and should be tested by different values of a: 
235 2263 +1 (mod 561), 
270 21662 —1 (mod 561) 
122. —9. ; 
n-1—-2.33 a—7 guo .— Gr 1 (mod 561), 
2280 = 421 # —1 (mod 561). 
If choosing k different values randomly and independently and n passes the test to basis a for each, 
then the error rate of the first kind that n is not a prime is < 1/4*. In the practice k = 25 is chosen. 
Bl B: There is only one number X 2,5- 10° such that it passes the test to basis a = 2,3,5, 7 and it is 
not a prime. 





561 is not a prime. 
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3. AKS Prime Number Test 

The AKS primality test is based on a polynomial algorithm to determine whether a number is prime or 
composite. Published by Agrawal, Kayal, and Saxena, in 2002, meanwhile it is evident that the prime 
property can be tested efficiently for any natural number. 

The test is based on the following statements: 

Ifm > lisa natural number and r is a prime satisfying the assumtions 

e nis not divisible by primes < r, 

ο r' £1 (mod n) fori = 1,2,..., | (loggn)?|*, 

e (v +a)" = 2" +a (mod 2” — 1,n) for every 1 <a < Vrlogn, 

Then n is a power of a prime. 

Let n > 1 be an odd natural number whose prime characteristic is to be tested, and m := | (log, n)°|. 
If n « 5690034, then it is tested by comparing it to a list of known prime numbers whether n is a prime. 
For n > 5690034 holds n > m: 

Test: 

e Check, whether n can be divided by a natural number from the interval [3, m]. If yes, then n is not 
a prime. 

e Otherwise take a prime r < m, such that rê # 1( modn) for i — 1,2,...,|(loga n)?]. (It can be 
proven, that such a prime r exists.) 

e Check, whether the congruence (x + a)” — z^ -- a( mod z^ — 1,n) for a — 1,2, Vr|(log; n)| holds. 
If not, then n is not a prime. If yes, then n is a power of a prime. In this case it is to be tested, whether 
natural numbers g and k > 1 exist, for which n — q*. If not, then n is a prime. 

Different to the known and efficient stochastic algorithms, the result of the test can be trusted without 
even a negligible small error probability of mistakes. However in cryptography the Rabin-Miller test is 
preferred. 


5.4.6 Codes 
5.4.6.1 Control Digits 


In the information theory methods are provided to recognize and to correct errors in data combinations. 
Some of the simplest methods are represented in the form of the following control digits. 
1. International Standard Book Number ISBN-10 
A simple application of the congruence of numbers is the use of control digits with the International 
Standard Book Number ISBN. A combination of 10 digits of the form 

ISBN a — bcd — ef ghi — p. (5.263a) 
is assigned to a book. The digits have the following meaning: a is the group number (for example, a = 3 
tells us that the book originates from Austria, Germany, or Switzerland), bcd is the publisher’s number, 
and efghi is the title number of the book by this publisher. A control digit p will be added to detect 
erroneous book orders and thus help reduce expenses. The control digit p is the smallest non-negative 
digit that fulfils the following congruence: 

10a + 9b + 8c + Τά + 6e + 5f + 4g + 3h + 2i + p = 0(11). (5.263b) 
If the control digit p is 10, a unary symbol such as X is used (see also 5.4.6, 3., p. 384). A presented 
ISBN can now be checked for a match of the control digit contained in the ISBN and the control digit 
determined from all the other digits. In case of no match an error is certain. The ISBN control digit 
method permits the detection of the following errors: 
1. Single digit error and 
2. interchange of two digits. 
Statistical investigations showed that by this method more than 90% of all actual errors can be detected. 
All other observed error types have a relative frequency of less than 1%. In the majority of the cases 





*|a| is symbol for “greatest integer < x”. 
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the described method will detect the interchange of two digits or the interchange of two complete digit 
blocks. 
2. Central Codes for Drugs and Medicines 


In pharmacy, a similar numerical system with control digits is employed for identifying medicaments. 
In Germany, each medicament is assigned a seven digit control code: 


abcde f p. (5.264a) 
The last digit is the control digit p. It is the smallest, non-negative number that fulfils the congruence 
2a + 3b + 4c + 5d + 6e + 7f = p(11). (5.264b) 


Here too, the single digit error or the interchange of two digits can always be detected. 
3. Account Numbers 


Banks and saving banks use a uniform account number system with a maximum of 10 digits (depending 
on the business volume). The first (at most four) digits serve the classification of the account. The 
remaining six digits represent the actual account number including a control digit in the last position. 
The individual banks and saving banks tend to apply different control digit methods, for example: 


a) The digits are multiplied alternately by 2 and by 1, beginning with the rightmost digit. A control 
digit p will then be added to the sum of these products such that the new total is the next number 
divisible by 10. Given the account number abcd ef ghi p with control digit p, then the congruence 


2i +h +2g + f -- 2e 4- d - 2c 4- b -- 2a 4- p z 0 (mod 10). (5.265) 
holds. 


b) As in method a), however, any two-digit product is first replaced by the sum of its two digits and 
then the total sum will be calculated. 

In case a) all errors caused by the interchange of adjacent digits and almost all single-digit errors will 
be detected. 

In case b), however, all errors caused by the change of one digit and almost all errors caused by the 
interchange of two adjacent digits will be discovered. Errors due to the interchange of non-adjacent 
digits and the change of two digits will often not be detected. 


The reason for not using the more powerful control digit method modulo 11 is of a non-mathematical 
nature. The non-numerical sign X (instead of the control digit 10 (see 5.4.6, 1., p. 383)) would require 
an extension of the numerical keyboard. However, renouncing those account numbers whose control 
digit has the value of 10 would have barred the smooth extension of the original account number in a 
considerable number of cases. 


4. European Article Number EAN 


EAN stands for European Article Number. It can be found on most articles as a bar code or as a string 
of 13 or 8 digits. The bar code can be read by means of a scanner at the counter. 

In the case of 13-digit strings the first two digits identify the country of origin, e.g., 40, 41,42 and 
43 stand for Germany. The next five digits identify the producer, the following five digits identify a 
particular product. The last digit is the control digit p. 

This control digit will be obtained by first multiplying all 12 digits of the string alternately by 1 and 
3 starting with the left-most digit, by then totalling all values, and by finally adding a p such that the 
next number divisible by 10 is obtained. Given the article number abcde f ghikmn p with control digit 
p, then the congruence 


at+3b+c+3d+e+3f+g43h+i+3k+m+3n+p=0 (mod 10). (5.266) 
holds. 


This control digit method always permits the detection of single digit errors in the EAN and often the 
detection of the interchange of two adjacent digits. The interchange of two non-adjacent digits and the 
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change of two digits will often not be detected. 
5.4.6.2 Error correcting codes 


1. Model of Data Transmission and Error Correction 

At transmission of messages through noisy channels the correction of errors is often possible. The mes- 
sage is coded first, then after transmission the usually biased codes are corrected into the right ones, 
so after decoding them the original message can be recovered. That case is considered now, when the 
length of the words of the message is k, and the length of the coded words is n, and both of them consist 
of only zeros and ones. Then k is the number of information positions and n — k is the number of redun- 
dant positions. Every word of the message is an element of GF(2)* (see 5.3.7.4 p. 363) and every word 
of the code is an element of GF(2)". To simplify the notation the words of the message are written in 


the form a1, d2,...,a,%, and the words of the code in the form c,c5,...,c,. The words of the message 
are not transmitted, only the words of the code are. 
An often used idea of error correction is to convert the transmitted word di, d», .. . , d,, first into a valid 


codeword c, c9, ..., c, which differs from it in the least number of digits (decoding MLD). It depends 

on the properties of coding and the transmission channels that how many errors can be detected and 

corrected in this way. 

W At digit repeating codes the message word 0 is represented by the codeword 0000. If after trans- 

mission the receiver gets the word 0010, then he assumes that the original codeword was 0000, and 

it is decoded as message word 0. But if the received word is 1010, then similar assumption can not 

be applied, since the message word 1 is coded as 1111, so the difference is similar. At least it can be 

recognized that there is some error in the received word. 

2. t-Error Correcting Codes 

The set of all codewords is called code C. The distance of two codewords is the number of digits (po- 

sitions) in which the two words differ from each other. The minimal distance dmin(C) of codes is the 

smallest distance which occurs between the codewords of C. 

H For C; = {0000, 1111}, dmin(C)1 = 4. For Cp = {000, 011, 101,110}, dmin(C2) = 2, since there are 

codewords which have distance 2. For C3 = {00000, 01101, 10111, 11010}, dmin(C3) = 3, there are code- 

words in C3 whose distance is 3. 

If the minimal distance dmin(C) of a code C is known, then it is easy to recognize how many transmis- 

sion errors can be corrected. Codes, correcting t errors, are called t-error correcting. A code C is t-error 

correcting if dmin(C) > 2¢+ 1. 

W (Continuation) C; is 1-error correcting, C2 is 0-error correcting (it means, that no error can be cor- 

rected), C3 is l-error correcting. i 

For every t-error correcting code C € GM(2)" holds »; (9 :|C| € 2^ . If equality holds, then C is called 
i=0 

t-perfect. 

Bl The digit repeating code C = {00...0,11...1} C GF(2)4*7 is t-perfect. 

3. Linear Codes 

A non-empty subset C C GF(2)" is called (binary) linear code, if C is a sub-vector space of GF(2)"^. If 

a linear code C C GF(2)" has dimension &, then it is called an (n, k) linear code. 

W (Continuation) C; is a (4,1) linear code, C» is a (3,2) linear code, C3 is a (5,2) linear code. In the case 

of linear codes the minimal distance (and as a consequence the number of correctible errors) is easy 

to determine: The minimal distance of such a code is the smallest distance of a non-zero vector from 

the zero vector of the vector space. The minimal distance can be found if the minimal number of ones, 

except with all zeros, in the codewords is given. 

For every (n, k) linear code there is a generating matriz G for which C — faG | a € GF(2)*): 


fir -++ in σι 
GE =|: |]. (5.267) 


Iki --- Ikn) kxn Gk 
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The code is uniquely defined by the generating matrix; the codeword of the message word a1a2 .. . a; is 
determined in the following way: 


G1A2... Ap > A191 + A2go +... + ange - (5.268) 
a 
aG 


In the case of an (n, k) linear code C a check matrix is needed for decoding: 
hı aes hain 
H= : : : ; (5.269) 
hy ka sss nk (ak). 


The (binary) linear code C is 1-error correcting, if the columns of H are pairwise different and non-zero 
vectors. If the result of the transmission is the word d = d,d)...d,, then Hd” is calculated. If the 
result is the zero vector, then d is a codeword. Otherwise if Hd? is the i-th column of the check matrix 
H, then the corresponding codeword is d + e;, where e; — (0,0,...,0,1,0,...,0) and the 1 is on the 
i-th position. 

4. Cyclic Codes 

Cyclic codes are the most investigated linear codes. They provide efficient coding and decoding. 

A (binary) (n, k) linear code is called cyclic if for every codeword cic» .. . c, the codeword obtained by 
a cyclic right shift of the components is also a codeword, i.e. coc1 . . - Cn-1 E C = Cn—1€061 . - - Cn—2 E C 
W C — (000, 110, 101, 011] is a cyclic (3,2) linear code. 

Το have an efficient work with cyclic codes, the codewords are represented by polynomials of degree 
X n — 1 with coefficients from GF(2): C — (000, 110, 101, 011) is a cyclic (3, 2)-linear code. 

A (binary) (n, k) linear code C is cyclic if and only if for every c() 

c(r) € C 2 c(x): x(mod z^ -1) e C (5.270) 
A cyclic (n, k) linear code can be described by a generating polynomial and a control polynomial as 
follows: The generating polynomial g(x) of degree n — k (k € (1,2,...,n — 1]) is a divisor of z" — 1. 
The polynomial (x) of degree k for which g(x)h(x) — x" — 1is called the control polynomial . Coding 
of ayag... a, in polynomial representation a(x) is given by 

a(x) + a(x) + g(a). (5.271) 
Polynomial d(x) is an element of the code, if the generator polynomial g(x) is a divisor of d(x), or the 
control polynomial h(x) satisfies the relation d(x)h(x) = 0 mod z” — 1. 

An important class of cyclic codes are the BCH-codes. Here a lower bound ô of the minimal distance 
and with it a lower bound for the number of errors can be required for which code should be corrected. 
Here ó is called the design distance of the code. 

A (binary) (n, k) linear code C is a BCH-code with design distance 6 if for the generating polynomial 
g(x): 

g(x) — lem(ma (x), masa (x), ..., mgses-2(m)) , (5.272) 
where a is a primitive n-th unit root and b is an integer. The polynomials ma; (x) are minimal polyno- 
mials of o?. 

For a BCH-code C with design distance 6 the relation dmin(C) > 6 must hold. 


5.5 Cryptology 
5.5.1 Problem of Cryptology 


Cryptology is the science of hiding information by the transformation of data. 

The idea of protecting data from unauthorized access is rather old. During the 1970s together with the 
introduction of cryptosystems on the basis of public keys, cryptology became an independent branch of 
science. Today, the subject of cryptological research is how to protect data from unauthorized access 
and against tampering. 
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Beside the classical military applications, the needs of the information society gain more and more in 
importance. Examples are the guarantee of secure message transfer via email, electronic funds transfer 
(home-banking), the PIN of EC-cards, etc. 

Today, the fields of cryptography and cryptanalysis are subsumed under the notion of cryptology. Cryp- 
tography is concerned with the development of cryptosystems whose cryptographic strengths can be 
assessed by applying the methods of cryptanalysis for breaking cryptosystems. 


5.5.2 Cryptosystems 


An abstract cryptosystem consists of the following sets: a set M of messages, a set C of ciphertexts, 
sets IX and K’ of keys, and sets E and ID of functions. A message m € M will be encrypted into a 
ciphertext c € C by applying a function E € E together with a key k € K, and will be transmitted via 
a communication channel. The recipient can reproduce the original message m from c if he knows an 
appropriate function D € D and the corresponding key k’ € K’. There are two types of cryptosystems: 


1. Symmetric Cryptosystems: The conventional symmetric cryptosystem uses the same key k for 
encryption of the message and for decryption of the ciphertext. The user has complete freedom in 
setting up his conventional cryptosystem. Encryption and decryption should, however, not become 
too complex. In any case, a trustworthy transmission between the two communication partners is 
mandatory. 


2. Asymmetric Cryptosystems: The asymmetric cryptosystem (see 5.5.7.1, p. 391) uses two keys, 
one private key (to be kept secret) and a public key. The public key can be transmitted along the same 
path as the ciphertext. The security of the communication is warranted by the use of so-called one-way 
functions (see 5.5.7.2, p. 391), which makes it practically impossible for the unauthorized listener to 
deduce the plaintext from the ciphertext. 


5.5.3 Mathematical Foundation 


An alphabet A = {a0,a1,...,an-1} is a finite non-empty totally ordered set, whose elements a; are 
called letters. |A| is the length of the alphabet. A sequence of letters w = a1a5...a;, of length n € IN 
and a; € A is called a word of length n over the alphabet A. A” denotes the set of all words of length 
n over A. Let n, m € IN, let A, B be alphabets, and let S be a finite set. 
A cryptofunction is a mapping t: A” x S — B" such that the mappings ts: A” > B™: w > t(w,s) 
are injective for all s € S. The functions t, and t7+ are called the encryption and decryption function, 
respectively. w is called plaintext, t,(w) is the ciphertext. 
Given a cryptofunction t, then the one-parameter family {ts }ses is a cryptosystem T's. The term cryp- 
tosystem will be applied if in addition to the mapping t, the structure and the size of the set of keys is 
significant. The set S of all the keys belonging to a cryptosystem is called the key space. Then 

Ts = (ts: A” => A”|s € S} (5.273) 
is called a cryptosystem on A". 
If Ts is a cryptosystem over A" and n = 1, then ts is called a stream cipher; otherwise ts is called a 
block cipher. 
Cryptofunctions of a cryptosystem over A” are suited for the encryption of plaintext of any length. 
The plaintext will be split into blocks of length n prior to applying the function to each individual 
block. The last block may need padding with filler characters to obtain a block of length n. The filler 
characters must not distort the plaintext. 
There is a distinction between context-free encryption, where the ciphertext block is only a function of 
the corresponding plaintext block and the key, and context sensitive encryption, where the ciphertext 
block depends on other blocks of the message. Ideally, each ciphertext digit of a block depends on all 
digits of the corresponding plaintext block and all digits of the key. Small changes to the plaintext or 
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to the key cause extended changes to the ciphertext (avalanche effect). 


5.5.4 Security of Cryptosystems 


Cryptanalysis is concerned with the development of methods for deducing from the ciphertext as much 
information about the plaintext as possible without knowing the key. According to A. Kerkhoff the 
security of a cryptosystem rests solely in the difficulty of detecting the key or, more precisely, the de- 
cryption function. The security must not be based on the assumption that the encryption algorithm is 
kept secret. There are different approaches to assess the security of a cryptosystem: 

1. Absolutely Secure Cryptosystems: There is only one absolutely secure cryptosystem based on 
substitution ciphers, which is the one-time pad. This was proved by Shannon as part of his information 
theory. 

2. Analytically Secure Cryptosystems: No method exists to break a cryptosystem systematically. 
The proof of the non-existence of such a method follows from the proof of the non-computability of a 
decryption function. 

3. Secure Cryptosystems according to Criteria of Complexity Theory: There is no algorithm 
which can break a cryptosystem in polynomial time (with regard to the length of the text). 

4. Practically Secure Cryptosystems: No method is known which can break the cryptosystem 
with available resources and with justified costs. 


Cryptanalysis often applies statistical methods such as determining the frequency of letters and words. 
Other methods are an exhaustive search, the trial-and-error method and a structural analysis of the 
cryptosystem (solving of equation systems). 

In order to attack a cryptosystem one can benefit from frequent flaws in encryption such as using stereo- 
type phrases, repeated transmissions of slightly modified text, an improper and predictable selection 
of keys, and the use of filler characters. 


5.5.4.1 Methods of Conventional Cryptography 

In addition to the application of a cryptofunction it is possible to encrypt a plaintext by means of cryp- 
tological codes. A code is a bijective mapping of some subset A’ of the set of all words over an alphabet 
A onto the subset B’ of the set of all words over the alphabet B. The set of all source-target pairs of 
such a mapping is called a code book. 


today evening 0815 
tomorrow evening 1113 


The advantage of replacing long plaintexts by short ciphertexts is contrasted with the disadvantage 
that the same plaintext will always be replaced by the same ciphertext. Another disadvantage of code 
books is the need for a complete and costly replacement of all books should the code be compromised 
even partially. 

In the following only encryption by means of cryptofunctions will be considered. Cryptofunctions have 
the additional advantage that they do not require any arrangement about the contents of the messages 
prior to their exchange. 

Transposition and substitution constitute conventional cryptoalgorithms. In cryptography, a transpo- 
sition is a special permutation defined over geometric patterns. The substitutions will now be discussed 
in detail. There is a distinction between monoalphabetic and polyalphabetic substitutions according to 
how many alphabets are used for presenting the ciphertext. Generally, a substitution is termed polyal- 
phabetic even if only one alphabet is used, but the encryption of the individual plaintext letter depends 
on its position within the plaintext. 

A further, useful classification is the distinction between monographic and polygraphic substitutions. 
In the first case, single letters will be substituted, in the latter case, strings of letters of a fixed length 
>1. 
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5.5.4.2 Linear Substitution Ciphers 

Let A = {ao,a1,-..,@n—1} be an alphabet and k,s € {0,1,...,n — 1} with ged(k,n) = 1. The 
permutation t*, which maps each letter a; to t*(a;) = axis, is called a linear substitution cipher. There 
exist n y(n) linear substitution ciphers on A. 

Shift ciphers are linear substituting ciphers with k = 1. The shift cipher with s = 3 was already used 
by Julius Caesar (100 to 44 BC) and, therefore, it is called the Caesar cipher. 


5.5.4.3 Vigenère Cipher 


An encryption called the Vigenère cipher is based on the periodic application of a key word whose 
letters are pairwise distinct. The encryption of a plaintext letter is determined by the key letter that 
has the same position in the key as the plaintext letter in the plaintext. This requires a key that is as 
long as the plaintext. Shorter keys are repeated to match the length of the plaintext. 


. στ” : m A B CD 
A version of the Vigenère cipher attributed to L. Carroll utilizes 


the so-called Vigenère tableau (see picture) for encryption and 
decryption. Each row represents the cipher for the key letter 
to its very left. The alphabet for the plaintext runs across the 
top. The encryption step is as follows: Given a key letter D and 
a plaintext letter C, then the ciphertext letter is found at the 
intersection of the row labeled D and the column labeled C; the 
ciphertext is F. Decryption is the inverse of this process. 


πα ΟΩ ΟΣ 
ΠΩ ΑΣ 
: Qt oou 
mao"moo 
—-HmHQO"mnmu 
-anmo m 
AGO mu 


a Let the key be “ HUT”. 


Plaintext: O NC EU PO N AT I M E 
Key: HU THU THU THU TH 
Ciphertex: V H V LO I V HT AC F L 


Formally, the Vigenère cipher can be written in the following way: let a; be the plaintext letter and a; be 
the corresponding key letter, then k = i + j determines the ciphertext letter ax. In the above example, 
the first plaintext letter is O = a14. The 15-th position of the key is taken by the letter H = a7. Hence, 
k =i + j = 14 +7 = 21 yields the ciphertext letter ag; = V. 
5.5.4.4 Matrix Substitution 
Let A = {ao,@1,-..,@n—1} be an alphabet and S = (s;;),5;; € {0,1,...,m— 1}, be a non-singular 
matrix of type (m, m) with gcd(detS,n)— 1. The mapping which maps the block of plaintext a41), 
a4(2),-- . , 4¢(m) to the ciphertext determined by the vector (all arithmetic modulo n, vectors transposed 
as required) 
Gt(1) 
Gt(2 
( ) (5.274) 
Gt(m) 
is called the Hill cipher. This represents a monoalphabetic matrix substitution. 
| 148 2) Let the letters of the alphabet be enumerated ag = A, a; =B,...,a25 = 
ΕΙ S= à 


852 Z. For m — 3 and the plaintext AUTUMN, the strings AUT and UMN 
321 correspond to the vectors (0, 20, 19) and (20, 12, 13). 


Then S (0,20,19)' — (217,138,59)! & (9,8, 7)' (mod26) and S - (20,12, 13)" = (415, 246,97)" = 
(25, 12, 19)" (mod26). Thus, the plaintext AUTUMN is mapped to the ciphertext JIHZMT. 


5.5.5 Methods of Classical Cryptanalysis 


The purpose of cryptanalytical investigations is to deduce from the ciphertext an optimum of infor- 
mation about the corresponding plaintext without knowing the key. These analyses are of interest not 
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only to an unauthorized “eavesdropper” but also help assess the security of cryptosystems from the 
user's point of view. 
5.5.5.1 Statistical Analysis 


Each natural language shows a typical frequency distribution of the individual letters, two-letter com- 
binations, words, etc. For example, in English the letter e is used most frequently: 





Letter | Relative frequency 
E, 12.7% 

T, A,O,1,N,S,H,R 56.9 % 

D,L 8.3 96 

C, U, M, W, F, G, Y, P,B | 19.9% 

V, K, J, X, Q,Z 2.2 % 


Given sufficiently long ciphertexts it is possible to break a monoalphabetic, monographic substitution 
on the basis of the frequency distribution of letters. 


5.5.5.2 Kasiski-Friedman Test 


Combining the methods of Kasiski and Friedman it is possible to break the Vignère cipher. The attack 
benefits from the fact that the encryption algorithm applies the key periodically. If the same string 
of plaintext letters is encrypted with the same portion of the key then the same string of ciphertext 
letters will be produced. A length > 2 of the distance of such identical strings in the ciphertext must 
be a multiple of the key length. In the case of several reoccurring strings of ciphertext the key length 
is a divisor of the greatest common divisor of all distances. This reasoning is called the Kasiski test. 
One should, however, be aware of erroneous conclusions due to the possibility that matches may occur 
accidentally. 

The Kasiski test permits the determination of the key length at most as a multiple of the true key length. 
The Friedman test yields the magnitude of the key length. Let n be the length of the ciphertext of some 
English plaintext encrypted by means of the Vignére method. Then the key length / is determined by 


e 0.027n 
~ (n—1)IC — 0.038n + 0.065 


Here IC denotes the coincidence index of the ciphertext. This index can be deduced from the number 
n; of occurrences of the letter a; (i € {0,1,...,25}) in the ciphertext: 


(5.2752) 


26 
Σ nj(nj — 1) 
I¢ ==, 5.275b 
n(n — 1) ( ) 
In order to determine the key, the ciphertext of length n is split into | columns. Since the Vignére cipher 
produces the contents of each column by means of a shift cipher, it suffices to determine the equivalence 
of Eon a column base. Should V be the most frequent letter within a column, then the Vignere tableau 
points to the letter R 
E 
: (5.275c) 
RV 
of the key. The methods described so far will not be successful if the Vignére cipher employs very long 
keys (e.g., as long as the plaintext). It is, however, possible to deduce whether the applied cipher is 
monoalphabetic, polyalphabetic with short period or polyalphabetic with long period. 


5.5.6 One-Time Pad 


The one-time pad is the only substitution cipher that is considered theoretically secure. The encryption 
adheres to the principle of the Vignere cipher, where the key is a random string of letters as long as the 


5.5 Cryptology 391 





plaintext. 

Usually, one-time pads are applied as binary Vignére ciphers: Plaintext and ciphertext are represented 
as binary numbers with addition modulo 2. In this particular case the cipher is involutory, which means 
that the twofold application of the cipher restores the original plaintext. A concrete implementation 
of the binary Vignére cipher is based on shift register circuits. These circuits combine switches and 
storage elements, whose states are 0 or 1, according to special rules. 


5.5.7 Public Key Methods 


Although the methods of conventional encryption can have efficient implementations with today’s com- 
puters, and although only a single key is needed for bidirectional communication, there are a number 
of drawbacks: 

1. The security of encryption solely depends on keeping the next key secret. 

2. Prior to any communication, the key must be exchanged via a sufficiently secured channel; sponta- 
neous communication is ruled out. 

3. Furthermore, no means exist to prove to a third party that a specific message was sent by an iden- 
tified sender. 


5.5.7.1 Diffie-Hellman Key Exchange 

The concept of encryption with public keys was developed by Diffie and Hellman in 1976. Each partic- 
ipant owns two keys: a public key that is published in a generally accessible register, and a private key 
that is solely known to the participant and kept absolutely secret. Methods with these properties are 
called asymmetric ciphers (see 5.5.2, p. 387). 

The public key KP; of the i-th participant controls the encryption step E;, his private key AS; the 
decryption step D;. The following conditions must be fulfilled: 

1. Dj o E; constitutes the identity. 

2. Efficient implementations for E; and D; are known. 

3. The private key KS; cannot be deduced from the public key KP; with the means available in the 
foreseeable future. If in addition 

4. also E; o D; yields the identity, 

then the encryption algorithm qualifies as an electronic signature method with public keys. The elec- 
tronic signature method permits the sender to attach a tamperproof signature to a message. 


If A wants to send an encrypted message m to B, then A retrieves B's public key K Pp from the register, 
applies the encryption algorithm Eg, and calculates Eg(m) = c. A sends the ciphertext c via the public 
network to B who will regain the plaintext of the message by decrypting c using his private key K Spg in 
the decryption function Dg: Dg(c) = Dg(Eg(m)) = m. In order to prevent tampering of messages, A 
can electronically sign his message m to B by complying with an electronic signature method with the 
public key in the following way: A encrypts the message m with his private key: D4(m) = d. A attaches 
to d his signature “A” and encrypts the total using the public key of B: Eg(Da(m), “A”) = Ep(d, 
“A”) =e. The text thus signed and encrypted is sent from A to B. 

The participant B decrypts the message with his private key and obtains Dp(e) — Dp(Ep(d, *A")) 
— (d, ^A"). Based on this text B can identify A as the sender and can now decrypt d using the public 
key of A : E4(d) — E4(Da(m)) =m. 


5.5.7.2 One-Way Function 


The encryption algorithms of a method with public key must constitute a one-way function with a “trap 
door”. A trap door in this context is some special, additional information that must be kept secret. An 
injective function f: X —> Y is called a one-way function with a trap door, if the following conditions 
hold: 


1. There is an efficient method to compute both f and f^. 
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2. The calculation of f~! cannot be deduced from f without the knowledge of the secret additional 
information. 


The efficient method to get f~t from f cannot be made without the secret additional information. 
5.5.7.3 RSA Codes and RSA Method 
1. RSA Codes 


Rivest, Shamir and Adleman (see [5.16]) developed an encryption scheme for secret messages on the 
basis of the Euler-Fermat theorem (see 5.4.4, 2., p. 381). The scheme is called the RSA algorithm after 
the initials of their last names. Part of the key required for decryption can be made public without 
endangering the confidentiality of the message; for this reason, the term public key code is used in this 
context as well. 

In order to apply the RSA algorithm the recipient B chooses two very large prime numbers p and q, 
calculates m — pq and selects a number r relatively prime to y(m) = (p— 1)(q— 1)and 1 < r < e(m). 
B publishes the numbers m and r because they are needed for decryption. 

For transmitting a secret message from sender A to recipient B the text of the message must be con- 
verted first to a string of digits that will be split into N blocks of the same length of less than 100 decimal 
positions. Now A calculates the remainder R of N” divided by m. 


N* z R(m). (5.27762) 
Sender A calculates the number FR for each of the blocks N that were derived from the original text 


and sends the number to B. The recipient can decipher the message R if he has a solution of the linear 
congruence rs = 1(y(m)). The number N is the remainder of R* divided by m: 


R? = (N°) = Nm) = IN. (NPM) = N(m). (5.276b) 


Here, the Euler-Fermat theorem (see 5.4.4, 2., p. 381) with N?™ = 1(m) has been applied. Eventu- 
ally, B converts the sequence of numbers into text. 
W A recipient B who expects a secret message from sender A chooses the prime numbers p = 29 and 
q = 37 (actually too small for practical purposes), calculates m = 29 - 37 = 1073 (and y(1073) = 
(29) - (37) = 1008)), and chooses r = 5 (it satisfies the requirement of gcd(1008, 5) = 1). B passes 
the values m — 1073 and r = 5 to A. 
A intends to send the secret message NV = 8 to B. A encrypts N into R = 578 by calculating N" — 8° = 
578 (1073), and just sends the value R = 578 to B. B solves the congruence 5 - s = 1 (1008), arrives at 
the solution s = 605, and thus determines R = 578° = 8 = N (1073). 
Remark: The security of the RSA code correlates with the time needed by an unauthorized listener 
to factorize m. Assuming the speed of today’s computers, a user of the RSA algorithm should choose 
the two prime numbers p and q with at least a length of 100 decimal positions in order to impose a 
decryption effort of approximately 74 years on the unauthorized listener. The effort for the authorized 
user, however, to determine an r relatively prime to y(pq) = (p — 1)(q — 1) is comparatively small. 
2. RSA Method 
The RSA method is the most popular asymmetric encryption method. 
1. Assumptions Let p and qbe two large prime numbers with pq ~ 1024 and n = pq. The number 
of decimal positions of p and q should differ by a small number; yet, the difference between p and q 
should not be too large. Furthermore, the numbers p — 1 and q — 1 should contain rather big prime 
factors, while the greatest common divisor of p — 1 and q — 1 should be rather small. Let e > 1 be 
relatively prime to (p — 1)(q — 1) and let d satisfy d: e 2 1 (mod(p— 1)(q— 1)). Now n and e represent 
the public key and d the private key. 
2. Encryption Algorithm 

E: {0,1,...,n-1} > {0,1,...,n-1} E(x) τς a* mod n. (5.277a) 
3. Decyphering Operations 

D: (0,1,...,n 11] — (0,1,...,n — 1) D(z) :- x^ modn. (5.277b) 
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Thus D(E(m)) = E(D(m) = m for message m. 

The function in this encryption method with n > 10?°° constitutes a candidate for a one-way function 
with trap door (see 5.5.7.2, p. 391). The required additional information is the knowledge of how to 
factor n. Without this knowledge it is infeasible to solve the congruence d-e = 1 (modulo (p—1)(q—1)). 
The RSA method is considered practically secure as long as the above conditions are met. A disadvan- 
tage in comparison with other methods is the relatively large key size and the fact that RSA is 1000 
times slower than DES. 


5.5.8 DES Algorithm (Data Encryption Standard) 


The DES method was adopted in 1976 by the National Bureau of Standards (now NIST) as the official 
US encryption standard. The algorithm belongs to the class of symmetric encryption methods (see 
5.5.2, p. 387) and still plays a predominant role among cryptographic methods. The method is, however, 
no longer suited for the encryption of top secret information because today’s technical means permit 
an attack by an exhaustive test trying all keys. 

The DES algorithm combines permutations and non-linear substitutions. The algorithm requires a 
56-bit key. Actually, a 64-bit key is used, however, only 56 bits can freely be chosen; the remaining 
eight bits serve as parity bits, one for each of the seven-bit blocks to yield odd parity. 

The plaintext is split into blocks of 64 bits each. DES transforms each 64-bit plaintext block into a 
ciphertext block of 64 bits. First, the plaintext block will be subject to an initial permutation and is 
then encrypted in 16 rounds, each operating with a different subkey K1, K2,..., Kig. The encryption 
completes with a final permutation that is the inverse of the initial permutation. 

Decryption uses the same algorithm with the difference that the subkeys are employed in reverse order 
Kiş, Kıis,..., Kı. 

The strength of the cipher rests on the nature of the mappings that are part of each round. It can be 
shown that each bit of the ciphertext block depends on each bit of the corresponding plaintext and on 
each bit of the key. 

Although the DES algorithm has been disclosed in full detail, no attack has been published so far that 
can break the algorithm without an exhaustive test of all 256 keys. 


5.5.9 IDEA Algorithm 
(International Data Encryption Algorithm) 


The IDEA algorithm was developed by LAI and MASSAY and patented 1991. It is a symmetric en- 
cryption method similar to the DES algorithm and constitutes a potential successor to DES. IDEA 
became known as part of the reputed software package PGP (Pretty Good Privacy) for the encryption 
of emails. In contrast to DES not only was the algorithm published but even its basic design criteria. 
The objective was the use of particularly simple operations (addition modulo 2, addition modulo 216, 
multiplication modulo 219*1). 

IDEA works with keys of 128 bits length. IDEA encrypts plaintext blocks of 64 bits each. The algo- 
rithm splits a block into four subblocks of 16 bits each. From the 128-bit key 52 subkeys are derived, 
each 16 bits long. Each of the eight encryption rounds employs six subkeys; the remaining four subkeys 
are used in the final transformation which constructs the resulting 64-bit ciphertext. Decryption uses 
the same algorithm with the subkeys in reverse order. 





IDEA is twice as fast as DES, its implementation in hardware, however, is more difficult. No successful 
attack against IDEA is known. Exhaustive attacks trying all 2°° keys are infeasible considering the 
length of the keys. 
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5.6 Universal Algebra 


A universal algebra consists of a set, the underlying set, and operations on this set. Simple examples are 
semigroups, groups, rings, and fields discussed in sections 5.3.2, p. 336; 5.3.3, p. 336 and 5.3.7, p. 361. 
Universal algebras (mostly many-sorted, i.e., with several underlying sets) are handled especially in 
theoretical informatics. There they form the basis of algebraic specifications of abstract data types 
and systems and of term-rewriting systems. 


5.6.1 Definition 


Let Q be a set of operation symbols divided into pairwise disjoint subsets Qa, n € N. Qo contains the 
constants, Qn, n > 0, contain the n-ary operation symbols. The family (Qn)nen is called the type or 
signature. If A is a set, and if to every n-ary operation symbol w € Qn an n-ary operation w^ in A is 
assigned, then A = (A, {w4|w € Q}) is called an Q algebra or algebra of type (or of signature) Q. 

If Q is finite, Q = {w1,..., wz}, then one also writes A = (Aw... . wit) for A. 

If a ring (see 5.3.7, p. 361) is considered as an Q algebra, then 2 is partitioned Q = {w1}, Q1 = {wo}, 
Q2 = {w3,w4}, where to the operation symbols w, we, w3, w4 the constant 0, taking the inverse with 
respect to addition, addition and multiplication are assigned. 

Let A and B be Q algebras. B is called an Q subalgebra of A, if B C A holds and the operations w? are 
the restrictions of the operations w4 (w € Q) to the subset B. 


5.6.2 Congruence Relations, Factor Algebras 


In constructing factor structures for universal algebras, the notion of congruence relation is needed. A 
congruence relation is an equivalence relation compatible with the structure: Let A = (A, {w4|w € 0}) 
be an Q algebra and R be an equivalence relation in A. R is called a congruence relation in A, if for all 
w E Qn (n € N) and all aj,b; E€ A with a;Rb; (i = 1,... n): 

w^ (a, ..., as) Ru (bi, ... , bs). (5.278) 
The set of equivalence classes (factor set) with respect to a congruence relation also form an Q algebra 


with respect to representative-wise calculations: Let A = (A, {w4|w € Q}) be an Q algebra and R be 
a congruence relation in A. The factor set A/R (see 5.2.4, 2., p. 334) is an Q algebra A/R with the 


following operations w4/ (w € Qn, n € N) with 


wW^/R(la]r,---, lan]r) = lw4(a,.-.,an)]r (5.279) 
and it is called the factor algebra of A with respect to R. 
The congruence relations of groups and rings can be defined by special substructures — normal sub- 
groups (see 5.3.3.2, 2. p. 338) and ideals (see 5.3.7.2, p. 362), respectively. In general, e.g., in semi- 
groups, such a characterization of congruence relations is not possible. 


5.6.3 Homomorphism 


Just as with classical algebraic structures, the homomorphism theorem gives a connection between the 
homomorphisms and congruence relations. 
Let A and B be € algebras. A mapping h: A — B is called a homomorphism, if for every w € Q, and 
alla4,...,a4 € A: 

h(w^ (ai, ...,a&)) 9 wP(h(ai),.. . , h(a,)). (5.280) 
If, in addition, h is bijective, then h is called an isomorphism; the algebras A and B are called isomor- 
phic. The homomorphic image h(A) of an Q algebra A is an 2 subalgebra of B. Under a homomorphism 


h, the decomposition of A into subsets of elements with the same image corresponds to a congruence 
relation which is called the kernel of h: 


ker h = {(a,b) € A x Alh(a) = h(b)}. (5.281) 
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5.6.4 Homomorphism Theorem 


Let A and B be Q algebras and h: A — B a homomorphism. h defines a congruence relation ker h in 
A. The factor algebra A/ ker h is isomorphic to the homomorphic image h(A). 

Conversely, every congruence relation R defines a homomorphic mapping natg: A — A/R with natg(a) 
— [a|g. Fig. 5.19 illustrates the homomorphism theorem. 


5.6.5 Varieties 

A variety V is a class of Q algebras, which is closed 

ο ο under forming direct products, subalgebras, and 
homomorphic images, i.e., these formations do not 
lead out of V. Here the direct products are defined 
in the following way: 

nat ker h [| [ | L] Considering the operations corresponding to Q 


componentwise on the Cartesian product of the un- 





derlying sets of Q algebras, an Q algebra, the direct 
|] [ | [ | product of these algebras is obtained. The theorem 
A/kerh of Birkhoff (see 5.6.6, p. 395) characterizes the va- 
rieties as those classes of Q algebras, which can be 

Figure 5.19 equationally defined. 


5.6.6 Term Algebras, Free Algebras 


Let (Qn)nen be a type (signature) and X a countable set of variables. The set To(X) of Q terms over 
X is defined inductively in the following way: 
1. X U Qo C To(X). 
2. If &,...,t, € To(X) and v € Q, hold, then also wt; . . . tn E To(X) holds. 
The set To(X) defined in this way is an underlying set of an Q algebra, the term algebra To(.X ) of type 
Q over X, with the following operations: If t1,...,t, € To(X) and w € Qp hold, then T9 C9 is defined 
by 

WPC) t... ta) = wti.. tn: (5.282) 
Term algebras are the “most general” algebras in the class of all Q algebras, i.e., no “identities” are 
valid in term algebras. These algebras are called free algebras. 
An identity is a pair (s(x£1,... , En), t(£1,. - - , &n)) of Q terms in the variables z4,..., v4. An Q algebra 
A satisfies such an equation, if for every a4,...,a, € A holds: 

s^(ay,... , a4) — t^(a1, ..., a4). (5.283) 
A class of Q algebras defined by identities is a class of Q algebras satisfying a given set of identities. 
Theorem of Birkhoff: The classes defined by identities are exactly the varieties. 
W Varieties are for example the classes of all semigroups, groups, Abelian groups, and rings. But, e.g., 
the direct product of cyclic groups is not a cyclic group, and the direct product of fields is not a field. 
Therefore cyclic groups or fields do not form a variety, and cannot be defined by equations. 


5.7 Boolean Algebras and Switch Algebra 


Calculating rules, similar to the rules established in 5.2.2, 3., p. 329 for set algebra and propositional 
calculus (5.1.1, 6., p. 324), can be found for other objects in mathematics too. The investigation of 
these rules yields the notion of Boolean algebra. 


5.7.1 Definition 
A set B, together with two binary operations M (“conjunction”) and U (“disjunction”), and a unary 
operation (“negation”), and two distinguished (neutral) elements 0 and 1 from B, is called a Boolean 





396 5. Algebra and Discrete Mathematics 





algebra B = (B, N, U, `, 0, 1) if the following properties are valid: 
(1) Associative Laws: 


(arb) re 2 an (bnc), 5.284) (aub) ue au (buc). (5.285 





(2) Commutative Laws: 


alib —brla, 5.286) alib —bLla. (5.287 








(3) Absorption Laws: 
an (aub) =a, 5.288) aU (anb) =a. (5.289 




















(4) Distributive Laws: 

(aub) Nnce= (anc)u (bne), (5.290) (anb)uc= (auc)n(buc). (5.291 
(5) Neutral Elements: 
anl=a, 5.292) all0 — a, (5.293 














a0 - 0, 5.294) alll- 1, (5.295 


(6) Complement: 


























ana=0, 5.296) alla — 1. (5.297 


A structure with the associative laws, commutative laws, and absorption laws is called a lattice. If the 
distributive laws also hold, then the lattice is called a distributive lattice. So a Boolean algebra is a 
special distributive lattice. 

Remark: The notation used for Boolean algebras is not necessarily identical to the notation for the 
operations in propositional calculus. 


5.7.2 Duality Principle 


1. Dualizing 

In the “axioms” of a Boolean algebra is included the following duality: Replacing N by U, U by F1, 0 
by 1, and 1 by 0 in an axiom gives always the other axiom in the same row. The axioms in a row are 
dual to each other, and the substitution process is called dualization. The dual statement follows from 
a statement of the Boolean algebra by dualization. 

2. Duality Principle for Boolean Algebras 

The dual statement of a true statement for a Boolean algebra is also a true statement for the Boolean 
algebra, i.e., with every proved proposition, the dual proposition is also proved. 

3. Properties 

One gets, e.g., the following properties for Boolean algebras from the axioms. 

(E1) The Operations M and L! are Idempotent: 


aNla=a, (5.298) alla — a. (5.299) 








(E2) De Morgan Rules: 


anb=aub, (5.300) alib —arb, (5.301) 























(E3) A further Property: 
G =a. (5.302) 
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It is enough to prove only one of the two properties in any line above, because the other one is the dual 
property. The last property is self-dual. 


5.7.3 Finite Boolean Algebras 
All finite Boolean algebras can be described easily up to “isomorphism”. Let Bı, B2 be two Boolean 
algebras and f: Bı > Bə a bijective mapping. f is called an isomorphism if 

f(anb) = f(a) f(b), faub)= faus) and f(@) = Fla) (5.303) 
hold. Every finite Boolean algebra is isomorphic to the Boolean algebra of the power set of a finite set. 
In particular every finite Boolean algebra has 2" elements, and every two finite Boolean algebras with 
the same number of elements are isomorphic. 





Hereafter B denotes the Boolean algebra with two elements (0, 1) and with the operations 


πιο 1 ulo 1 — 
0/0 0 010 1 0/1 
1|0 1 pili 1 1190 


Defining the operations M, U, and~ componentwise on the n-times Cartesian product B" = {0,1} x 

- x {0,1}, then B" will be a Boolean algebra with 0 = (0,...,0) and 1 = (1,...,1). B" is called 
the n times direct product of B. Because B” contains 2” elements, this way one gets all finite Boolean 
algebras (out of isomorphism). 


5.7.4 Boolean Algebras as Orderings 


An order relation can be assigned to every Boolean algebra B: Here a < b holds if aM b = a is valid (or 
equivalently, if a L1 b — b holds). 
So every finite Boolean algebra can be represented by a Hasse diagram (see 5.2.4, 4., p. 334). 


W Suppose B is the set (1,2,3,5,6,10, 15, 30] of the divisors of 30. Then, the * 
least common multiple and the greatest common divisor can be defined as bi- 
nary operations and the complement as unary operation. The numbers 1 and 


P 30 
30 correspond to the distinguished elements 0 and 1. The corresponding Hasse e lo, 15 
diagram is shown in Fig. 5.20. <L 
ES fo > 
1 











5.7.5 Boolean Functions, Boolean Expressions 


1. Boolean Functions 





Denoting by B the Boolean algebra with two elements as in 5.7.3, p. 397, then . 

an n-ary Boolean function f is a mapping from B" into B. There are 2?” n-ary . 

Boolean functions. The set of all n-ary Boolean functions with the operations Figure 5.20 
(FD g)(b) = f(b) T g0), (5.304) (F U g)(b) = f(b) U g(b), (5.305) 
Fe) = f(b), (5.306) 


is a Boolean algebra. Here b always means an n tuple of the elements of B = {0,1}, and on the right- 
hand side of the equations the operations are performed in B. The distinguished elements 0 and 1 
correspond to the functions fo and fi with 


fo(b) =0, fi(b)=1 forall be B”. (5.307) 
ΒΑ: Inthe case n = 1, i.e., for only one Boolean variable b, there are four Boolean functions: 
Identity f(b) =6, Negation f(b) =), (5.308) 


Tautology f(b) =1, Contradiction f(b) — 0. 


W B: Inthecasen = 2, i.e., for two Boolean variables a and b, there are 16 different Boolean functions, 
among which the most important ones have their own names and notation. They are shown in Table 
5.6. 





398 5. Algebra and Discrete Mathematics 





Table 5.6 Some Boolean functions with two variables a and b 


Value table for 



















































































Name of the | Different Different s 0 0 1 1 
i i bol = ; 

function notation symbols B (Ὁ) ; 0) ; (Ὁ) ; (i) 

Sheffer a-b 

ii alb &i— D- ip Wa X2 d 

NAND NAND (a,b) 

Peirce a+b . 

a a8 zl = 1, 0, 0, 0 

NOR NOR a,b 

] b + αὖ 

Antivalence s LaL 

or E n z U 3 d Ὁ 
a 

ZOR apb 
ab+ab L14— © 

Equivalence amb 14 0 0 1 
a € b 

Implication a+b 1, 1 0, 1 
α -ὃ b 




















2. Boolean Expressions 
Boolean expressions are defined in an inductive way: Let X = {,y,z,...} be a (countable) set of 
Boolean variables (which can take values only from {0, 1}): 

1. The constants 0 and 1 just as the Boolean variables from X are 


Boolean expressions. (5.309) 
2. If S and T are Boolean expressions, so are T, (S n T), and (S U T), as well. (5.310) 
If a Boolean expression contains the variables x71,...,2n, then it represents an n-ary Boolean function 


fr: 
Let b be a “valuation” of the Boolean variables 21, ..., an, ie., b = (bi,..-,0n) € B". 
Assigning a Boolean function to the expression T in the following way gives: 


1. If T — 0, then fr — fo; ifT =1, then fr = fi. (5.311a 
2. ET =2;, then fr(b) = bj; if T = 5, then fz(b) = fs(b). (5.311b 
3. If 7 = RNS, then fr(b) — fg(b) 1 fs(b). (5.311c 
4. IT — RU S, then fr(b) — fa(b) ὯΙ fs(b). (5.311d 








) 
) 
) 
) 




















On the other hand, every Boolean function f can be represented by a Boolean expression T' (see 5.7.6, 
p. 399). 

3. Concurrent or Semantically Equivalent Boolean Expressions 

The Boolean expressions S and T are called concurrent or semantically equivalent if they represent the 
same Boolean function. Boolean expressions are equal if and only if they can be transformed into each 
other according to the axioms of a Boolean algebra. 

Under transformations of a Boolean expression here are considered especially two aspects: 

e Transformation in a possible “simple” form (see 5.7.7, p. 399). 

e Transformation in a “normal form”. 
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5.7.6 Normal Forms 


1. Elementary Conjunction, Elementary Disjunction 
Let B — (B,T,U, ,0,1) be a Boolean algebra and {21,...,%n} a set of Boolean variables. Every 
conjunction or disjunction in which every variable or its negation occurs exactly once is called an ele- 


mentary conjunction or an elementary disjunction respectively (in the variables 21, ..., an). 
Let T (z1,...,24) be a Boolean expression. A disjunction D of elementary conjunctions with D = T is 


called a principal disjunctive normal form (PDNF) of T. A conjunction C of elementary disjunctions 
with C = T is called a principal conjunctive normal form (PCNF) of T. 

W Part 1: In order to show that every Boolean function f can be represented as a Boolean expression, 
the PDNF form of the function f given in the annexed table is to be constructed: 














x y z| f(x,y,z) The PDNF of the Boolean function f contains the elementary conjunc- 
tions TN YN z,2Nylnz,«nylNz. These elementary conjunctions belong 
000 0 to the valuations b of the variables where the function f has the value 1. 
0 0 1 1 If a variable v has the value 1 in b, then v is to put in the elementary con- 
o i 0 i junction, otherwise v. 
100 0 W Part 2: The PDNF for the example of Part 1 is: 
1 0 1 1 @ngn z) u (ngn z)u (nyn z). (5.312) 
110 1 The “dual” form for PDNF is the PCNF: The elementary disjunctions be- 
1 4 1 0 long to the valuations b of the variables for which f has the value 0. 
If a variable v has the value 0 in b, then v is to put in the elementary disjunction, otherwise v. So the 
PCNF is: 
(zllyLz)rn(zuguz)n(cugtusz)n(zugtusz)n(zugts). (5.313) 


The PDNF and the PCNF of f are uniquely determined, if the ordering of the variables and the ordering 
of the valuations is given, e.g., if considering the valuations as binary numbers and arranging them in 
increasing order. 

2. Principal Normal Forms 

The principal normal form of a Boolean function fr is considered as the principal normal form of the 
corresponding Boolean expression T. 

Checking the equivalence of two Boolean expressions by transformations is often difficult. The prin- 
cipal normal forms are useful: Two Boolean expressions are semantically equivalent exactly if their 
corresponding uniquely determined principal normal forms are identical letter by letter. 

W Part 3: In the considered example (see Part 1 and 2) the expressions (y N z) U (x N y N Z) and 
(x LI ((y Ll z) 1 (gLl z) 1 (gU z))) D (x LU ((y LI z) P1 (FU z))) are semantically equivalent because the 
principal disjunctive (or conjunctive) normal forms of both are the same. 


5.7.7 Switch Algebra 


A typical application of Boolean algebra is the simplification of series-parallel connections (SPC). 
Therefore a Boolean expression is to be assigned to a SPC (transformation). This expression will be 
"simplified" with the transformation rules of the Boolean algebra. Finally a SPC is to be assigned to 
this expression (inverse transformation). The result is a simplified SPC which produces the same be- 
havior as the initial connection system (Fig. 5.21). 








A SPC has two types of contact points: the so-called “make contacts” and “break contacts”, and both 
types have two states; namely open or closed. The usual symbolism is: When the equipment is put on, 
the make contacts close and the break contacts open. With Boolean variables assigned to the contacts 
of the switch equipment follows: 

The position “off” or “on” of the equipment corresponds to the value 0 or 1 of the Boolean variables. 
The contacts being switched by the same equipment are denoted by the same symbol, the Boolean 
variable belonging to this equipment. The contact value of a SPC is 0 or 1, according to whether the 
switch is electrically non-conducting or conducting. The contact value depends on the position of the 
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contacts, so it is a Boolean function S (switch function) of the variables assigned to the switch equip- 
ment. Contacts, connections, symbols, and the corresponding Boolean expressions are represented in 


Fig. 5.22. 
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Figure 5.23 


Figure 5.24 


The Boolean expressions, which represent switch functions of SPC, have the special property that the 


negation sign can occur only above variables (never 


over subexpressions). 


E Simplification of the SPC Fig. 5.23. This connection corresponds to the Boolean expression 










































































S — (ar1b) U (arbre) u (an (bu c)) (5.314) 
as switch function. According to the transformation formulas of Boolean algebra holds: 
S = (bh (@U (aNe)))U (GN (bUc)) 
— (br1(aulc))u (an (bL c)) 
= (anb) u (bnc) u (ar1c) 
— (aribric)u(anbrne)u(bnoe)u(anbrne)u(arne)u(anbne) 
— (arte)u (bro). (5.315) 
Here one gets @c from (@NbNc)U (@Nc) U(@NbN ce), and bNZ from (@NbNz)U(bNe)U(anbne). 





The finally simplified result SPC is shown in Fig. 5. 
This example shows that usually it is not so easy to 
mations. In the literature one can find different met 








24. 
get the simplest Boolean expression by transfor- 
10ds for this procedure. 
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5.8 Algorithms of Graph Theory 


Graph theory is a field in discrete mathematics having special importance for informatics, e.g., for rep- 
resenting data structures, finite automata, communication networks, derivatives in formal languages, 
etc. There are also applications in physics, chemistry, electrotechnics, biology and psychology. More- 
over, flows can be applied in transport networks and in network analysis in operations research and in 
combinatorial optimization. 


5.8.1 Basic Notions and Notation 


1. Undirected and Directed Graphs 

A graph G is an ordered pair (V, E) of a set V of vertices and a set E of edges. There is a mapping, 
defined on E, the incidence function, which uniquely assigns to every element of E an ordered or non- 
ordered pair of (not necessarily distinct) elements of V. If a non-ordered pair is assigned then G is 
called an undirected graph (Fig. 5.25). If an ordered pair is assigned to every element of E, then the 
graph is called a directed graph (Fig. 5.26), and the elements of E are called arcs or directed edges. All 
other graphs are called mized graphs. 

In the graphical representation, the vertices of a graph are denoted by points, the directed edges by 
arrows, and undirected edges by non-directed lines. 





" Va Vs ' V, Vs 

e: O e: Ο 
" ; ' ; 
νι 6 e νι e e; 

" ' 

6ι Vi Εἰ i 
ν; V2 
Figure 5.25 Figure 5.26 Figure 5.27 


W A: For the graph G in Fig. 5.27: V — (vi, vo, v3, v4, v5), E = {e1, €2, 3, €4, 5, €6, C7}, 

Siler) = {v1, v2}, fi(e2) = (i, va), files) = (vas vs), fi(ea) = (vs, v4), files) = (v3, va), 

Files) = (va, vo), filer) = (vs, vs). 

WB DB: For the graph G in Fig. 5.26: V — (vi, vo, va, v5], E' — (es, €h, €b, e4} 

fa(e) — (va, va), fa(e5) — (va, va), fa(es) = (va, v2), falez) = (vs, vs). 

E C: For the graph G in Fig. 5.25: V = {vj, vo, v3, U4, U5}, E” = (et eo es. et). 

falei) = {v2 us}, falez) = {va v3}, falez) = Qva, val, fa(et) = {us, vs}. 

2. Adjacency 

If (v, w) € E, then the vertex v is called adjacent to the vertex w. Vertex v is called the initial point of 
(v, w), w is called the terminal point of (v, w), and v and w are called the endpoints of (v, w). 
Adjacency in undirected graphs and the endpoints of undirected edges are defined analogously. 

3. Simple Graphs 

If several edges or arcs are assigned to the same ordered or non-ordered pairs of vertices, then they 
are called multiple edges. An edge with identical endpoints is called a loop. Graphs without loops and 
multiple edges and multiple arcs, respectively, are called simple graphs. 

4. Degrees of Vertices 

The number of edges or arcs incident to a vertex v is called the degree da (v) of the vertex v. Loops are 
counted twice. Vertices of degree zero are called isolated vertices. 

For every vertex v of a directed graph G, the out-degree d&(v) and in-degree dg(v) of v are distinguished 
as follows: 


d&(v) = |{w|(v, w) € E}I, (5.316a) da(v) ^ |tw|(w,v) € E). (5.316b) 
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5. Special Classes of Graphs 

Finite graphs have a finite set of vertices and a finite set of edges. Otherwise the graph is called infinite. 
In regular graphs of degree r every vertex has degree r. 

An undirected simple graph with vertex set V is called a complete graph if any two different vertices in 
V are connected by an edge. A complete graph with an n element set of vertices is denoted by Ky. 

If the set of vertices of an undirected simple graph G can be partitioned into two disjoint classes X and 
Y such that every edge of G joins a vertex of X and a vertex of Y , then G is called a bipartite graph. 
A bipartite graph is called a complete bipartite graph, if every vertex of X is joined by an edge with 
every vertex of Y. If X has n elements and Y has m elements, then the graph is denoted by Kym. 

W Fig. 5.28 shows a complete graph with five vertices. 


W Fig. 5.29 shows a complete bipartite graph with a two-element set X and a three-element set Y . 


1 
5 2 X, m 
K; = X, " 
4 3 Ys 
Figure 5.28 Figure 5.29 


Further special classes of graphs are plane graphs, trees and transport networks. Their properties will 
be discussed in later paragraphs. 

6. Representation of Graphs 

Finite graphs can be visualized by assigning to every vertex a point in the plane and connecting two 
points by a directed or undirected curve, if the graph has the corresponding edge. There are examples 
in Fig. 5.30-5.33. Fig. 5.33 shows the Petersen graph, which is a well-known counterexample for 
several graph-theoretic conjectures, which could not be proved in general. 


a d. WU 


Figure 5.30 Figure 5.31 Figure 5.32 Figure 5.33 


7. Isomorphism of Graphs 

A graph Gi = (V1, £1) is called isomorphic to a graph Gz = (V2, Γι) iff there are bijective mappings 
y from V; onto V2 and ψ from E4 onto E3 being compatible with the incidence function, i.e., if u, v 
are the endpoints of an edge or u is the initial point of an arc and v is its terminal point, then y(u) 
and (v) are the endpoints of an edge and y(u) is the initial point and (v) the terminal point of 
an arc, respectively. Fig. 5.34 and Fig. 5.35 show two isomorphic graphs. The mapping y with 
(1) = a, (2) = b, (3) = c, v(4) = dis an isomorphism. In this case, every bijective mapping of 
{1, 2,3, 4} onto {α, b, c, d) is an isomorphism, since both graphs are complete graphs with equal number 
of vertices. 

8. Subgraphs, Factors 

If G — (V, E) is a graph, then the graph G' — (V', E") is called a subgraph of G, if V' C V and E' C E. 
If E' contains exactly those edges of E which connect vertices of V', then G' is called the subgraph of G 
induced by V' (induced subgraph). 
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1 2 a b 


Figure 5.34 Figure 5.35 


A subgraph G' — (V', E") of G — (V, E) with V' — V is called a partial graph of G. 

A factor F of a graph G is a regular subgraph of G containing all vertices of G. 

9. Adjacency Matrix 

Finite graphs can be described by matrices: Let G = (V, E) be a graph with V = {v1, v2,...,Un} and 
E = {e1,¢€2,...,em}. Let m(vi, v) denote the number of edges from v; to vj. For undirected graphs, 
loops are counted twice; for directed graphs loops are counted once. The matrix A of type (n, n) with 
A = (m(v;,v;)) is called an adjacency matrix. If in addition the graph is simple, then the adjacency 
matrix has the following form: 


z — fl, for(vivj) € E, 
A - (uj) - 05) for (vi, vj) € E; (5.317) 
ie. in the matrix A there is a 1 in the i-th row and j-th column iff there is an edge from v; to vj. 
The adjacency matrix of undirected graphs is symmetric. 
E A: Beside Fig. 5.36 there is the adjacency matrix Ay = A(G‘) of the directed graph ΟἽ. 


E B: Beside Fig. 5.37 there is the adjacency matrix Ay = A(G») of the undirected simple graph G3. 


Vi 
MA νι 010101 
0100 Ve v 101010 
I | [0000 010101 
τ At | 0103 we v, A2=| 101010 
v, 0100 010101 
Vs 101010 


Figure 5. 
igure 5.36 Figure 5.37 


10. Incidence Matrix 

For an undirected graph G — (V, E) with V = {v, v2,...,Un} and E = {e1,e2,...,em}, the matrix I 
of type (n,m) given by 

0, viis not incident with ej, 

1, wjisincident with e; and e; is not a loop, (5.318) 
2, viis incident with ej and ej is a loop 

is called the incidence matriz. 

For a directed graph G = (V, E) with V = {v1,v2,..., Un} and E = {¢1,€2,..., €m}, the incidence 
matrix I is the matrix of type (n, m), defined by 


; 


I = (bij) with bi = | 


᾽ 


vj is not incident with e;, 

vj is the initial point of ej and e; is not a loop, 
viis the terminal point of e; and e; is not a loop, 
vj is incident to e; and ej is a loop. 


᾿ 


0 
I-(bj)wihb;-4 PD (5.319) 
-0 


᾿ 


᾿ 


11. Weighted Graphs 
If G — (V, E) is a graph and f is a mapping assigning a real number to every edge, then (V, E, f) is 
called a weighted graph, and f(e) is the weight or length of the edge e. 
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In applications, these weights of the edges represent costs resulting from the construction, maintenance 
or use of the connections. 


5.8.2 Traverse of Undirected Graphs 


5.8.2.1 Edge Sequences or Paths 
1. Edge Sequences or Paths 


In an undirected graph G = (V, E) every sequence F — ((vi, va), {v2, v3}, ..., (Us, Ust1}) of the ele- 
ments of E is called an edge sequence of length s. 1 

If vı = vs41, then the sequence is called a cycle, otherwise it is an open edge 

sequence. An edge sequence F is called a path iff v1, v9, ..., v, are pairwise 

distinct vertices. A closed pathis a circuit. A trailis a sequence of edges with- 5 2 


out repeated edges. 
E In the graphs in Fig. 5.38, F, = ({1, 2}, {2,3}, {3, 5}, { 
{ 


5}, {5, 2}, {2, 4}) is 
an edge sequence of length 5, Fy = ({1,2}, {2,3}, {3,4}, 
) 


4,2}, {2, 1}) is 


a cycle of length 5, Fs = ({2,3}, {3,5}, {5,2}, {2,1}) is a path, Fy = 4 3 
({1, 2}, {2,3}, {3,4}) is a path. An elementary cycle is given by F5 = . " 
({1, 2}, {2,5}, (5,19. Figure 5.38 


2. Connected Graphs, Components 

If there is at least one path between every pair of distinct vertices v, w in a graph G, then G is called 
connected. If a graph G is not connected, it can be decomposed into components, i.e., into induced 
connected subgraphs with maximal number of vertices. 


3. Distance Between Vertices 
The distance ó(v, w) between two vertices v, w of an undirected graph is the length of a path with 
minimum number of edges connecting v and w. If such a path does not exist, then let 6(v, w) = οο. 


4. Problem of Shortest Paths 
Let G = (V, E, f) be a weighted simple graph with f(e) > 0 for every e € E. Determine the shortest 
path from v to w for two vertices v, w of G, i.e., a path from v to w having minimum sum of weights of 
edges and arcs, respectively. 
There is an efficient algorithm of Dantzig to solve this problem, which is formulated for directed graphs 
and can be used for undirected graphs (see 5.8.6, p. 410) in a similar way. 
Every graph G — (V, E, f) with V = {v, v2,..., Un} has a distance matriz D of type (n, n): 

D-(di) with dij —Óó(vi,v;) (à, j — 1,2,...,n). (5.320) 
In the case that every edge has weight 1, i.e., the distance between v and w is equal to the minimum 
number of edges which have to be traversed in the graph to get from v to w, then the distance between 
two vertices can be determined using the adjacency matrix: Let v1, vo, ..., v, bethe vertices of G. The 
adjacency matrix of G is A = (a;;), and the powers of the adjacency matrix with respect to the usual 
multiplication of matrices (see 4.1.4, 5., p. 272) are denoted by A?" — (a5), m € N. 
There is a shortest path of length k from the vertex v; to the vertex v; (i Z j) iff: 

ak #0 and af,=0 (s=1,2,...,k—1). (5.321) 
W Tho weighted graph represented in Fig. 5.39 has the distance matrix D beside it. 
E The graph represented in Fig. 5.40 has the adjacency matrix A beside it, and for m — 2 or m = 3 
the matrices A? and A? are obtained. Shortest paths of length 2 connect the vertices 1 and 3, 1 and 4, 


1 and 5, 2 and 6, 3 and 4, 3 and 5, 4 and 5. Furthermore the shortest paths between the vertices 1 and 
6, 3 and 6, and finally 4 and 6 are of length 3. 
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Figure 5.40 
5.8.2.2 Euler Trails 


1. Euler Trail, Euler Graph 

A trail containing every edge of a graph G is called an open or closed Euler trail of G. 

A connected graph containing a closed Euler trail is an Euler graph. 

W The graph G; (Fig. 5.41) has no Euler trail. The graph G2 (Fig. 5.42) has an Euler trail, but it 
is not an Euler graph. The graph G5 (Fig. 5.43) has a closed Euler trail, but it is not an Euler graph. 
The graph G4 (Fig. 5.44) is an Euler graph. 


G, G, G, G, 
Figure 5.41 Figure 5.42 Figure 5.43 Figure 5.44 


2. Theorem of Euler-Hierholzer 

A finite connected graph is an Euler graph iff all vertices have positive even degrees. 

3. Construction of a Closed Euler Trail 

If Gis an Euler graph, then one chooses an arbitrary vertex vı of G and constructs a trail F} by traversing 
a path, starting at v4 and proceeding until it cannot be continued. If Fi does not yet contain all edges of 
G, then one constructs another path F5 containing the edges not in F1, but starting at a vertex v € Fi 
and proceeds until it cannot be continued. Then one composes a closed trail in Œ using F} and Fy: 
Starting to traverse Fi at v, until vg is reached, then continuing to traverse F5, and finishing at the 
edges of F; not used before. Repeating this method a closed Euler trail is obtained in finitely many 
steps. 

4. Open Euler Trails 

There is an open Euler trail in a graph G iff there are exactly two vertices in G with odd degrees. 
Fig. 5.45 shows a graph which has no closed Euler trail, but it has an open Euler trail. The edges are 
consecutively enumerated with respect to an Euler trail. In Fig. 5.46 there is a graph with a closed 
Euler trail. 
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Figure 5.45 Figure 5.46 Figure 5.47 


5. Chinese Postman Problem 
The problem, that a postman should pass through all streets in his service area at least once and return 
to the initial point and use a trail as short as possible, can be formulated in graph theoretical terms as 
follows: Let G — (V, E, f) bea weighted graph with f(e) > 0 for every edge e € E. Determine an edge 
sequence F with minimum total length 

L-— M f(e). (5.322) 

ecF 

The name of the problem refers to the Chinese mathematician Kuan, who studied this problem first. 
Το solve it two cases are distinguished: 
1. Gis an Euler graph — then every closed Euler trail is optimal — and 
2. G has no closed Euler trail. 
An effective algorithm solving this problem is given by Edmonds and Johnson (see [5.25]). 


5.8.2.3 Hamiltonian Cycles 


1. Hamiltonian Cycle 

A Hamiltonian cycle is an elementary cycle in a graph covering all of the vertices. 

W 1n Fig. 5.47, lines in bold face show a Hamiltonian cycle. 

The idea of a game to construct Hamiltonian cycles in the graph of a pentagondodecaeder, goes back 
to Sir W. Hamilton. 

Remark: The problem of characterizing graphs with Hamiltonian cycles leads to one of the classical 
NP-complete problems. Therefore, an efficient algorithm to determine the Hamilton cycles cannot be 
given here. 

2. Theorem of Dirac 

If a simple graph G = (V, E) has at least three vertices, and dg(v) > |V|/2 holds for every vertex v of 
G, then G has a Hamiltonian cycle. This is a sufficient but not a necessary condition for the existence 
of Hamiltonian cycles. The following theorems with more general assumptions give only sufficient but 
not necessary conditions for the existence of Hamilton cycles, too. 


W Fig. 5.48 shows a graph which has a Hamiltonian cycle, but does 
not satisfy the assumptions of the following theorem of Ore. 
3. Theorem of Ore 
If a simple graph G = (V, £) has at least three vertices, and dg(v) + 
dg(w) > |V| holds for every pair of non-adjacent vertices v, w, then G 
contains a Hamiltonian cycle. 
4. Theorem of Posa 
Let G — (V, E) bea simple graph with at least three vertices. There is 
Figure 5.48 a Hamiltonian cycle in G if the following conditions are satisfied: 
1. For 1 <k < (|V| — 1)/2, the number of vertices of degree not exceeding k is less than k. 
2. If|V| is odd, then the number of vertices of degree not exceeding (|V| — 1)/2 is less than or equal to 
(V| - 0/2. 
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Figure 5.49 Figure 5.50 Figure 5.51 


5.8.3 Trees and Spanning Trees 


5.8.3.1 Trees 


1. Trees 

An undirected connected graph without cycles is called a tree. Every tree with at least two vertices has 
at least two vertices of degree 1. Every tree with n vertices has exactly n — 1 edges. 

A directed graph is called a tree if G is connected and does not contain any circuit (see 5.8.6, p. 410). 
W Fig. 5.49 and Fig. 5.50 represent two non-isomorphic trees with 14 vertices. They demonstrate 
the chemical structure of butane and iso-butane. 


2. Rooted Trees 

A tree with a distinguished vertex is called a rooted tree, and the distinguished vertex is called the 
root. In diagrams, the root is usually on the top, and the edges are directed downwards from the root 
(see Fig. 5.51). Rooted trees are used to represent hierarchic structures, as for instance hierarchies in 
factories, family trees, grammatical structures. 

Hf Fig. 5.51 shows the genealogy of a family in the form of a rooted tree. The root is the vertex assigned 
to the father. 


3. Regular Binary Trees 

If a tree has exactly one vertex of degree 2 and otherwise only vertices of degree 1 or 3, then it is called 
a regular binary tree. 

The number of vertices of a regular binary tree is odd. Regular trees with n vertices have (n + 1)/2 
vertices of degree 1. The level of a vertex is its distance from the root. The maximal level occurring 
in a tree is the height of the tree. There are several applications of regular binary rooted trees, e.g., in 
informatics. 


4. Ordered Binary Trees 
Arithmetical expressions can be represented by binary trees. Here, the numbers and variables are as- 
signed vertices of degree 1, the operations “+” ,“—”, “-” correspond to vertices of degree > 1, and the 
left and right subtree, respectively, represents the first and second operand, respectively, which is, in 
general, also an expression. These trees are called ordered binary trees. 
The traverse of an ordered binary tree can be performed in three different ways, which are defined in a 
recursive way (see also Fig. 5.52): 
Inorder traverse: Traverse the left subtree of the root (in inorder traverse), 
visit the root, 
traverse the right subtree of the root (in inorder traverse). 
Preorder traverse: Visit the root, 
traverse the left subtree (in preorder traverse), 
traverse the right subtree of the root (in preorder traverse). 
Postorder traverse: Traverse the left subtree of the root (in postorder traverse), 
traverse the right subtree of the root (in postorder traverse), 
visit the root. 
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Using inorder traverse the order of the terms does not change in comparison with the given expression. 
The term obtained by postorder traverse is called postfix notation PN or Polish notation. Analogously, 
the term obtained by preorder traverse is called prefix notation or reversed Polish notation. 

Prefix and postfix expressions uniquely describe the tree. This fact can 

be used for the implementation of trees. (+) 

E In Fig. 5.52 the term a-(b—c) +d is represented by a graph. Inorder 

traverse yields a - b — c+ d, preorder traverse yields + - —bcad, and ο (d) 
postorder traversal yields abc — -d+. 


5.8.3.2 Spanning Trees @) 9 


1. Spanning Trees (5) © 
A tree, being a subgraph of an undirected graph G, and containing all 
vertices of G, is called a spanning tree of G. Every finite connected graph 


: - Figure 5.52 
G contains a spanning tree H: 


If G contains a cycle, then delete an edge of this cycle. 


The remaining graph G; is still connected and can be i 1 

transformed into a connected graph G» by deleting a fur- οκ... 
ther edge of a cycle of G;, if there exists such an edge. Af- 2 2 

ter finitely many steps a spanning tree of G is obtained. 3 4 3 4 


W Fig. 5.54 shows a spanning tree H of the graph G 
shown in Fig. 5.53. 
2. Theorem of Cayley 
Every complete graph with n vertices (n » 1) has exactly n"? spanning trees. 
3. Matrix Spanning Tree Theorem 
Let G — (V, E) bea graph with V = {v1, v2,...,un} (n > 1) and E = {e1, €2,...,em}. Define a matrix 
D = (dij) of type (n,n): 
0 for i Z 1, 
dg(v;) for i = J, 
which is called the degree matrix. The difference between the degree matrix and the adjacency matrix 
is the admittance matrix L of G: 
L=D-A. (5.323b) 
Deleting the i-th row and the i-th column of L the matrix L; is obtained. The determinant of L; is 
equal to the number of spanning trees of the graph G. 


Figure 5.53 Figure 5.54 


dij = 


(5.323a) 


E The adjacency matrix, the degree matrix and the admittance matrix of the graph in Fig. 5.53 are: 


2110 4000 2-1-1 0 
1020 0300 -1 3-2 0 
A=l1901]> D=|o040]> L-5.1.9 4.1 
0010 0001 0 0-1 1 


Since detL3 = 5, the graph has five spanning trees. 
4. Minimal Spanning Trees 
Let G = (V, E, f) beaconnected weighted graph. A spanning tree H of G is called a minimum spanning 
tree if its total length f(H) is minimum: 
F(E) =} Fle). (5.324) 
ecH 
Minimum spanning trees are searched for, e.g., if the edge weights represent costs, and one is interested 
in minimum costs. A method to find a minimum spanning tree is the Kruskal algorithm: 
a) Choose an edge with the least weight. 
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b) Continue, as long as it is possible, choosing a further edge having least weight and not forming a 
cycle with the edges already chosen, and add such an edge to the tree. 

In step b) the choice of the admissible edges can be made easier by the following labeling algorithm: 

e Let the vertices of the graph be labeled pairwise differently. 

e At every step, an edge can be added only in the case that it connects vertices with different labels. 

e After adding an edge, the label of the endpoint with the larger label is changed to the value of the 
smaller endpoint label. 


5.8.4 Matchings 


1. Matchings 

A set M of edges of a graph G is called a matching in G, iff M contains no loop and two different edges 

of M do not have common endpoints. 

A matching M* of G is called a saturated matching, if there is no matching M in G such that M* C M. 

A matching M** of G is called a maximum matching, if there is no matching M in G such that |M| 7 

M**|. 

k M τ a matching of G such that every vertex of G is an endpoint of an edge of M, then M is called à 
1 perfect matching. 


6 5 WB In the graph in Fig. 5.55 Mi — ((2,3). (5,6]] is a saturated 
4 matching and M» — ((1,2), (3, 4), (5,6) ) is à maximum matching 
5 3 which is also perfect. 


Remark: In graphs with an odd number of edges there is no perfect 
Figure 5.55 matching. 


2. Theorem of Tutte 

Let q(G — S) denote the number of the components of G — $ with an odd number of vertices. A graph 
G = (V, E) has a perfect matching iff |V| is even and for every subset S of the vertex set q(G — S) < |S]. 
Here G — $ denotes the graph obtained from G by deleting the vertices of S and the edges incident with 
these vertices. 

Perfect machings exist for example in complete graphs with an even number of vertices, in complete 
bipartite graphs A, and in arbitrary regular bipartite graphs of degree r > 0. 

3. Alternating Paths 

Let G be a graph with a matching M. A path W in G is called an alternating path iff in W every edge 
e with e € M (or e € M) is followed by an edge e' with e' Z M (or e € M). 

An open alternating path is called an increasing path iff none of the endpoints of the path is incident 
with an edge of M. 


4. Theorem of Berge 

A matching M in a graph G is maximum iff there is no increasing alternating path in G. 

If W is an increasing alternating path in G with corresponding set E(W) of traversed edges, then 
M' — (MN E(W)) U (E(W) NV M) forms a matching in G with |M’| = |M| +1. 

Mi In the graph of Fig. 5.55 ({1, 2}, {2,3}, {3, 4}) is an increasing alternating path with respect to 
matching M;. Matching M» with |M»| — |Mi| 1- 1 is obtained as described above. 

5. Determination of Maximum Matchings 

Let G be a graph with a matching M. 

a) First form a saturated matching M* with M C M*. 

b) Chose a vertex v in G, which is not incident with an edge of M*, and determine an increasing 
alternating path in G starting at v. 

c) If such a path exists, then the method described above results in a matching M' with |M'| > |M*|. 
If there is no such path, then delete vertex v and all edges incident with v in G, and repeat step b). 





410 5. Algebra and Discrete Mathematics 





There is an algorithm of Edmonds, which is an effective method to search for maximum matchings, but 
it is rather complicated to describe (see [5.24]). 


5.8.5 Planar Graphs 
Here, the considerations are restricted to undirected graphs, since a directed graph is planar iff the 
corresponding undirected graph is a planar one. 


1. Planar Graph 
A graph is called a plane graph iff G can be drawn in the plane 
with its edges intersecting only in vertices of G. A graph iso- 
morphic with a plane graph is called a planar graph. 
Fig. 5.56 shows a plane graph G;. The graph G in Fig. 5.57 
is isomorphic {ο (αι, it is not a plane graph but a planar graph, 
since it is isomorphic with Gy. 
2. Non-Planar Graphs 

Figure 5.56 Figure 5.57 The complete graph A’; and the complete bipartite graph 133 
are non-planar graphs (see 5.8.1, 5., p. 402). 


3. Subdivisions 

A subdivision of a graph G is obtained if vertices 

of degree 2 are inserted into edges of G. Every 

graph is a subdivision of itself. Certain subdivi- 

sions of X5 and X5, are represented in Fig. 5.58 

and Fig. 5.59. 

4. Kuratowski's Theorem 

A graph is non-planar iff it contains a subgraph 

which is a subdivision either of the complete bipar- Figure 5.58 Figure 5.59 
tite graph 33 or of the complete graph Ks. 


5.8.6 Pathsin Directed Graphs 


1. ArcSequences 

A sequence F’ = (e1,€9,...,€s) of arcs in a directed graph is called a chain of length s, iff F does not 
contain any arc twice and one of the endpoints of every arc e; for à — 2,3,...,s — 1 is an endpoint of 
the arc e;_; and the other one an endpoint of e;41. 





A chain is called a directed chain iff for i = 1,2,...,s— 1 the terminal point of the arc e; coincides with 
the initial point of e;,4. 

Chains or directed chains traversing every vertex at most once are called elementary chains and ele- 
mentary directed chains, respectively. 

A closed chain is called a cycle. A closed directed path, with every vertex being the endpoint of exactly 
two arcs, is called a circuit. 

W Fig. 5.60 contains examples for various kinds of arc sequences. 


2. Connected and Strongly Connected Graphs 

A directed graph G is called connected iff for any two vertices there is a chain connecting these vertices. 
The graph G is called strongly connected iff to every two vertices v, w there is is assigned a directed 
chain connecting these vertices. 

3. Algorithm of Dantzig 

Let G — (V, E, f) be a weighted simple directed graph with f(e) > 0 for every arc e. The following 
algorithm yields all vertices of G, which are connected with a fixed vertex v by a directed chain, together 
with their distances from v: 

a) Vertex v; gets the label t(v;) — 0. Let $1 — (v1. 
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"E" UP 


chain directed elementary elementary directed cycle circuit 
chain chain chain 





Figure 5.60 
b) The set of the labeled vertices is Sin. 
c) If Un = {ele = (vi, vj) € E, vi € Sa, vj Z Sm} = Ú, then one finishes the algorithm. 
d) Otherwise one chooses an are e* = (x*, y*) with minimum t(2*) + f(e*). One labels e* and y* and 
puts t(y*) = t(a*) + f(e*) and also Sin41 = Sm U {y*} and repeats b) with m := m+ 1. 
(If all arcs have weight 1, then the length of a shortest directed chain from a vertex v to a vertex w can 
be found using the adjacency matrix (see 5.8.2.1, 4., p. 404)). 





If a vertex v of G is not labeled, then there is no di- 
rected path from v4 to v. 

If v has label t(v), then t(v) is the length of such a 
directed chain. A shortest directed path from v, to 
v can be found in the tree given by the labeled arcs 
and vertices, the distance tree with respect to vy. 
W in Fig. 5.61, the labeled arcs and vertices rep- 
resent the distance tree with respect to v, in the 
graph. The lengths of the shortest directed chains 
are: 


from v4 to vs : 2 fromu tovg: T 
from v4 to v; : 3 from v to vg : 7 
from v4 to U9 : 3 from vu tovs: 8 
from v to v2 : 4 from w tows: 8 
from v; to vo: 5 from v to uj: 9 
from v4 to v, : 6 from wu tovg: 10 
from v to vu: 6. 





Remark: There is also a modified algorithm to 
Figure 5.61 find the shortest directed chains in the case that 
G = (V, E, f) has arcs with negative weights. 


5.8.7 Transport Networks 
1. Transport Network 


A connected directed graph is called a transport network if it has two labeled vertices, called the source 
Q and sink S which have the following properties: 

a) There is an arc u; from S to Q, where wy is the only arc with initial point S and the only arc with 
terminal point Q. 

b) Every arc u; different from u4 is assigned a real number c(u;) 7 0. This number is called its capacity. 
The arc u; has capacity oo. 

A function y, which assigns a real number to every arc, is called a flow on G, if the equality 


YS guv)= YO φίυ,υ) (5.325a) 


(u,v)EG (v,w)EG 
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holds for every vertex v. The sum 


Y e(Qv) (5.325b) 
(Q,v)eG 

is called the intensity of the flow. A flow y is called compatible to the capacities, if for every arc u; of G 
0 € q(u;) € c(u;) holds. 
W For an example of a transport network see p. 412. 
2. Maximum Flow Algorithm of Ford and Fulkerson 
Using the maximum flow algorithm one can recognize whether a given flow y is maximal. 
Let G be a transport network and g a flow of intensity vı compatible with the capacities. The algorithm 
given below contains the following steps for labeling the vertices, and after finishing this procedure one 
can realize how much the intensity of the flow could be improved depending on the chosen labeling 
steps. 
a) One labels the source Q and sets e(Q) = oo. 
b) If there is an arc u; = (x, y) with labeled x and unlabeled y and (u;i) < c(u;), then one labels y and 
(x,y), and sets e(y) = min{e(x), c(u;) — y(ui)}, then one repeats step b), otherwise follows step c). 
c) If there is an arc u; = (x,y) with unlabeled x and labeled y, y(u;) > 0 and u; Æ u1, then one labels 
x and (x,y), substitutes e(x) = min{e(y), y(ui)} and returns to continue step b) if it is possible. 
Otherwise one finishs the algorithm. 
If the sink S of G is labeled, then the flow in G can be improved by an amount of e(S). If the sink is 
not labeled, then the flow is maximal. 


W Maximum flow: For the graph in Fig. 5.62 the weights are written next to the edges. A flow with 
intensity 13, compatible to these capacities, is represented in the weighted graph in Fig. 5.63. It isa 
maximum flow. 





Figure 5.62 


Wi Transport network: A product is produced 
by p firms Fi, Fz,..., Fp. There are q users 
γι, να,.... Μα. During a certain period there will be 
si units produced by F; and t; units required by V;. 
cij units can be transported from F; to Vj during 
the given period. Is it possible to satisfy all the re- 
quirements during this period? The corresponding 
graph is shown in Fig. 5.64. 





Fp 
Figure 5.64 
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5.9 Fuzzy Logic 


5.9.1 Basic Notions of Fuzzy Logic 
5.9.1.1 Interpretation of Fuzzy Sets 


Real situations are very often uncertain or vague in a number of ways. The word "fuzzy" also means 
some uncertainty, and the name of fuzzy logic is based on this meaning. Basically there are to distin- 
guish two types of fuzziness: vagueness and uncertainty. There are two concepts belonging here: The 
theory of fuzzy sets and the theory of fuzzy measure. In the following practice-oriented introduction 
the notions, methods, and concepts of fuzzy sets are discussed, which are the basic mathematical tools 
of multi-valued logic. 


1. Notions of Classical and Fuzzy Sets 

The classical notion of (crisp) set is two-valued, and the classical Boolean set algebra is isomorphic to 
two-valued propositional logic. Let X be a fundamental set named the universe. Then for every A C X 
there exists a function 


fa: X > {0,1}, (5.326a) 
such that it says for every x € X whether this element x belongs to the set A or not: 
fa(r)-1er€4A and fa(r)-0 e vg A. (5.326b) 


The concept of fuzzy sets is based on the idea of considering the membership of an element of the set as a 
statement, the truth value of which is characterized by a value from the interval [0, 1]. For mathematical 
modeling of a fuzzy set A a function is necessary whose range is the interval [0, 1] instead of {0,1}, i.e.: 
ua: X > [0,1]. (5.327) 
In other words: To every element x € X is to assign a number j14(x) from the interval [0, 1], which 
represents the grade of membership of z in A. The mapping ju, is called the membership function. The 
value of the function j14(a) at the point x is called the grade of membership. The fuzzy sets A, B,C, 
etc. over X are also called fuzzy subsets of X. The set of all fuzzy sets over X is denoted by F(X). 
2. Properties of Fuzzy Sets and Further Definitions 
The properties below follow directly from the definition: 
(E1) Crisp sets can be interpreted as fuzzy sets with grade of membership 0 and 1. 
(E2) The set of the arguments x, whose grade of membership is greater than zero, i.e., j14(%) > 0, is 
called the support of the fuzzy set A: 
supp(A) = {x € X | a(x) > 0}. (5.328) 
The set ker(A) — (x € X : a(x) = 1} is called the kernel or core of A. 
(E3) Two fuzzy sets A and B over the universe X are equal if the values of their membership functions 
are equal: 
A = B, if a(x) = up(x) holds for every x € X. (5.329) 
(E4) Discrete representation or ordered pair representation: If the universe X is finite, i.e., 
X= Un. X2,...,Xn} it is reasonable to define the membership Table 5.7 Tabular re 
function of the fuzzy set with a table of values. The tabular 
representation of the fuzzy set A is seen in Table 5.7. 


presentation of a 
fuzzy set 




















net } ET T2 send a | 
Also it is possible to write [ Malai) | alae) |... | naQa) | 
A := pa(ai)/t1 +--+ + pa(tn)/tn = ΣΟ μα(σι)/αι. (5.330) 
σι 


In (5.330) the fraction bars and addition signs have only symbolic meaning. 
(E5) Ultra-fuzzy set: A fuzzy set, whose membership function itself is a fuzzy set, is called, after Zadeh, 
an ultra-fuzzy set. 
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3. Fuzzy Linguistics 

Assigning linguistic values, e.g., “small”, “medium” or “big”, to a quantity then it is called a linguistic 
quantity or linguistic variable. Every linguistic value can be described by a fuzzy set, for example, by 
the graph of a membership function (5.9.1.2) with a given support (5.328). The number of fuzzy sets 
(in the case of “small”, “medium”, “big” they are three) depends on the problem. 

In 5.9.1.2 the linguistic variable is denoted by x. For example, x can have linguistic values for temper- 
ature, pressure, volume, frequency, velocity, brightness, age, wearing, etc., and also medical, electrical, 
chemical, ecological, etc. variables. 

W By the membership function j/4(x) of a linguistic variable, the membership degree of a fixed (crisp) 
value can be determined in the fuzzy set represented by a(x). Namely, the modeling of a “high” 
quantity, e.g., the temperature, as a linguistic variable given by a trapezoidal membership function 
(Fig. 5.65) means that the given temperature a belongs to the fuzzy set “high temperature” with the 
degree of membership £ (also degree of compatibility or degree of truth). 


5.9.1.2 Membership Functions on the Real Line 


The membership functions can be modeled by functions with values between 0 and 1. They represent 
the different grade of membership for the points of the universe being in the given set. 

1. Trapezoidal Membership Functions 

Trapezoidal membership functions are widespread. Piecewise (continuously differentiable) member- 
ship functions and their special cases, e.g., the triangle shape membership functions described in the 
following examples, are very often used. Connecting fuzzy quantities gives smoother output functions 
if the fuzzy quantities were represented by continuous or piecewise continuous membership functions. 





W A: Trapezoidal function (Fig. 5.65) corresponding to (5.331). The graph of this function 
turns into a triangle function 
μι) if a = a3 = a and αι < 
0 T <, a < a4. Choosing differ- 
1 i ent values for a4, ..., a4 gives 
EHE a < T< a, symmetrical or asymmetri- 
0.5 a2 — ay cal trapezoidal functions, a 
δι... μα(α) --4 1 az < x < ag , (5.331) symmetrical triangle func- 
tion (a2 = a3 = a and |a — 
C eg aı| = |a4 — a|) or asymmet- 
0 a® a, a a,x a4 — a3 : E rical triangle function (az = 
0 x > ag. az = aand |a — a4| Z |a4 — 

Figure 5.65 al). 


WB B: Membership function bounded to the left and to the right (Fig. 5.66) corresponding to (5.332): 











u(x) 1 A 
Ean eee ü3— c 
Qj « $t « 5, 
a2 — ay, 
μα(α) -- { 0 a < £ < ag, (5.332) 
T — a: 
2 a3 < T < Q4, 
0 a a, a; ay a4 — a3 
1 ας. 


Figure 5.66 
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W C: Generalized trapezoidal function (Fig. 5.67) corresponding to (5.333). 








0 £ < a, 
NS τ. Qi < T < Q2, 
ag — ay 
1} b,=b a E 
n ' ' tbs = bye = Ba} iu ag < X X da, 
i ! a3 — a2 
b, | ' μαία) = bg = bg = 1 a3 < T < 4, (5.333) 
Eod od b, — b hae 
" (b= P)4= 2) cras 
0 a, a, a; ay as 5 — 04 
bs(ag — x) - 
Figure 5.67 gc dg. a5 < T < 6, 
0 ας στ. 
2. Bell-Shaped Membership Functions 0 ae 


W A: A class of bell-shaped, differentiable membership 

functions is given by the function f(x) from (5.334) by f(x) = e- Mrz) a « x « b, (5.334) 
choosing an appropriate p(x): 

For p(x) = k(x — a)(b — x) and, e.g., k = 10 or k = 1 or 0 x 2 b. 

k; = 0.1, there is a family of symmetrical curves of different 


width with the membership function jr4(x) = f(x) N C Ἴ where i/ f Cz *) is the normal- 


izing factor (Fig. 5.68). The exterior curve follows with the value k = 10 and the interior one with 
k — 0.1. 

Asymmetrical membership functions in [0,1] follow e.g. for p(w) = «(1 — x)(2 — 2) or for p(x) = 
z(1— x)(x 4-1) (Fig. 5.69), using appropriate normalizing factors. The factor (2— Ὁ) in the first poly- 
nomial results in the shifting of the maximum to the left and it yields an asymmetrical curve shape. 
Similarly, the factor (a + 1) in the second polynomial results in a shifting to the right and in an asym- 
metric form. 


i MAGO i w) 
05 0.5 
0 
a agh bx 06 0.5 1 x 


Figure 5.68 Figure 5.69 
W B: A more flexible class of membership functions can be got by the formula 
a A fH) f (t(u)) du 
(5.335) 


= f(t ) du 
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where f is defined by (5.334) with p(x) = (x — a)(b — x) and t is a transformation on [a,b]. If t is 
a smooth transformation on |a, b], i.e., if t is differentiable infinitely many times in the interval [a, b] , 
then F; is also smooth, since f is smooth. Requiring t to be either increasing or decreasing and to be 
smooth, then the transformation t allows to change the shape of the curve of the membership function. 
In practice, polynomials are especially suitable for transformations. The simplest polynomial is the 
identity t(x) = x on the interval [a,b] = [0, 1]. 


The next simplest polynomial with the given properties is t(x) = — 2er? + cr? + (1 — 5) x with a 


constant c € [—6, 3]. The choice c = —6 results in the polynomial of maximum curvature, its equation 
is q(x) = 4x3 — 6x? + 3x. Choosing for go the identity function, ie., go(x) = x, then can be got 
recursively further polynomials q by the formula g; = qoq_1 fori € IN. Substituting the corresponding 
polynomial transformations qo, q1, ... into (5.335) for t£, gives a sequence of smooth functions F5, F;, 
and F,, (Fig. 5.70), which can be considered as membership functions j14(x), where Fy, converges to 
aline. The trapezoidal membership function can be approximated by differentiable functions using the 


function Fy, , its reflection and a horizontal line (Fig. 5.71). 


Μα(χ) Hal) 
1 1 
0.5 0.5 
0 0 
0 025 05 075 1x 0 025 05 075 1x 


Figure 5.70 Figure 5.71 
Summary: Imprecise and non-crisp information can be described by fuzzy sets and represented by 
membership functions u(x). 


5.9.1.3 Fuzzy Sets 
1. Empty and Universal Fuzzy Sets 
a) Empty fuzzy set: A set A over X is called empty if wa(x) — 0 Vx € X holds. 
b) Universal fuzzy set: A set is called universal if a(x) = 1 Yx € X holds. 
2. Fuzzy Subset 
If ug(v) € a(x) V x € X, then B is called a fuzzy subset of A (one writes: B C A). 
3. Tolerance Interval and Spread of a Fuzzy Set on the Real Line 
If A is a fuzzy set on the real line, then the interval 

[a,0] 2 (x € X|ua(x) — 19. (a, b const, a « b) (5.336) 
is called the tolerance interval of the fuzzy set A, and the interval [c, d] — cl(suppA) (c. d const,c « d) 
is called the spread of A, where cl denotes the closure of the set. (The tolerance interval is sometimes 
also called the peak of set A.) The tolerance interval and the kernel coincide only if the kernel contains 
more then one point. 
W A: In Fig. 5.65 [a, as] is the tolerance interval, and [a, a4] is the spread. 
E B: a = az — a (Fig. 5.65), gives a triangle-shaped membership function p. In that case the 
triangular fuzzy set has no tolerance, but its kernel is the set {a}. If additionally ay = a = ag holds, 
too, then a crisp value follows; it is called a singleton. A singleton A has no tolerance, but ker(A) = 
supp(A) = {a}. 
4. Conversion of Fuzzy Sets on a Continuous and Discrete Universe 
Let the universe be continuous, and let a fuzzy set be given on it by its membership function. Discretiz- 
ing the universe, every discrete point together with its membership value determines a fuzzy singleton. 
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Conversely, a fuzzy set given on a discrete universe can be converted into a fuzzy set on the continuous 
universe by interpolating the membership value between the discrete points of the universe. 
5. Normal and Subnormal Fuzzy Sets 
If A is a fuzzy subset of X, then its height is defined by 

H(A) := max{pa(x)|a € X}. (5.337) 
Ais called a normal fuzzy set if H(A) = 1, otherwise it is subnormal. 
The notions and methods represented in this paragraph are limited to normal fuzzy sets, but it easy to 
extend them also to subnormal fuzzy sets. 


6. Cut of a Fuzzy Set 


"The a cut A?? or the strong o. cut A?* of a fuzzy set A are the subsets of X defined by 


A?^* — (x € X|ua(x) 7 aj, A?^ — (x € X|ua(z) 2 al a € (0,1]. (5.338) 
and A79 — cl(A?9). The a cut and strong o cut are also called a-level set and strong a-level set, 
respectively. 


1. Properties 

a) The a cuts of fuzzy sets are crisp sets. 

b) The support supp(A) is a special a cut: supp(A) = A??. 

c) The crisp 1 cut A?! — {x € X|14(x) = 1} is called the kernel of A. 

2. Representation Theorem 

To every fuzzy subset A of X can be assigned uniquely the families of its a cuts (A??),et,) and its 


strong a cuts (429) The a cuts and strong a cuts are monotone families of subsets from X, 


a€(0,1]* 
since: 

a « B — A?* 2 A7? and. A?? D A?£, (5.3392) 
Conversely, if there exist the monotone families (Ua) acjo,1) OT (Va)ac(a,1] Of Subsets from X, then there 
are uniquely defined fuzzy sets U and V such that U>* = Uy and V2% = Va and moreover 

uulx) = sup{a € [0,1))|a € Ug}, Lv (x) = supf{a € (0,1]|x € V4). (5.339b) 
7. Similarity of the Fuzzy Sets A and B 
1. The fuzzy sets A, B with membership functions u4, 4p: X — [0,1] are called fuzzy similar if for 
every a € (0, 1] there exist numbers o; with a; € (0, 1]; (; — 1, 2) such that: 

supp(oi44)a C supp(up)a, ^ supp(aoun)a € supp(uA)a. (5.340) 


TOR ; -- : _ f olz) if polz) > a 
(ic) represents a fuzzy set with the membership function (uc)s = { 0 thera and (Buo) 
B i uc(z) > B 


represents a fuzzy set with the membership function (buo) = { Ü otherwise 


2. Theorem: Two fuzzy sets A, B with membership functions 44, 4p: X — [0,1] are fuzzy-similar if 
they have the same kernel: 


σαργίμα)ι -- «αρρ(μα)ι, (5.341a) 
since the kernel is equal to the 1 cut, i.e. 
supp(ua) — (x € X|nua(x) = 1}. (5.341b) 


3. A, B with ua, ug: X —> [0,1] are called strongly fuzzy-similar if they have the same support and 
the same kernel: 





418 5. Algebra and Discrete Mathematics 





supp(ua): — δαρρίµβ)ι, (5.342a) supp(/14)o = supp(ug)o. (5.342b) 


6.9.2 Connections (Aggregations) of Fuzzy Sets 
Fuzzy sets can be aggregated by operators. There are several different suggestions of how to generalize 
the usual set operations, such as union, intersection, and complement. of fuzzy sets. 


5.9.2.1 Concepts for Aggregations of Fuzzy Sets 


1. Fuzzy Set Union, Fuzzy Set Intersection 

The grade of membership of an arbitrary element « € X in the sets AU B and AN B should depend 
only on the grades of membership j4(x) and jug(x) of the element in the two fuzzy sets A and B. The 
union and intersection of fuzzy sets is defined with the help of two functions 


s,t: [0,1] x [0, 1] — [0, 1], (5.343) 
and they are defined in the following way: 
Iaup(c) :— s (pa(x), up(x)) , (5.344) Hans(«) :— t (pa(x), up(x)). (5.345) 


The grades of membership j1.4() and jig(x) are mapped in a new grade of membership. The functions 
t and s are called the ¢ norm and t conorm; this last one is also called the s norm. 
Interpretation: The functions jJjayg and Jang represent the truth values of membership, which is 
resulted by the aggregation of the truth values of memberships ja(r) and jip(x). 
2. Definition of the tNorm: 
The t norm is a binary operation t in [0, 1]: 

t: [0, 1] x [0,1] — [0,1]. (5.346 
It is symmetric, associative, monotone increasing, it has 0 as the zero element and 1 as the neutral 
element. For z, y, z, v, w € [0,1] the following properties are valid: 


(E1) Commutativity: t(r,y) — t(y, x). (5.342. 
(E2) Associativity: t(r,t(y. z)) — t(t(x, y), z). (5.347b 
(E3) Special Operations with Neutral and Zero Elements: 

t(x,1) = xand because of (E1): t(1,r)— x; t(v,0) — t(0, x) = 0. (5.347¢ 
(E4) Monotony: Ifx<vandy<w, then t(x,y) € t(v, w) is valid. (5.347d 


3. Definition of the sNorm: 


The s norm is a binary function in [0, 1]: 


s: [0,1] x [0,1] > (0, 1]. (5.348 
It has the following properties: 
(E1) Commutativity: s(v,y)-— s(y,). (5.349a 
(E2) Associativity: s(z, s(y, z)) — s(s(v, y), z). (5.349b 
(E3) Special Operations with Zero and Neutral Elements: 

s(r,0) 2 s(0,x) — v; s(x, 1) — s(1, v) — 1. (5.349c 
(E4) Monotony: lIfz X vand y € w, then s(r,y) < s(v,w) is valid. (5.349d 


With the help of these properties a class T' of t norms and a class S of s norms can be introduced. 
Detailed investigations proved that the following relations hold: 
min(z,y) 2 t(z, y) Vt € T, Vr,y € [0,1] and (5.349e 


max(z,y) € s(z,y) Vs € S, Vz,y € [0,1]. (5.349f 
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5.9.2.2 Practical Aggregation Operations of Fuzzy Sets 
1. Intersection of Two Fuzzy Sets 


The intersection AN B of two fuzzy sets A and B is defined by the minimum operation min(.,.) on 
their membership functions j14(2) and g(x). Based on the previous requirements there is: 


C :— An B and uic(x) :- min(ua(x),upg(x) | Vx € X, where: (5.3502) 
p _ fa, ifa<b, 
min(a, b) :— ts ifa» b. (5.350b) 


The intersection operation corresponds to the AND operation of two membership functions (Fig.5.72). 
The membership function uo(x) is defined as the minimum value of jj4(x) and μμ(α). 

2. Union of Two Fuzzy Sets 

The union AU B of two fuzzy sets is defined by the maximum operation max(., .) on their membership 
functions a(x) and μμ(α): 


C :— AU B and jic(x) :— max(ua(r),ug(v) | Vv € X, where: (5.351a) 
ifa >b, 
max(a, b) := { e if ee (5.351b) 


The union corresponds to the logical OR operation. Fig.5.73 illustrates j;jc(x) as the maximum value 
of the membership functions j1.4(a) and jup(«). 

WB Thetnorm t(z, y) — min(z, y] and the s norm s(z, y) — max(z, y) define the intersection and the 
union of two fuzzy sets, respectively (see (Fig.5.74) and (Fig.5.75)). 


u(x) uO) 
1 Hao) 1 μείχ) 
0 x 0 X 
Figure 5.72 Figure 5.73 





Figure 5.74 Figure 5.75 


3. Further Aggregations 
Further aggregations are the bounded, the algebraic, and the drastic sum and also the bounded difference, 
the algebraic and the drastic product (see Table 5.8). 
The algebraic sum, e.g., is defined by 

C :— A + B and uc(x) :— pa(x) + pB(£)— a(x): p(x) for every x E€ X. (5.352a) 
Similarly to the union (5.351a,b), this sum also belongs to the class of s norms. They are included in 
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Table 5.8 ¢ and s norms, p E€ R 



























































Author t norm s norm 
Zadeh intersection: t(x, y) — min(z, y) union: s(x, y) = max{x, y} 
Lukasiewicz | bounded difference bounded sum 
tolx, y) = max{0,x + y— 1} s(x, y) = min(1,x +y} 
algebraic product algebraic sum 
talz, y) = xy Salz, yY) = £ +y -— Ty 
drastic product drastic sum 
min(z, y), whether x — 1 maxíz, y), whether z = 0 
tay(v,y) — ory=1 Sas(v, y) — ory =0 
0 otherwise 1 otherwise 
Hamacher 
ry t+y-= gy- (l= p)ty 
ρ3 tals, y) Ξ---------------- Sy(z,y) = —— =m 
rey η pc Q-p)zty- zy) ue 1- (1— py 
P i vy 29 
Finste ἱε(α,ῃ) ------------- Sm, y) = 
instein (x,y) T+ a=») Se(x, y) Ey 
Frank tg(m,y) — sr(m,y) — 1— 
nt — 1)(p¥ — 1 pl-* — 1)(pl-9 — 1 
es Quel) do h DP | lo, h VG Xp | 
p-l p-1 
Yager tya(z, y) = 1- Sya(X, y) = min (1. (a? + y?) 
(p » 0) min (1. ((1— x) --(1— yy") 
Schweizer t,(r,y) — max(0, x^? 4- y^? — 1) VP ss(r,y) — 1— 
(p » 0) max (0, (1 — x)? + (1 — y)? — 1) V» 
Dombi tay (, y) = Sgo(v, y) — 1— 
[p i Dp 
1=2\? 1-y\? ^ g A? y 
; DE —— 1 - 
Ὁ | ( a y+ ( y ) T (τι) l-y 
Weber tur, y) = max(0, (1 + p) Sw(x,y) = min(1, £ + y+ pry) 
(pz -1) (zy - 1)- pzy) 
Dubois tau (t, y) — 2y. Saul£, y) = 


max(z, y, p) 
z-y-—cy-— min(z,y, (1— p)) 
max((1— x), (1 — y). p) 
Remark: For the values of the t and s norms listed in the table, the following ordering is valid: 
tap Sth Ste Sta & ty € t X S € Sp € Sa € Se X Sy < Sds- 


(0€ p € 1) 


























the right-hand column of Table 5.8. In Table 5.9 is given a comparision of operations in Boolean logic 
and fuzzy logic. 
Analogously to the notion of the extended sum as a union operation, the intersection can also be ex- 
tended for example by the bounded, the algebraic, and the drastic product. So, e.g., the algebraic 
product is defined in the following way: 

C :— A- B and pe(x) := a(x) - a(x) for every x € X. (5.352b) 
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It also belongs to the class of t norms, similarly to the intersection (5.350a,b), and it can be found in 
the middle column of Table 5.8. 


5.9.2.3. Compensatory Operators 
Sometimes operators are necessary lying between the t and the s norms; they are called compensatory 
operators. Examples for compensatory operators are the lambda and the gamma operator. 
1. Lambda Operator 

pasle) = λ]μλ(α)με(α)] -- (1— A) [μαία) -- μπα) - μα(α)μαίω)] with λε [0,1]. (5.859) 
Case à = 0: Equation (5.353) results in a form known as the algebraic sum (Table 5.8, s norms); it 
belongs to the OR operators. 
Case \ = 1: Equation (5.353) results in the form known as the algebraic product (Table 5.8, t norms); 
it belongs to the AND operators. 
2. Gamma Operator 

Haya () = (wa()ua(@))7 [1 — (1—pa(e)) (1—pa(2))]” with y € [0,1]. (5.354) 
Case y= 1: Equation (5.354) results in the representation of the algebraic sum. 
Case y= 0: Equation (5.354) results in the representation of the algebraic product. 
The application of the gamma operator on fuzzy sets of any numbers is given by 


n 1-7 n Ύ 
(a) = |f μα) h -ᾖα- μίο)) | (5.355) 
i=1 i=1 
and with weights 5;: 
n 1-y n y n 
μία) = (i cy h -Πι- iso withae X, 4-1, vye[01] (5.356) 
il il i=l 


5.9.2.4 Extension Principle 

In the previous paragraph there are discussed the possibilities of generalizing the basic set operations 
for fuzzy sets. Now, the notion of mapping is extended on fuzzy domains. The basis of the concept is the 
acceptance grade of vague statements. The classical mapping 6: X" — Y assigns a crisp function value 
Φίσι,..., τμ) € Y tothe point (z1,...,7,) € X". This mapping can be extended for fuzzy variables as 
follows: The fuzzy mapping is Ê: F(X)^ — F(Y), which assigns a fuzzy function value d(u,, . . . , Hn) 
to the fuzzy vector variables (x1, .. . , &n) given by the membership functions (j1,..., 44) € F(X)". 


5.9.2.5 Fuzzy Complement 


A function c: [0,1] — [0, 1] is called a complement function if the following properties are fulfilled for 
V x, y € [0,1]: 


(EK1) Boundary Conditions: c(0) — 1 and c(1) = 0. (5.3572) 
(EK2) Monotony: πα ο» (5.357b) 
(EK3) Involutivity: c(c(r)) — x. (5.357c) 
(EK4) Continuity: c(x) should be continuous for every x € [0, 1]. (5.357d) 


W A: The most often used complement function is (continuous and involutive): 

c(z) :21— c. (5.358) 
ΒΒ: Other continuous and involutive complements are the Sugeno complement cy(x) :— (1— x)(1 + 
Ax)~1 with \ € (—1, 00) and the Yager complement cp(x) :— (1 — a?)"? with p € (0, oo). 
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Table 5.9 Comparison of operations in Boolean logic and in fuzzy logic 








Operator | Boolean logic | Fuzzy logic (ua, up € [0, 1]) 

AND C=AAB lang = min(ua, HB) 

OR C=AVB Laug = πιαχίµαΑ, μη) 

ΝΟΤ C=-7A uS — 1— pA (uS as complement of u4) 














5.9.3 Fuzzy-Valued Relations 
5.9.3.1 Fuzzy Relations 
1. Modeling Fuzzy-Valued Relations 


Uncertain or fuzzy-valued relations, as e.g. “approximately equal”, “practically larger than", or ^prac- 

y , 8 PP A 3 8 ; p: 
tically smaller than" , etc., have an important role in practical applications. A relation between numbers 
is interpreted as a subsets of R?. So, the equality “=” is defined as the set 


ΠΩΣ (5.359) 


i.e., by a straight line y = x in R°. 

Modeling the relation “approximately equal” denoted by Hj, can be used a fuzzy subset on R?, the 

kernel of which is A. Furthermore it is to require that the membership function should decrease and 

tend to zero getting far from the line A. A linear decreasing membership function can be modeled by 
Lg, (x,y) —- max(0,1— a|r — y]) with a€ R, a» 0. (5.360) 

For modeling the relation Rz “practically larger than”, it is useful to start with the crisp relation “>”. 

The corresponding set of values is given by 


{αι ν) c R?|z « y) i (5.361) 
It describes the crisp domain above the line x = y. 
The modifier “practically” means that a thin zone under the half-space in (5.361) is still acceptable 
with some grade. So, the model of Rg is 


max{0, 1 — aļx — y|} fory< x 


Lr, (X,Y) = { 1 br z} with a€ R, a>0. (5.362) 


If the value of one of the variables is fixed, e.g., y = yo, then Rə can be interpreted as a region with 

uncertain boundaries for the other variable. 

Handling the uncertain boundaries by fuzzy relations has practical importance in fuzzy optimization, 

qualitative data analysis and pattern classification. 

The foregoing discussion shows that the concept of fuzzy relations, i.e., fuzzy relations between several 

objects, can be described by fuzzy sets. In the following section the basic properties of binary relations 

are discussed over a universe which consists of ordered pairs. 

2. Cartesian Product 

Let X and Y be two universes. Their “cross product” X x Y, or Cartesian product, is a universe G: 
G=XxY={(z,y)|ce XAYyEY}. (5.363) 

Then, a fuzzy set on G is a fuzzy relation, analogously to classical set theory, if it consists of the valued 

pair of universes X and Y. A fuzzy relation R in G is a fuzzy subset R € F(G), where F(G) denotes the 

set of all the fuzzy sets over X x Y. R can be given by a membership function pir(x, y) which assigns 

a membership degree jip(a, y) from [0,1] to every element of (a, y) € G. 

3. Properties of Fuzzy-Valued Relations 

(E1) Since the fuzzy relations are special fuzzy sets, all propositions stated for fuzzy sets will also be 

valid for fuzzy relations. 

(E2) All aggregations defined for fuzzy sets can be defined also for fuzzy relations; they yield a fuzzy 
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relation again. 
(E3) The notion of a cut defined above can be transmitted without difficulties to fuzzy relations. 
(E4) The 0 cut (the closure of the support) of a fuzzy relation R € F(G) is a usual relation on G. 
(E5) Denoting the membership value by ug(z, y), i.e., the degree by which the relation R between the 
pair (x,y) holds. The value ug(x, y) — 1 means that Ft holds perfectly for the pair (x, y), and the value 
In(v, y) — 0 means that R does not at all hold for the pair (a, y). 
(E6) Let R € F(G) be a fuzzy relation. Then the fuzzy relation S :— R1, the inverse of R, is defined 
by 

Ls(x.y) — un(y. x) for every (x,y) € G. (5.364) 
Bl The inverse relation R7 ! means "practically smaller than” (see 5.9.3.1, 1., p. 422); theunion Ut; 1 
can be determined as “practically smaller or approximately equal". 
4. n-Fold Cartesian Product 
Let n be the number of universal sets. Their cross product is an n-fold Cartesian product. A fuzzy set 
on an n-fold Cartesian product represents an n-fold fuzzy relation. 
Consequences: The fuzzy sets, considered until now, are unary fuzzy relations, i.e., in the sense of the 
analysis they are curves above a universal set. A binary fuzzy relation can be considered as a surface 
over the universal set G. A binary fuzzy relation on a finite discrete support can be represented by a 
fuzzy relation matrix. 
W Colour-ripe grade relation: The well-known correspondence between the colour x and the ripe grade 
y of a friut is modeled in the form of a binary relation matrix with elements {0, 1}. The possible colours 
are X = {green, yellow, red} and the ripe grades are Y = {unripe, half-ripe, ripe}. The relation 
matrix (5.365) belongs to the table: 


unripe 






half-ripe ripe 





green | 1 5 0 : E 
yellow | 0 1 0 mme (5.365) 
red 0 0 1 


Interpretation of this relation matrix: IF a fruit is green, THEN it is unripe. IF a fruit is yellow, 
'THEN it is half-ripe. IF a fruit is red, THEN it is ripe. Green is uniquely assigned to unripe, yellow to 
half-ripe and red to ripe. If beyond it should be formalized that a green fruit can be considered half-ripe 
in a certain percentage, then the following table with discrete membership values can be arranged: 


Hr (green, unripe) --1.0, pp (green, half-ripe) = 0.5, The relation matrix with ug € [0, 1] 
Lr (green, ripe) = 0.0, jor (yellow, unripe) =0.25, is: 

Lr (yellow, halfripe) = 1.0, jue (yellow, ripe) = 0.25, 1.0 0.5 0.0 

UR (red, unripe) =0.0, pR (red, half-ripe) = 0.5, R= | 0.25 1.0 0.25]. (5.366) 
Lr (red, ripe) e. 0.0 0.5 1.0 


5. Rules of Calculations 
The AND-type aggregation of fuzzy sets, e.g. pı: X — [0,1] and u2: Y — [0,1] given on different 
universes is formulated by the min operation as follows: 
Ig(x,y) — min(pa (x), uo(y)) or (pi x pa)(x, y) = min(ja(x), uo(y)) with (5.3672) 
Ia X u2: G > [0,1], where G = X xY. (5.367b) 
The result of this aggregation is a fuzzy relation R on the cross product set (Cartesian product universe 


of fuzzy sets) G with (x, y) € G. If X and Y are discrete finite sets and so 4 (x), ui (y) can be represented 
as vectors, then holds: 


Ia X Jio — pu 0 ul and  uga(r,y):— ug(y,v) V(a,y) eG. (5.368) 
The aggregation operator o does not denote here the usual matrix product. The product is calculated 
here by the componentwise min operation and addition by the componentwise max operation. 
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The validity grade of an inverse relation R~! for the pair (x,y) is always equal to the validity grade of 
R for the pair (y, x). 

If the fuzzy relations are given on the same Cartesian product universe, then the rules of their aggre- 
gations can be given as follows: Let R1, Ho: X x Y — [0,1] be binary fuzzy relations. The evaluation 
rule of their AND-type aggregation uses the min operator, namely for V(x, y) € G: 


Inn; y) 7 min(um, (v. y), pm; (v. Ἡ)). (5.369) 
A corresponding evaluation rule for the OR-type aggregation is given by the max operation: 
μπιυπι(α, ἡ) ^ max(un, (2, y). uns (v; v)). (5.370) 


5.9.3.2. Fuzzy Product Relation R o S 


1. Composition or Product Relation 
Suppose R € F(X x Y) and S € F(Y x Z) are two relations, and it is additionally assumed that 
R,S € F(G) with G C X x Z. Then the composition or the fuzzy product relation Ro S is: 

Ros (x, 2) :— supyey (min(un(a, v), us(y, z))) V(v,2) € X x Z. (5.371) 
If a matrix representation is used for a finite universal set analogously to (5.366), then the composition 
Ro S is motivated as follows: Let X — (zi,...,za4], Y — (yi... Z — {a,...,a} and R € 
F(X xY), S € F(Y x Z) and let the matrix representations R, S be in the form R = (rij) and 
S = (sje) fori =1,...,n;7 =1,...,m;k =1,...,l, where 


Tij = Hr(Ti Yj) and δι Ξ nus(yj, zx). (5.372) 
If the composition T = Ro S has the matrix representation t;,, then 
ty — sup min(rij, 55,). (5.373) 
j 


The final result is not a usual matrix product, since instead of the summation operation there is the 
least upper bound (supremum) operation and instead of the product there is the minimum operator. 
B With the representations for rj; and sj, and with (5.371), the inverse relation R^! (rj), can also 
be computed taking into consideration that R-! can be represented by the transpose matrix, i.e., 
RA = (rij). 

Interpretation: Let R be a relation from X to Y and S bea relation from Y to Z. Then the following 
compositions are possible: 

a) Ifthe composition Ro S of Rand S is defined as a max-min product, then the resulted fuzzy compo- 
sition is called a max-min composition. The symbol sup stands for supremum and denotes the largest 
value, if no maximum exists. 

b) If the product composition is defined as the usual matrix multiplication, then the max-prod com- 
position is obtained. 

c) For max-average composition, “multiplication” is replaced by the average. 

2. Rules of Composition 

The following rules are valid for the composition of fuzzy relations R,S,T € F(G): 

(E1) Associative Law: 


(RoS)oT — Ro(SoT). (5.374) 
(E2) Distributive Law for Composition with Respect to the Union: 

Ro(SUT) —(RoS)U(RoT). (5.375) 
(E3) Distributive Law in a Weaker Form for Composition with Respect to Intersection: 

Ro(SnT)C(RoS)n(RoT). (5.376) 


(E4) Inverse Operations: 
(RoS)!—SoR!, (RUS)! —R "US" and (RASJ!=R NS. (5.377) 
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(E5) Complement and Inverse: 


- - σ 
(απ, (19) 1)”. (5.378) 
(E6) Monotonic Properties: 
RCS-—RoTCSoT undToRCToS. (5.379) 


W A: Equation (5.371) for the product relation £o S is defined by the min operation as we have done 
for intersection formation. In general, any t norm can be used instead of the min operation. 

W B: Theo cuts with respect to the union, intersection, and complement are: (AUB)?® = A?*UB?*, 
(AN B)?* = A?* 7M B®, (A49)?» « ASi-* — (x € X|ua(z) X 1— aj). Corresponding statements 
are valid for strong o cuts. 

3. Fuzzy Logical Inferences 

It is possible to make a fuzzy inference, e.g., with the IF THEN rule by the composition rule jjj — j44 o R. 
The detailed formulation for the conclusion jz is given by 


is) — maxsex (min(qu(r), pa(r. 9) (5.380) 
with y € Y, m: X > [0,1], #2: Y > [0,1], R: G > [0, 1] und G = X x Y. 


5.9.4 Fuzzy Inference (Approximate Reasoning) 
Fuzzy inferenceis an application of fuzzy relations with the goal of getting fuzzy logical conclusions with 
respect to vague information (see 5.9.6.3, p. 428). Vague information means here fuzzy information 
but not uncertain information. Fuzzy inference, also called implication, contains one or more rules, a 
fact and a consequence. Fuzzy inference, which is called by Zadeh, approximate reasoning, cannot be 
described by classical logic. 
1. Fuzzy Implication, IF THEN Rule 
The fuzzy implication contains one IF THEN rule in the simplest case. The IF part is called the premise 
and it represents the condition. The THEN part is the conclusion. Evaluation happens by jig = p0 R 
and (5.380). 
Interpretation: jg is the fuzzy inference image of jj; under the fuzzy relation R, i.e., a calculation 
prescription for the IF THEN rule or for a group of rules. 
2. Generalized Fuzzy Inference Scheme 
The rule IF A; AND A45 AND A3... AND A, THEN B with A;: μι: X; — [0,1] (i — 1,2,..., n) and 
the membership function of the conclusion B: p: Y — [0,1] is described by an (n 4- 1)-valued relation 
R: XQ x Xo x X4 x Y > [0,1]. (5.381a) 
For the actual input with crisp values 21,25, ..., 77, the rule (5.381a) defines the actual fuzzy output 
by 
ng) — na(z,x5,...,a,, y) — min(ji(,), uo(25), ..., ps (5), up(y)) where y € Y. (5.381b) 
Remark: The quantity min(ga (11), 12 (25), ... 15 (v7,)) is called the degree of fulfillment, and the 
quantities (43 (21); ua(25), ... , ps (27,)) represent the fuzzy-valued input quantities. 


W Forming the fuzzy relations for a connection between the quantities “medium” pressure and “high” 
temperature (Fig. 5.76): (p, T) = ja(p) VT € Xa with ja: X1 — [0,1] is a cylindrical extension 
(Fig. 5.76c) of the fuzzy set medium pressure (Fig. 5.76a). Analogously, fi2(p,T) = p(T) Vp € 
X, with pig: X_ — [0,1] is a cylindrical extension (Fig. 5.76d) of the fuzzy set high temperature 
(Fig. 5.76b), where Γι, μ3: G= X) Xx Χο-» (0, 1]. 

Fig. 5.77a shows the graphic result of the formation of fuzzy relations: In Fig. 5.77 b the result of the 
composition medium pressure AND high temperature with the min operator ug(p, T) — min(ja(p), 
ji2(T)) is represented, and (Fig. 5.77b) shows the result of the composition OR with the max operator 
Hr(p, 1) = max(pus(p), us (1). 
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Figure 5.77 
5.9.5 Defuzzification Methods 
One has to get a crisp set from a fuzz-valued set in many cases. This process is called defuzzification. 
There are different methods available for this task. 
1. Maximum-Criterion Method 


An arbitrary value 7 € Y is selected from the domain where the fuzzy set pone has the maximal 
membership degree. 


2. Mean-of-Maximum Method (MOM) 


The output value is the mean value of the maximal membership values: 


sup (nas d ) = y dy 
d yesup(ug Ps, ) EM 
tu € Υ]μαν (V μας. FU "vy e Yy; (5.382) TIMOM — een) .(5.383) 
i.e., the set Y is an interval, which should not be empty and ο Output) dy 
it is characterized by (5.382), from which follows (5.383). "S 
3. Center of Gravity Method (COG) m n(y)y dy 
In the center of gravity method, one takes the abscissa — Ying ` 5.384 
ane s eps COG ‘Ysup ἃ (5. ) 
value of the center of gravity of a surface with a fictitious f u(y) dy 
homogeneous density of value 1. Yint  " 
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4. Parametrized Center of Gravity Method Ysup 


(PCOG) N μ(υ)ν ἀν 
The parametrized method works with the exponent y € R. mooo = ha ~ sup (5.385) 
From (5.385) it follows for; — 1, coc — rjcoc and for JE noy dy y) Ydy 
770, npcoc = mom- "m 
5. Generalized Center of Gravity Method 
(GCOG) 
The exponent y is considered as a function of y in the Yesup 
PCOG method. Then (5.386) follows obviously. The u(yy 9? y dy 
GCOG method is a generalization of the PCOG method, "Gcoa — XL. (5.386) 
where u(y) can be changed by the special weight y depend- f μι dy 
Vinf 


ing itself on y. 


6. Center of Area (COA) Method 

One calculates a line parallel to the ordinate axis so that a najas [ sup u(y) dy. (5.387) 
the area under the membership function is the same on the Yinf n 

left- and on the right-hand side of it. 


7. Parametrized Center of Area (PCOA) Ydy = [uau dy. (5.388 
Method / u(y)” dy |. u(y)" dy. (5.388) 


8. Method of the Largest Area (LA) 


The significant subset is selected and one of the methods defined above, e.g., the method of center of 
gravity (COG) or center of area (COA) is used for this subset. 


5.9.6 Knowledge-Based Fuzzy Systems 


There are several application possibilities of multi-valued fuzzy logic, based on the unit interval, both 
in technical and non-technical life. The general concept consists in the fuzzification of quantities and 
characteristic numbers, in the aggregation them in an appropriate knowledge base with operators, and 
if necessary, in the defuzzification of the possibly fuzzy result set. 


5.9.6.1 Method of Mamdani 


The following steps are applied for a fuzzy control process: 
1. Rule Base Suppose, for example, for the i-th rule 

Rí: If eis E! ANDéis AE! THEN u is U*. (5.389) 
Here e characterizes the error, é the change of the error and u the change of the (not fuzzy valued) output 
value. Every quantity is defined on its domain E, AFE and U. Let the entire domain be E x AE x U. 
The error and the change of the error will be fuzzified on this domain, i.e., they will be represented by 
fuzzy sets, where linguistic description is used. 
2. Fuzzifying Algorithm In general, the error e and its change é are not fuzzy-valued, so they must 
be fuzzified by a linguistic description. The fuzzy values will be compared with the premisses of the 
IF THEN rule from the rule base. From this it follows, which rules are active and how large are their 
weights. 
3. Aggregation Module The active rules with their different weights will be combined with an 
algebraic operation and applied to the defuzzification. 
4. Decision Module In the defuzzification process a crisp value should be given for the control 
quantity. With a defuzzification operation, a non-fuzzy-valued quantity is determined from the set of 
possible values, i.e., a crisp quantity. This quantity expresses how the control parameters of the system 
should be set up to keep the deviation minimal. 
Fuzzy control means that the steps from 1. to 4. are repeated until the goal, the smallest deviation e 
and its change €, is reached. 


Y Yinf 
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5.9.6.2 Method of Sugeno 


The Sugeno method is also used for planning of a fuzzy control process. It differs from the Mamdani 
concept in the rule base and in the defuzzification method. It has the following steps: 


1. Rule Base: The rule base consists of rules of the following form: 
Fi: IF 2 is Ay AND ... AND zy is Ay, THEN u; = ph + pizi + phae +--+ + pire. (5.390) 
The notations mean: 
Aj: fuzzy sets, which can be determined by membership functions; 
zj: crisp input values as, e.g., the error e and the change of the error é, which tell us something about 
. the dynamics of the system; 
pi: weights of; (j-—1,2,...,k); 
τη: the output value belonging to the i-th rule (i = 1,2,...,n). 
2. Fuzzifying Algorithm: A ji; € [0, 1] is calculated for every rule Rf. 
3. Decision Module: A non-fuzzy-valued quantity is calculated from the weighted mean of u;, where 
the weights are 1; from the fuzzification: 


n n τι 
u= XO piui (x 2 ' (5.391) 
σι ic 


Here u is a crisp value. 

The defuzzification of the Mamdani method does not work here. The problem is to get the weight 
parameters pj available. These parameters can be determined by a mechanical learning method, e.g., 
by an artificial neuronetwork (ANN). 


5.9.6.3 Cognitive Systems 
To clarify the method, the following known example will be investigated with the Mamdami method: 
The regulation of a pendulum that is perpendicular to its moving base (Fig. 5.78). The aim of the 
control process is to keep a pendulum in balance so that the pendulum rod should stand vertical, i.e., 
the angular displacement from the vertical direction and the angular velocity should be zero. It must 
be done by a force F acting at the lower end of the pendulum. This force is the control quantity. The 
model is based on the activity of a human “control expert” (cognitive problem). The expert formulates 
its knowledge in linguistic rules. Linguistic rules consist, in general, of a premise, i.e., a specification of 
the measured values, and a conclusion which gives the appropriate control value. 

For every set of values X1, X2, . . . , Xn for the measured values and Y for the control quantity the appro- 
priate linguistic terms are defined as “approximately zero” , “small positive” , etc. Here “approximately 
zero” with respect to the measured value & can have a different meaning as for the measured value ξο. 
E Inverse Pendulum on a Moving Base (Fig. 5.78) 

1. Modeling For the set X, (values of angle) and analogously for the input Ib 

quantity X» (values of the angular velocity) the seven linguistic terms, negative θ 
large (nl), negative medium (nm), negative small (ns), zero (z), positive small 
(ps), positive medium (pm) and positive large (pl) are chosen. 











F 
For the mathematical modeling, a fuzzy set must be assigned by graphs to every ———924-2——— 
one of these linguistic terms (Fig. 5.77), as was shown for fuzzy inference (sce 


5.9.4, p. 425). Figure 5.78 








2. Choice of the Domain of Values 

e Values of angles: ©(—90° < © < 90°): X, := [-90°, 90°]. 

e Values of angular velocity: O(—45° s-! < Ò < 45° s71): Xz := [—45° s71, 45° s71. 
e Values of force F: (C10N € F € 10N): Y :— [-10N, 10 N]. 
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The partitioning of the input quantities X; and X5 and the output quantity Y is represented graphically 








in Fig. 5.79. Usually, the initial values are actual measured values, e.g., 9 = 36°, Ò = —235?s1. 
μίθ) μίθ) u(F) 
1 1 1 
i > > > 
nl nm 45 pmpl 0 nl D. pl ò nl 5 pl F 
a) b) c) 


Figure 5.79 


3. Choice of Rules Considering the following table, there are 49 possible rules (7 x 7) but there are 
only 19 important in practice, so the following two are to be discussed: R1 and R2. 


R1: If O is positive small (ps) and © zero (z), then F is positive small (ps). For the degree of fulfillment 
(also called the weight of the rules) of the premise with a = min {μῶ (8); ph (6)) = min{0.4;0.8} = 






































0.4 one gets the output set (5.392) by an a cut, hence the output fuzzy set is positive small (ps) in the 
height a = 0.4 (Fig. 5.80c). 
Table: Rule base with 19 practically meaning- 
ful rules 2 ο 

ΘᾺΘ | nl | ππὶ | ns | 2 ps | pm | pl 5/7 0Sy«cl 

nl ps | pl 0.4 1<y<4, 

nm pm a RI 

ns nm ns | ps Loo, 52.28 (y) = 2 (5.392) 

z nl | nm | ns | z ps | pm | pl 2 = 5! 4<y<5, 

ps ns | ps pm . 

pm nm 0 otherwise. 

pl nl |ns 

: A 1 25x 4 
R2: If O is positive medium (pm) and O is 55^ "m s 
zero (z), then F is positive medium (pm). 0.6 4& y € 6, 
For the performance score of the premise fol- 
in [n9 (0); € Mss aas) — (5.393) 

lows a = min (uf (O); uí 6) = min{0.6; E ' 3— 24 6<y <75, 
0.8} = 0.6, the output set (5.393) analogously 5 
to rule R1 (Fig. 5.80f). 0 otherwise. 


4. Decision Logic The evaluation of rule Ry with the min operation results in the fuzzy set in 
Figs. 5.80a—c. The corresponding evaluation for the rule R3 is shown in Figs. 5.80d-f. The control 
quantity is calculated finally by a defuzzification method from the fuzzy proposition set (Fig. 5.80g). 
The result is the fuzzy set (Fig. 5.80g) by using the max operation and taking into account the fuzzy 
sets (Fig. 5.80c) and (Fig. 5.80f). 

a) Evaluation of the fuzzy set obtained in this way, which is aggregated by operators (see max-min 
composition 5.9.3.2, 1., p. 424). The decision logic yields: 


Output , 


Mee, XY [061]5;y- maxen.ag [min (109 (1), τ M gs pi )}) 5 (5.394) 
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b) After taking the maximum (5.395) is ob- 2i fr 0<y<1, 
tained for the function graph of the fuzzy set. 5 
c) For the other 17 rules results a degree of ful- 0.4 fo 1<y<3.5, 


fillment equal to zero for the premise, i.e., it re- 


1 : 2 
sults in fuzzy sets, which are zeros themselves. y—1for 35<y< 4, 


5. Defuzzification The decision logic yields μας, oos (U) Ξ4 5 (5.395) 
no crisp value for the control quantity, but 0.6 for 4<y<6, 

a fuzzy set. That means, by this method, 

one gets a mapping, which assigns a fuzzy set 3 - Ὃν fr 6<y<75 

Output of Y to every tuple (a1, ..., an) € X1x , uc 

Xa X ++- X Xn of the measured values. 0 for otherwise. 


Defuzzification means that there is to determine a control quantity using defuzzification methods. 


The center of gravity method and the maximum criterion method result in the value for control quantity 
F =3.950r F = 5.0. 
μίθ) 


μίθ) με) 










1.0 1.0 


min operation 
— 






04 



























> > > 
22536 9 5,951156 2.5 F 
a) positive small b) C) positive small 
6 μ(β) 
10 μίθ) μίθ) με) 10 
0.6 - : 
min opera 10n t 
Max operation 
I r T , E 
36 45 0 225  Ó 50 F 4 
d) positive medium e) zero f) positive medium g) control quantity 
6. Remarks Figure 5.80 


1. The “knowledge-based” trajectories should lie in the rule base so that the endpoint is in the center 
of the smallest rule deviation. 

2. By defuzzification an iteration process is introduced, which leads finally to the center of the partition 
space, i.e., which results in a zero control quantity. 

3. Every non-linear domain of characteristics can be approximated with arbitrary accuracy by the 
choice of appropriate parameters on a compact domain. 

5.9.6.4 Knowledge-Based Interpolation Systems 


1. Interpolation Mechanism 
Interpolation mechanisms can be built up with the help of fuzzy logic. Fuzzy systems are systems 
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to process fuzzy information. With them it is possible to approximate and interpolate functions. A 
simple fuzzy system, by which this property can be investigated , is the Sugeno controller. It has n 


input variables &,...,&, and defines the value of the output variable y by rules R;,..., E; in the form 
Ry: IF & is AP and --- and & is AO, THEN is y — fi(&,...,6,) (i— 1,2,...,m). (5.396) 


The fuzzy sets A. i3 AP always partition the input sets X;. The conclusions f;(&1,...,£,) of the 
rules are singletons, which can depend on the input variables £1, .. . , £4. 

By asimple choice of the conclusions the expensive defuzzification can be omitted and the output value 
y will be calculated as a weighted sum. To do this, the controller calculates a degree of fulfillment o; 
for every rule R; with a t norm from the membership grades of the single inputs and determines the 
output value 


XR. oft...) 
X αι i 
2. Restriction to the One-Dimensional Case 


For fuzzy systems with only one input x — £1, fuzzy sets represented by triangular functions are often 
used which are cut at the height 0.5. Such fuzzy sets satisfy the following three conditions: 


y= (5.397) 


1. For every rule R; there is an input ;, for which only one rule is fulfilled. For this input z;, the output 
is calculated by f;. By this, the output of the fuzzy system is fixed at N nodes 21,...,7y. Actually, 
the fuzzy system interpolates the nodes x1,...,7y. The requirement that at the node x; only one rule 
R; holds, is sufficient for an exact interpolation, but it is not necessary. For two rules R; and Ra, as 
they will be considered below, this requirement means that a1(z2) = a2(x1) = 0 holds. To fulfill the 
first condition, o4(r3) = as(z1) — 0 must hold. This is a sufficient condition for an exact interpolation 
of the nodes. 


2. There are at most two rules fulfilled between two consecutive nodes. If xı and xz are two such nodes 
with rules R; and Ra, then for inputs z € [r1, x] the output y is 


an(@) fi(@) + o2(2) fa(@) alx) 
1(@) + g(x) [fa(z) — fi(x)] with g := ———À————. 
a (z) 4- ag(z) fala) + (x) [fa() — fi(c)] a(s) + a(x) 
The actual shape of the interpolation curve between x; and x2 is determined by the function g. The 
shape depends only on the satisfaction grades o; and a2, which are the values of the membership func- 
tions jz) and ju4@ at the point x, ie., ay = yao (x) and az = pao) (x) are valid, or in short form 











(5.398) 


αι µια) απἀ ag = ji(x). The shape of the curve depends only on the relation 44/15 of the mem- 
bership functions. 

3. The membership functions are positive, so the output y is a convex combination of the conclusions 
fi- For the given and for the general case hold (5.399) and (5.400), respectively: 


min(fi fo) < y < max(fi, fe), (5-399) MI SY Span, fi (9:400) 


For constant conclusions, the terms f; and f» cause only a translation and stretching of the shape of 
the curve g. If the conclusions are dependent on the input variables, then the shape of the curve is 
differently perturbed in different sections. Consequently, another output function can be found. 


Applying linearly dependent conclusions and membership functions with constant sum for the input 
N 

αν then the output is y 2 c 3; o;(z) f; (x) with a; depending on x and a constant c, so that the interpo- 
i 

lation functions are polynomials of second degree. These polynomials can be used for the construction 

of an interpolation method with polynomials of second degree. 

In general, choosing polynomials of n-th degree, an interpolation polynomial of (n + 1)-th degree is 


obtained as a conclusion. In this sense fuzzy systems are rule-based interpolation systems besides con- 
ventional interpolation methods interpolating locally by polynomials, e.g., with splines. 





6 Differentiation 


6.1 Differentiation of Functions of One Variable 
6.1.1 Differential Quotient 


1. Differential Quotient or Derivative of a Function 
7ο + Ax) — f(to) . i ι... 
Λίσα + Az) — F (z0) if this limit 
0 Δα 
exists and is finite. The derivative function of a function y = f(x) with respect to the variable x is 
d df (x 
another function of x denoted by the symbols y', y, Dy, T f(x), Df (x), or f(x) 
da: dx 
every x is equal to the limit of the quotient of the increment of the function Ay and the corresponding 
increment Ax for Ax — 0, if this limit exists: 


The differential quotient of a function y = f(x) at x is equal to Am 
d» 





, and its value for 


f(a+ Ax) — f(x) 


AS (6.1) 


αλ a 

Te) m, 
2. Geometric Representation of the Derivative 
If y = f(x) is represented as a curve in a Cartesian coordinate 
system as in Fig. 6.1, and if the x-axis and the y-axis have the 
same unit, then 

f'(x) = tana (6.2) 
is valid. The angle a between the z-axis and the tangent line of 
the curve at the considered point defines the angular coefficient or 
slope of the tangent (see 3.6.1.2, 2., p. 245). The angle is measured 
from the positive x-axis to the tangent in a counterclockwise di- 
Figure 6.1 rection, and it is called the angle of slope or angle of inclination. 


3. Differentiability 

From the definition of the derivative it obviously follows that f(a) is differentiable with respect to x 
for the values of x where the differential quotient (6.1) has a finite value. The domain of the derivative 
function is a subset (proper or trivial) of the domain of the original function. If the function is contin- 
uous at x but the derivative does not exist, then perhaps there is no determined tangent line at that 
point, or the tangent line is perpendicular to the z-axis. In this last case the limit in (6.1) is infinity. 
For this case is used the notation f'(x) = +00 or —oo. 


y 











a) 


Figure 6.2 
'(0) 2 oo. At the point 0 the limit (6.1) tends to infinity, so the 


MA: f(x) = Ju: f'(z) — aa f 


derivative does not exist at the point 0 (Fig. 6.2a). 


1 

E B: f(x) = vsin— for xz z 0. At the point z — 0 the function f(x) is not defined, but it has zero 
T 

limit, so one writes f(0) = 0. However the limit (6.1) does not exist at x — 0 (Fig. 6.2b). 
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4. Left-Hand and Right-Hand Differentiability 

If the limit (6.1) does not exist for a value x — a, but the left-hand 
limit or the right-hand limit exists, this limit is called the left-hand 
derivative or right-hand derivative respectively. If both exist, the 
curve has two tangents here: 


f'(a — 0) — tana, f'(a 4- 0) 2 tan as. (6.3) 
Geometrically this means that the curve has a knee (Fig. 6.2c, 
Fig. 6.3). 





5 

B f(x) = oe for x z 0. For x = 0 the function is not defined, 
+ ee 

Figure 6.3 but it has zero limit at a — 0, so one writes f(x) = 0. At the point 


x = 0 there is no limit of type (6.1), but there is a left-hand and a right-hand limit f'(—0) — 1 and 
f'(+0) = 0, i.e., the curve has a knee here (Fig. 6.2c). 


6.1.2 Rules of Differentiation for Functions of One Variable 
6.1.2.1 Derivatives of the Elementary Functions 


The elementary functions have a derivative on all their domains except perhaps some points, as repre- 
sented in Fig. 6.2. 


A summary of the derivatives of elementary functions can be found in Table 6.1. Further derivatives 
of elementary functions can be found by reversing the results of the indefinite integrals in Table 8.1. 


Remark: In practice, it is often useful to transform the function into a more convenient form to per- 
form differentiation, e.g., to transform it into a sum where parentheses are removed (see 1.1.6.1, p. 11) 
or to separate the integral rational part of the expression (see 1.1.7, p. 14) or to take the logarithm of 
the expression (see 1.1.4.3, p. 9). 











1 2 3 5 
2— ᾿{4ϑπ!α᾽ 2 -5 x i ο — 
BA: y= BVa FAVE tE = 3x2 2442 342; 2y jisa ας 3+1 
T T da 2 3 
dy 1 2x 1 2x 2r 
EB: y=! ij = Ξί ) Ξί ) - 
gs ) dr  2N2241 2λα2-1 α--1 





6.1.2.2 Basic Rules of Differentiation 
Assume u, v, w, and y are functions of the independent variable x, and w', v/, w', and y' are the deriva- 
tives with respect to x. The differential is denoted by du, dv, dw, and dy (see 6.2.1.3, p. 446). The basic 
rules of differentiation, which are explained separately, are summarized in Table 6.2, p. 439. 
1. Derivative of a Constant Function 
The derivative of a constant function c is the zero function: 

c =0. (6.4) 
2. Derivative of a Scalar Multiple 
A constant factor c can be factored out from the differential sign: 

(cu) =cu', d(cu)= cdu. (6.5) 
3. Derivative of a Sum 
If the functions u, v, w, etc. are differentiable one by one, their sum and difference is also differentiable, 
and equal to the sum or difference of the derivatives: 

(u+v- w) =u +v- w, (6.6a) 


d(u + v — w) = du + dv — dw. (6.6b) 
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It is possible that the summands are not differentiable separately, but their sum or difference is. Then 
the derivative must be calculated by definition formula (6.1). 


Tab 











e 6.1 Derivatives of elementary functions in the intervals on which they are defined and the 
occurring numerators are not equal to zero 
Function Derivative Function Derivative 
C (constant) 0 sec © 2 
cos? x 
— cosa 
T 1 cosec x € 
sin g 
2 (nc R nant arcsin g (|x| <1 =Á 
1 1 1 
z (x #0) -3 (x #0) arccosz (|v| « 1) δρα 
1 1 
m (x #0) 2 arctan x Irz 
1 1 
vc (x0) σα arccot £ IFA 
1 1 
Vr (n€ R, n zZ 0, x » 0) arcsecz (r » 1) 
n zn-l uv x? T 1 
e? εὖ arccosec z (r1 = 
( ) ryz? — 1 
e (be R) ῥοῦν sinh x cosh x 
a* (a 5 0) a” lna cosh x sinh x 
1 
a* (L€ R, a» 0) ba Ina tanh x 3 
cosh“ x 
1 1 
Inz (x 0) = cothx (x #0) em 
T sinh^r 
1 1 ] 1 
log,r (a20,aZ1, x»0)|-—log,e— Arsinh x 
x slna 142? 
1 0.4343 1 
lgx (x > 0) —lgex ——_ Arcoshx (a > 1) 
x x x2 —1 
1 
sina cos x Artanha (|v| « 1) 
1-23? 
1 
ST — sinx Arcothz (|v| » 1) | — 
cos x sin x rcothz (|v| » 1) PIC 
1 
tanz (x Æ (2k+1)5, k € 2) | —— =sec?x [f(x)|” (n€ R) n[f (x)]^71 f'(x) 
cos? x 
"P miles erac * > Pv) 
cotz (rz km, ke) —,— — —cosec^z | In f(x) (f(x) > 0) 
sin^r f(x) 









































4. Derivative of a Product 
If two, three, or n functions are differentiable one by one, then their product is differentiable, and can 
be calculated as follows: 


a) Derivative of the Product of Two Functions: 


(uv) =u v+ uv, 


d(uv) = vdu + u dv. 


(6.7a) 
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It is possible that the terms are not differentiable separately, but their product is. Then the derivative 
must be calculated by definition formula (6.1). 


b) Derivative of the Product of Three Functions: 
(uvw)' 2 u'vw-- uv w-cuvw, d(uvw)-vwdu- uwdv- uvdw. (6.7b) 


c) Derivative of the Product of n Functions: 


n 
(Uy Ug +++ Un)! = Σ Uy Ug + UL es Un. (6.7c) 
i=l 
WB A: y—2?cosz, y! = 32? cosa— x sina. 


E B: y= r’e cosy, y! = 347e* cosa + 2e* cosa — xe sina. 


5. Derivative of a Quotient 
If both u and v are differentiable, and v(x) Æ 0, their ratio is also differentiable: 


uN vu —uwv' u v du — u dv 
(E) i. a) roc (6.8) 


























υ υ υ υ 
sina cosr)(sinz) — (sinz)(cosr) cos? x + sin? x 1 
E= tans a" (cos x)(sin x)’ — (sin. x)(cos x) + 
: cosg’ " cos? a cos? a cos? a 


6. Chain Rule 
The composite function (see 2.1.5.5, 2., p. 61) y — u(v(x)) has the derivative 
dy i j du dv 
— —u(v)v(z)- —— 
απ ΠΠ dv dx’ 
where the functions u = u(v) and v = v(x) must be differentiable functions with respect to their own 
variables. u(v) is called the exterior function, and v(x) is called the interior function. According to 


(6.9) 


du dv 
this, T is the exterior derivative and de is the interior derivative. It is possible that the functions u 
υ g 
and v are not differentiable separately, but the composite function is. Then one gets the derivative by 
the definition formula (6.1). 
Similarly one has to proceed if there is a longer “chain”, i.e., in the case of a composite function of 
several intermediate variables. For example for y = u(v(w(x))): 
dy du dv dw 
ja ae (6.10) 
dx dv dw dx 
2 
Sin g è 
dy d (e ) d (sin? a) d (sin x) 


dr q (sin? 9) d(sinz) dæ 


d ( etan yn 
ως... eee d(anVT)d(/T) tanyz 1 1 
᾽ "dg d(tanyx) (γα) da cos? γη ον 
7. Logarithmic Differentiation 
If y(x) > 0 holds, one can calculate the derivative y' starting with the function In y(x), whose derivative 
(considering the chain rule) is 


d(In y(x)) l3 











iney in? to : 
BA: y= eB m eI © 9 sin x cos x. 


EXE 11 
da vx)" - 
From this rule 
i d(ln y(x 
y - yo) ne (6.12) 


dx 
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follows. 

Remark 1: With the help of logarithmic differentiation it is possible to simplify some differentiation 
problems, and there are functions such that this is the only way to calculate the derivative, for instance, 
when the function has the form 


y= u(z)" (2) with u(x) > 0. (6.13) 
The logarithmic differentiation of this equality follows from the formula (6.12) 
1 (In u" d(vl vu' 
gg hu Ub) Uu) nu) =u'(o'nu+—). (6.14) 
da dx u 
f 3r-2. 





E y = (2x - 1)*, Iny 2 3rln(2z 4 1), -5 3In(2x 4- 1) 4 ; 
y 2r 1 





; 2x 
Po Ἢ 3r 
y' —3(2r - 1) (mz 0 € 1). 


Remark 2: Logarithmic differentiation is often used in the case to differentiate a product of several 
functions. 


1 
WA: y—vz?c"sinr, My= (sin x +4a +Insinz), 
y 17/3 cos & 1 
y - ( +4+ --) , y= 





4364 sin x 


y 2 


£ sin x 


È + icta). 
T 


/ 


1 1 
MB: y=uv, lnny-—lnu-lnv, 9 = Lu’ + v. From this identity it follows that y' — (uv)' — 
y u v 


vu'+uv’, so one gets the formula for the derivative of a product (6.7a) (under the assumption u, v > 0). 











u y 1, 1, soi ocu ; uy’ 
W C: y , my=Inu-Inv, > = -u v’. From this identity it follows that y’ = (| —)] = 
υ y u v υ 


u uv è vuu 


g aua p which is the formula for the derivative of a quotient (6.8) (under the assumption 
u,v > 0). 

8. Derivative of the Inverse Function 

If y — q(x) is the inverse function of the original function y = f(x), then both forms y = f(x) and 
x = p(y) are equivalent. For every corresponding value of x and y such that f is differentiable with 
respect to z, and q is differentiable with respect to y, e.g., none of the derivatives is equal to zero, 
between the derivatives of f and its inverse function ¢ is valid the following relation: 





1 dy 1 
EUN pod : al 
fo-a "ond (6.15) 
dy 


E The function y = f(x) = arcsin x for —1 < x < 1 is equivalent to the function x = y(y) — sin y for 
—n/2 « y < 1/2. From (6.15) it follows that 
1 1 1 1 


(siny) cosy γι —sinty  vVl—a? ᾿ 


9. Derivative of an Implicit Function 
Suppose the function y = f(x) is given in implicit form by the equation F(x, y) = 0. Considering the 
rules of differentiation for functions of several variables (see 6.2, p. 445) calculating the derivative with 
respect to x gives 
OF OF, P 
——-—— -0 andso y ———, 
ὃτ Oy y : Fy 


if the partial derivative Fy, differs from zero. 























(arcsin x)’ because cos y # 0 for —1/2 < y < 1/2. 


(6.16) 
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E The equation = = +e = 1 of an ellipse with semi-axes a and b can be written in the form F(x,y) = 


m 
ey 
a Rp 1 = 0. For the slope of the tangent line at the point of the ellipse (x, y) one gets according 
to » (636) 
fae oes 
9M ary” 


10. Derivative of a Function Given in Parametric Form 
If a function y = f(x) is given in parametric form x = x(t), y = y(t), then the derivative y’ can be 
ME by the formula 


λες (6.17) 


with the help of the derivatives y(t) = 
a(t) £0 holds. 
E Polar Coordinate Representation: Ifa function is given with polar coordinates (see 3.5.2.2, 3., 
p. 192) p = p(y), then the parametric form is 

r=p(p)cosy, y=ply)sing (6.18) 
with the angle y as a parameter. For the slope y’ of the tangent of the curve (see 3.6.1.2, 2., p. 245 or 
6.1.1, 2., p. 432) one gets from (6.17) 

dp 


TS emi or PEON where p= —. (6.19) 
pcosq — psinq dp 
Remarks: 


dy dx 
E and a(t) = di with respect to the variable t, if of course 


1. The derivatives t, jj are the components of the tangent vector at the point (x(t), y(t)) of the curve. 
2. Tt is often useful to consider the complex relation: 
x(t)--iy(t) — z(t), ασ ει - z(t). (6.20) 
; iwt ; "ET: i(wt + 5) 
Bi Circular Movement: z(t) = rel’ (r,w const), Zz(t) — rise!" — rue V^ * 2 
vector runs ahead by a phase-shift 7/2 with respect to the position vector. 


. The tangent 


11. Graphical Differentiation 

If a differentiable function y — f(x) is represented by its curve P in the 
Cartesian coordinate system in an interval a < x < b, then the curve I” 

of its derivative can be constructed approximately. The construction of a 
tangent estimated by eye is pretty inaccurate. However, if the direction 

of the tangent MN (Fig. 6.4) is given, then one can determine the point 

of contact A more precisely. 

1. Construction of the Point of Contact of a Tangent 

One draws two secants Mı N; and M2N3 parallel to the direction MN of 
the tangent so that the curve is intersected in points being not far from M 
each other. Then there are to be determine the midpoints of the secants, 
and a straight line through them must be drawn. This line PQ intersects 
the curve at the point A, which is approximately the point, where the tan- Figure 6.4 

gent has the given direction MN. To check the accuracy, one draws a third line close to and parallel to 
the first two lines, and the line PQ should intersect it at the midpoint. 


2. Construction of the Derivative Curve 











a) Choose some directions l4, l2,..., ln, which could be the directions of some tangents of the curve 
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y — f(x) in the considered interval as in Fig. 6.5, and determine the corresponding points of contact 


Aj, Ag,..., An, where the tangents themselves must not be constructed. 
b) Choose a point P, a “pole”, on the negative side of 1 Ay 
the x-axis, where the longer the segment PO = a, the 1 
flatter the curve is. Ze 1 

c) Draw the lines through the pole P parallel to the di- l 


with the y-axis by B1, B2,... Bn. 

d) Construct the horizontal lines B101, B2092, . . . , BnCn 
through the points Pi, B2,..., Bn to the intersection 
points C1, C5, ..., C4, with the orthogonal lines from the 
points A1, Ao, ..., As. P 
e) Connect the points C1, C5, ..., C, with the help of a 
curved ruler. The resulting curve satisfies the equation 

y — af'(r). If the segment a is chosen so that it cor- 
responds to the unit length on the y-axis, then the curve 
one gets is the curve of the derivative. Otherwise, one has 

to multiply the ordinates of C1, C2, . . . , Cn by the factor 
1/a. The points D1, D2,..., Dn given in Fig. 6.5 are on 
the correctly scaled curve I" of the derivative. Figure 6.5 


rections l4, l2, .. . ln, and denote their intersection points Ν 








6.1.3 Derivatives of Higher Order 
6.1.3.1 Definition of Derivatives of Higher Order 


d (d 
The derivative of y' = f’(a), which means (y')' or e (2). is called the second derivative of the 
x \ da 


d? 12 a 
function y = f(x) and it is denoted by y”, ü, dq f" (2) or : I ) 


analogously. The notation for the n-th derivative of the function y = f(x) is: 





. Higher derivatives can be defined 


yi = EY mu a PO) (y= 0,1,... 5 y@) = FW) = γα). (6.21) 


E da” 


6.1.3.2. Derivatives of Higher Order of some Elementary Functions 
The n-th derivatives of the simplest functions are collected in Table 6.3. 


6.1.3.3 Leibniz's Formula 


To calculate the n-th-order derivative of a product of two functions, the Leibniz formula can be used: 


n(n—1 
D” (uv) = u D”v +7 Du D' v4 phu Dy 4... 
jede c mie n — ED pay ps-mo 4... 1 D'uv (6.22) 
m! 
d” 
Here the notation D” = τα is used. If Du is replaced by u and Dv by v, then one gets the formula 
T 


(6.23) whose structure corresponds to the binomial formula (see 1.1.6.4, p. 12): 


D"(uv) = Y ( r) Du D^". (6.23) 


mo Mn 


6.1 Differentiation of Functions of One Variable 


439 





BA: 


2x, y" 2 2, v" 2 v0 — 


Table 6.2 Differentiation rules 











Expression Formula for the derivative 
Constant function c —0 (c const) 
Constant multiple (cuy = cu’ (c const) 





Sum 


/ 














(ut vy =u +u 





Product of two functions 


(uv)' 2 u'v 4- uv' 





Product of n functions 


n 
ο» 
i=1 











4 / m. 4 / 
Quotient (9) = —- (v £0) 
Chain rule for πο. du dv 
two functions Y= UVa): Y= dv dz 
Chain rule for . du dv dw 


three functions 


υπ αωίωα)): y 





— dv dw dx 





(uey = aut 


(«€ R, o z 0) 








P or ΙΝ’ a 
did specially : (5) = Sa (u #0) 
u u 
d(inny(r) 1, ., . d(Iny) 
Logarithmic da B y” — cs dx 


differentiation 


vu' 
special: (u")' — u" (v lnu + x) (u » 0) 
u 





y inverse function of f, i.e. y = f(x) 4> z = ọ(y) : 








Differentiation of the f(a) = 1 or dy = d 
inverse function mum vy) dr du 
dy 
F(z,y)=0: F,+F,y’=0 or 
Implicit (z Ws T P "ap 
differentiation y= -Τ, F- Oa? d Ὃν ; ἕν o) 





Derivative in 


x= x(t), y = y(t) (t parameter): 





dy ù . dx dy 
parameter form y——-—- i-—.)]g-—— 
: y ds 4 τ Ὁ 
z = plp) cos p : 
τς τ(φ): (angle p as parameter) 


Derivative in 
polar coordinates 





p = ply)sing 
, dy _ psing+pcosp 
da 





. απ 
y p dp 


pcosy—rsiny 











x? cos ax): Ifv = z?, u — cosax are substituted, then follows u = a* cos (αυ + k3) ο 





-4 
5 = - 2a?8 cos (ax + 487) 


2 — 0. Except the first three cases, all the summands are equal to zero, so 


(αυ) 50) = x?a° cos (ax + 507) + = - 2xa" cos (ax + 47) + 
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— a9[(2450 — a22?) cos ax — 100ax sin ax] . 


WB B: (7267) — (Ὁ) -x'e + 0 -βαλεῦ + (2 - 6xe* 4- (5 - 6e? = e” (x? + 18x? + 90y + 120). 


Table 6.3 Derivatives of higher order of some elementary functions 











Function n-th-order derivative 
gm m(m — 1)(m —2)...(m—n 4 1)z "^" 
(for integer m and n > m the n-th derivative is 0) 
1 
z -1/, 
Ina (x > 0) (—1)" (n — 1)! zm 
" .. 11 
log, x (a > 0) (-1)" “Waa ue 
ele [ποκα 
a? (In a)"a* 
ake (kIna)"a** 
. ; no 
sin x sin(x + το 
nT 
COS x cos(x + ro. 
sin ka k" sin(kx 4- T) 
cos kx: k” cos(ka + T) 
sinh x sinh x for even n, cosh z for odd n 
cosh £ cosh x for even n, sinh x for odd n 











6.1.3.4 Higher Derivatives of Functions Given in Parametric Form 


If a function y = f(x) is given in the parametric form x = x(t), y = y(t), then its higher derivatives 


di dx 
(u", y". etc.) can be calculated by the following formulas, where y(t) = a a(t) = ap y(t) — 
Ὃν. απ re : 
de £= TP’ etc., denote the derivatives with respect to the parameter t: 
dt ! 


d'y  ij—-g? By  iàj-3ij-39i?— ay 
da? 4? ° dr x 2 


(a(t) £0). (6.24) 


6.1.3.5 Derivatives of Higher Order of the Inverse Function 


If y = (a) is the inverse function of the original function y = f(x), then both forms y = f(x) and 
x = (y) are equivalent. Supposing v'(y) 7 0 holds, the relation (6.15) is valid for the derivatives of 
the function f and its inverse function q. For higher derivatives (y, y", etc.) one gets 


Py oly) dy αρ) -- οφ”) 


dx? — [p(y)P' dr? [e (y)? ib (6.25) 
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6.1.4 Fundamental Theorems of Differential Calculus 
6.1.4.1 Monotonicity 


If a function f(x) is defined and continuous in a connected interval, and if it is differentiable at every 
interior point of this interval, then the relations 
f(x) 2 0 fora monotone increasing function, (6.264) 


f(x) X 0. for a monotone decreasing function (6.26b) 


are necessary and sufficient. If the function is strictly monotone increasing or decreasing, then the 
derivative function f'(r) must not be identically zero on any subinterval of the given interval. In 
Fig. 6.6b this condition is not fulfilled on the segment BC. 

The geometrical meaning of monotonicity is that the curve of an increasing function never falls for 
increasing values of the argument, i.e., it either rises or runs horizontally (Fig. 6.6a). Therefore the 
tangent line at any point of the curve forms an acute angle with the positive x-axis or it is parallel to 
it. For monotonically decreasing functions (Fig. 6.6b) analogous statements are valid. If the function 
is strictly monotone, then the tangent can be parallel to the x-axis only at some single points, e.g., at 
the point A in Fig. 6.6a, i.e., not on a subinterval such as BC in Fig. 6.6b. 











» 
b) 0| X 


Figure 6.6 Figure 6.7 
6.1.4.2 Fermat’s Theorem 


If a function y = f(x) is defined on a connected interval, and it has a maximum or a minimum value at 
an interior point x — c of this interval (Fig. 6.7), i.c., if for every x in this interval 


fle) > f(z) (6.27a) or f(e) < f(z), (6.27b) 
holds, and if the derivative exists at the point c, then the derivative must be equal to zero there: 
f'(c) — 0. (6.27c) 


The geometrical meaning of the Fermat theorem is that if a function satisfies the assumptions of the 
theorem, then its curve has tangents parallel to the x-axis at A and B (Fig. 6.7). 

The Fermat theorem gives only a necessary condition for the existence of a maximum or minimum value 
at a point. From Fig. 6.6a it is obvious that having a zero derivative is not sufficient to give an extreme 
value: At the point A, f'(x) = 0 holds, but there is no maximum or minimum here. 

To have an extreme value differentiability is not a necessary condition. The function in Fig. 6.8d has 
a maximum at e, but the derivative does not exist here. 


6.1.4.3 Rolle's Theorem 


If a function y = f(x) is continuous on the closed interval [a, b], and differentiable on the open interval 


(a,b), and 


f(a)=0, f(b)=0 (ας ὐ) (6.28a) 
hold, then there exists at least one point c between a and b such that 
f(c) 20 (a«c«b) (6.28b) 


holds. The geometrical meaning of Rolle's theorem is that if the graph of a function y = f(a) which is 
continuous on the interval (a, b) intersects the a-axis at two points A and B, and it has a non-vertical 





442 6. Differentiation 





tangent at every point, then there is at least one point C between A and B such that the tangent is 
parallel to the x-axis here (Fig. 6.8a). It is possible, that there are several such points in this inter- 








Figure 6.8 


val, e.g., the points C, D, and E in Fig. 6.8b. The properties of continuity and differentiability are 
important in the theorem: in Fig. 6.8c the function is not continuous at x = d, and in Fig. 6.8d the 
function is not differentiable at x = e. In both cases f'(v) z 0 holds everywhere where the derivative 
exists. 


6.1.4.4 Mean Value Theorem of Differential Calculus 


If a function y = f(x) is continuous on the closed interval [a, b] and differentiable on the open interval 
(a, b), then there is at least one point c between a and b which satisfies the following relation: 


f(b) = f(a) 
b—a 


holds. Substituting b = a + h, and O means a number between 0 
and 1, then the theorem can be written in the form 

J(a-- h) 2 f(a) - hf'(a-- Gh) (0«0O «1). (6.29b) 
1. Geometrical Meaning The geometrical meaning of the 
theorem is that if a function y = f(x) satisfies the conditions of 
the theorem, then its graph has at least one point C between A 
X and B such that the tangent line at this point is parallel to the 

line segment between A and B (Fig. 6.9). There can be several 
Figure 6.9 such points (Fig. 6.8b). 


ΞΟ (a«c«) (6.292) 






1 
i 
1 
i 
1 
i f 
: i 
a ς 


That the properties of continuity and differentiability are important can be shown in examples and also 
as can be observed in Fig. 6.8c,d. 
2. Applications The mean value theorem has several useful applications. 
Wi A: This theorem can be used to prove some inequalities in the form 
|f (b) — f(a)| « K|b — a], (6.30) 


where K is an upper bound of | f’()| for every x in the interval [a, 0]. 








1 
E B: How accurate is the value of f(r) = Dpr? if m is replaced by the approximate value 7 = 3.14? 
T 
2c 1 
' have: — f(n)| 9 στο —T| < 0. - 0. ) — 0. 5 ic cans is 
We have: |f(x) — f(7)| | (dy |r — 7| < 0.053 - 0.0016 = 0.000085, which means Tas 





between 0.092 084 + 0.000 085. 
6.1.4.5 Taylor's Theorem of Functions of One Variable 


If a function y = f(x) is continuously differentiable (it has continuous derivatives) n — 1 times on the 
interval [a, a 4- ^], and if also the n-th derivative exists in the interior of the interval, then the Taylor 
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formula or Taylor expansion is 
dires hiy h y ped 
flat ) = f(a) + gf o + uif (a) + Teu 


with 0 < O < 1. The quantity h can be positive or negative. The mean value theorem (6.29b) is a 
special case of the Taylor formula for n = 1. 


6.1.4.6 Generalized Mean Value Theorem of Differential Calculus 
(Cauchy’s Theorem) 
If two functions y = f(x) and y = (x) are continuous on the closed interval [a,b] and they are dif- 
ferentiable at least in the interior of the interval, and (x) is never equal to zero in this interval, then 
there exists at least one value c between a and b such that 
Io 

IIE) ΤΙ ο. (6.32) 

pb) - pla) p(o) 
The geometrical meaning of the generalized mean value theorem corresponds to that of the first mean 
value theorem. Supposing, e.g., that the curve in Fig.6.9 is given in parametric form 2; = y(t), y = 
f(t), where the points A and B belong to the parameter values t = a and t = b respectively. Then for 
the point C 


[O79 (a) + E f(a +Oh) (6.31) 








b '(c 
tana = J0) σία) σας (6.39) 
φίῦ)-- ία) vc) 
is valid. For y(x) = x the generalized mean value theorem is simplified into the first mean value 
theorem. 


6.1.5 Determination of the Extreme Values and Inflection Points 
6.1.5.1 Maxima and Minima 


The substitution value f (xo) of a function f (x) is called the relative maximum (M) or relative minimum 
(m) if one of the inequalities 


f(to +h) < f(xo) (for maximum), (6.34a) 
f(to +h) > f(xo) (for minimum) (6.34b) 


holds for arbitrary positive or negative values of h small enough. At a relative maximum the value 
f (xo) is greater than the values in the neighborhood, and similarly, at a minimum it is smaller. The 
relative maxima and minima are called relative or local extrema. The greatest or the smallest value of 
a function in an interval is called the global or absolute maximum or global or absolute minimum in this 
interval. 


y y M 














Figure 6.10 


6.1.5.2 Necessary Conditions for the Existence ofa Relative 

Extreme Value 
A function can have a relative maximum or minimum only at the points where its derivative is equal 
to zero or does not exist. That is: At the points of the graph of the function corresponding to the 
relative extrema the tangent line is whether parallel to the z-axis (Fig. 6.10a) or parallel to the y-axis 
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(Fig. 6.10b) or does not exist (Fig. 6.10c). Anyway, these are not sufficient conditions, e.g., at the 
points A, B, C in Fig. 6.11 these conditions are obviously fulfilled, but there are no extreme values of 
the function. 

If a continuous function has relative extreme values, then maxima and minima follow alternately, that 
means, between two neighboring maxima there is a minimum, and conversely. 


6.1.5.3 Determination of the Relative Extreme Values and the Inflection 
Points of a Differentiable Explicit Function y — f(x) 


Since f'(x) = 0 is a necessary condition where the deriva- 
tive exists, after determining the derivative f'(x), first 
one calculates all the real roots z4,29,...,i,..., t4 of 
the equation f'(x) = 0. Then each of them has to be 
checked , e.g., z; with one of the following methods. 

1. Method of Sign Change 

For values x- and x+, which are slightly smaller and 
greater than z;, and for which between x; and x... and z4 
Figure 6.11 no more roots or points of discontinuity of f'(x) exist, one 








xY 


0] 


checks the sign of f'(x). When during the transition from f'(x..) to f'(x.,) the sign of f'(r) changes 
from “+” to “—”, then there is a relative maximum of the function f(x) at x — v; (Fig. 6.12a); if it 
changes from “—” to “+”, then there is a relative minimum there (Fig. 6.12b). If the derivative does 
not change its sign (Fig. 6.12c,d), then there is no extremum at x — x;, but it has an inflection point 
with a tangent parallel to the x-axis. 

2. Method of Higher Derivatives 

If a function has higher derivatives at x = x;, then one can substitute, e.g., the root x; into the second 
derivative f”(x). If f”(x;) < 0 holds, then there is a relative maximum at zx;, and if f”(x;) > 0 holds, 
a relative minimum. If f"(z;) — 0 holds, then x; must be substituted into the third derivative f"(a:). 
If f"(x;) Z 0 holds, then there is no extremum at x = x; but an inflection point. If still f"(y;) = 0 
holds, then one substitutes it into the forth derivative, etc. If the first non-zero derivative at x = x; is 
an even one, then f(x) has an extremum here: If the derivative is positive, then there is minimum, if 
it is negative, then there is a maximum. If the first non-zero derivative is an odd one, then there is no 
extremum there (actually, there is an inflection point). 








Figure 6.12 


3. Further Conditions for Extreme Points and Determination of Inflection Points 
If a continuous function is increasing below xo and decreasing after, then it has a maximum there; if it 
is decreasing below and increasing after, then it has a minimum there. Checking the sign change of the 
derivative is a useful method even if the derivative does not exist at certain points as in Fig. 6.10b,c 
and Fig. 6.11. If the first derivative exists at a point where the function has an inflection point, then 
the first derivative has an extremum there. So, to find the inflection points with the help of derivatives, 
one has to do the same investigation for the derivative function as one has done for the original function 
to find its extrema. 


Remark: For non-continuous functions, and sometimes also for certain differentiable functions the 
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determination of extrema needs individual ideas. It is possible that a function has an extremum so 
that the first derivative exists and it is equal to zero, but the second derivative does not exist, and 
the first one has infinitely many roots in an arbitrary neighborhood of the considered point, so it is 
meaningless to say it changes its sign there. For instance f(x) — a?(2 4 sin(1/z)) for x z 0 and 
f(0) =0. 

6.1.5.4 Determination of Absolute Extrema 

The considered interval of the independent variable is divided into subintervals such that in these in- 
tervals the function has a continuous derivative. The absolute extreme values are among the relative 
extreme values, or at the endpoints of the subintervals, if their endpoints belong to them. For non- 
continuous functions or for non-closed intervals it is possible that no maximum or minimum exists on 
the considered interval. 


Examples of the Determination of Extrema: 

E A: y= e7”, interval [-1, +1]. Greatest value at x = 0, smallest at the endpoints (Fig. 6.13a). 
E B: y= x? — x°, interval [--1, --2]. Greatest value at z — 1-2, smallest at z — —1, at the ends of the 
interval (Fig. 6.13b). 





1 

BC: y= z , interval [—3,+3], x # 0. There is no maximum or minimum. Relative minimum 
l+ez 

at « = —3, relative maximum at x = 3. If one defines y = 1 for « = 0, then there will be an absolute 

maximum at x — 0 (Fig. 6.13c). 


MD: y=2- 3, interval [-1, +1]. Greatest value at z — 0 (Fig. 6.13d, the derivative is not finite). 


6.1.5.5 Determination of the Extrema of Implicit Functions 

If the function is given in the implicit form F(r,y) — 0, and the function F itself and also its partial 
derivatives F7, Fy are continuous, then its maxima and minima can be determined in the following way: 
1. Solution of the Equation System F(a, y) — 0, F,(z, y) = 0 and substitution of the resulting 
values (21,1). (12. 92)... .. (zi, yi)... in Fyj and Fo. 

2. Sign Comparison for F, and F,, at the Point (z;, y;): When they have different signs, the 
function y — f(x) has a minimum at z;; when F, and F,, have the same sign, then it has a maximum at 
xi. If either Fy or Fy, vanishes at (x;, yi), then one needs further and rather complicated investigation. 





Figure 6.13 


6.2 Differentiation of Functions of Several Variables 
6.2.1 Partial Derivatives 
6.2.1.1 Partial Derivative of a Function 








The partial derivative of a function u = f (x1, £2,- . . , ti,- - , En) with respect to one of its n variables, 
e.g., with respect to x, is defined by 
Ou ] qj Δι, T2, L3, -- -, Zn) — f (£1, T2, £33- -3 L: 
- "nen 1, T2, T3 n) — f (t1, £2, £3 DA (6.35) 


Óxi — ^m0 Azı 
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so only one of the n variables is changing, the other n — 1 are considered as constants. The symbols 


I TT ðu ὃ ] F : 
for the partial derivatives are an? di ο , fi- A function of n variables can have n first-order partial 
ae: X 
Ou Ou Ou Ou 


derivatives: —— , —— ,—— ,...,z— . The calculation of the partial derivatives can be done following 
Oz; Ox3 Ora Or, 


the same rules as there are for the functions of one variable. 


xy Ou xy Qu x? Ou xy 
Bu=—, = στο. : 
2 Ou Zz ὃν z Oz 2? 








6.2.1.2 Geometrical Meaning for Functions of Two Variables 


If a function u = f(x,y) is represented as a surface in a Cartesian coordinate system, and this surface 
is intersected through its point P by a plane parallel to the x, u plane (Fig. 6.14), then holds 


z = tana, (6.36a) 
where a is the angle between the positive x-axis and the tangent line of the intersection curve at P, 
which is the same as the angle between the positive x-axis and the perpendicular projection of the 
tangent line into the zx, u plane. Here, a is measured starting at the x-axis, and the positive direction is 
counterclockwise if looking toward the positive half of the y-axis. Analogously to a, f is defined with 
a plane parallel to the y, u plane: 


ðu 
— — tan f. (6.36b) 
The derivative with respect to a given direction, the so-called directional derivative, and derivative with 
respect to volume, will be discussed in vector analysis (see 13.2.1, p. 708 and p. 709). 

















u u 
u=f(x,y) 
Au 

0 > 

y 
P 

dx 
X dy 
Figure 6.14 Figure 6.15 


6.2.1.3 Differentials of x and f (x) 
1. The Differential dz of an Independent Variable x 


is equal to the increment Az, i.e., 
dx = Ac (6.37a) 
for an arbitrary value of Az. 
2. The Differential dy of a Function y — f(x) of One Variable x 
is defined for a given value of x and for a given value of the differential dx as the product 


dy — f'(x) dz. (6.37b) 
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3. TheIncrement of a Function y — f(x) for x + Δα 
is the difference 

Ay = f(a + Ax) — f(x). (6.37c) 
4. Geometrical Meaning of the Differential 
If the function is represented by a curve in a Cartesian coordinate system, then dy is the increment of 
the ordinate of the tangent line for the change of « by a given increment dx (Fig. 6.1). In an analogous 
way Ay is the increment of the ordinate of the curve. 


6.2.1.4 Basic Properties of the Differential 
1. Invariance 
Independently of whether x is an independent variable or a function of a further variable t 
dy = f'(x) dx (6.38) 
is valid. 
2. Order of Magnitude 
If dx is an arbitrarily small value, then dy and Ay = y(x + Ax) — y(x) are also arbitrarily small, but of 


Ai . : πα 
eU d Consequently, the difference between them is also arbitrarily 


equivalent amounts, i.e., lim 
Az>0 dy 


small, but of higher order than dz, dy and Ax (except if dy = 0 holds). Therefore, one gets the relation 
_ Ay ; ; 

a =1, Ayxdy= f'(x)dz, (6.39) 

which allows to reduce the calculation of a small increment to the calculation of its differential. This 

formula is frequently used for approximate calculations (see 6.1.4.4, p. 442 and 16.4.2.1, 2., p. 855). 

6.2.1.5 Partial Differential 

For a function of several variables u = f(x, y,...) one can form the partial differential with respect to 

one of its variables, e.g., with respect to x, which is defined by the equality 


du = df = a dz. (6.40) 


6.2.2 Total Differential and Differentials of Higher Order 


6.2.2.1 Notion of Total Differential of a Function of Several Variables 
(Complete Differential) 


1. Differentiability 
The function of several variable u — f(x, 12, ..., vi, ..., v4) is called differentiable at the point 
Pozo, 290»... , i9, -. - tno) if at a transition to an arbitrarily close point P(x194- A, 1294- A9, . . . , rio 
Δεν no d- Ara) with the arbitrarily small quantities Au, Ax, ..., Ax... , A, the complete 
increment 

Au = f(z10 + Azı, £2 + AT2,... , Lio + ATi, . . . , Eno + Azn) 





— f (110; 120, . .. , Lio, - - - 2η) (6.41a) 
of the function differs from the sum of the partial differentials of all variables 
Qu Qu Qu f 
(55, 55 + a m Toe Jr, hm) 210-820-800 (6.41b) 
by an arbitrarily small amount in higher order than the distance 
PoP = Ax? + Ar? +... + Az? = y dz +daz+...+dx?. (6.419) 


A continuous function of several variables is differentiable at a point if its partial derivatives, as func- 
tions of several variables, are continuous in a neighborhood of this point. This is a sufficient but not a 
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necessary condition, while the simple existence of the partial derivatives at the considered point is not 
sufficient even for the continuity of the function. 


2. Total Differential 
If u is a differentiable function, then the sum (6.41b) 





du= mU n + ρα Tec du dry (6.42a) 
Oxy Ox, Be 
is called the total differential of the function. With the n-dimensional vectors 
i T 
ο mu. Bus fai dr = (dx, dzx9,...,dtq)7 (6.42c) 
pu ο ὅτι Ότο OXn, 


the total differential can be expressed as the scalar product 
du = (grad u)? - dr. (6.42d) 
In (6.42b), there is the gradient, defined in 13.2.2, p. 710, for n independent variables. 


3. Geometrical Representation 

The geometrical meaning of the total differential of a function of two variables u = f(x, y), represented 
in a Cartesian coordinate system as a surface (Fig. 6.15), is that du is the same as the increment of the 
applicate (see 3.5.3.1, 3., p. 210) ofthe tangent plane (at the same point) if d and dy are the increments 
of x and y. 

From the Taylor formula (see 6.2.2.3, 1., p. 449) it follows for functions of two variables that 


ð ὃ : 
f(x,y) = F(%0, Yo) + A (ao yo)( — xo) + L tao, yoly — yo) + Ri. (6.43a) 
Ignoring the remainder Rj, holds that 
o ὃ 
u = fleo, y0) + EEr, wo) (e = z0) + SEa, votu — i) (6.435) 


gives the equation of the tangent plane of the surface u = f(x,y) at the point P (xo, yo, uo). 
4. The Fundamental Property of the Total Differential 

is the invariance with respect to the variables as formulated in (6.38) for the one-variable case. 
5. Application in Error Calculations 


In error calculations one uses the total differential du for an estimation of the error Au (sce (6.41a)) 
(see, e.g., 16.4.1.3, 5., p. 852). From the Taylor formula (see 6.2.2.3, 1., p. 449) follows 


|Au| = |du + Ri| < |du| + |Ri| ez |du], (6.44) 
i.e., the absolute error |Au| can be replaced by |du| as a first approximation. It follows that du is a 
linear approximation for Au. 


6.2.2.2 Derivatives and Differentials of Higher Order 


1. Partial Derivatives of Second Order, Schwarz’s Exchange Theorem 
The second-order partial derivative of a function u — f(x1, v2, ..., vi, ..., En) can be calculated 
u Ou 


--π, -π»::. or with respect to another 
0x2' Ox 7 


with respect to the same variable as the first one was, i.e., 


ὅτι ὅτι ὅτι 
Oxi0z3' Oxg0r4' Org0zxi 
If at the considered point the mixed derivatives are continuous, then 
Ou ou 
ὃπιὃτο ~ Ox2021 








variable, i.e. ... In this second case one talks about mixed derivatives. 


(6.45) 
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holds for given xı and x» independently of the order of sequence of the differentiation (Schwarz's ez- 
change theorem). 

Ou Bu 

ðr?’ Oxrðy 
2. Second-Order Differential of a Function of One Variable y = f(x) 

The second-order differential of a function y = f(x) ofone variable, denoted by the symbols dy, d? f(x), 
is the differential of the first differential: d'y = d(dy) = f”(x)dx?. These symbols are appropriate only 
if x is an independent variable, and they are not appropriate if x is given, e.g., in the form x = z(v). Dif- 
ferentials of higher order are defined analogously. If the variables x1, £2, .. . , Zi, . . - , &n are themselves 
functions of other variables, then one gets more complicated formulas (see 6.2.4, p. 452). 

3. Total Differential of Second Order of a Function of Two Variables u — f(z, y) 


Partial derivatives of higher order such as, e.g., are defined analogously. 











ὅτι ὅτι ru 
@u = d(du) σης ο” H 21ὸι dx dy 4 ap dy? (6.46a) 
or symbolically 
a P 2 
u = (5 dx + Za) u. (6.46b) 
4. 'Total Differential of n-th Order of a Function of Two Variables 
d'u = (sae + xt) u. (6.47) 


5. Total Differential of n-th Order of a Function u = f(a1,2%2,...,%m) of m Vari- 
ables 


ὃ ὃ ὃ E 
d'u = (= dzı + ra dxo +... + δι dan) u. (6.48) 


6.2.2.3 Taylor's Theorem for Functions of Several Variables 


1. Taylor's Formula for Functions of Two Variables 


a) First Form of Representation: 























Of(z.y Of (ry 
ΠΟ (r—a)- G ) (y — b) 
ὃς \(a.y)=(a) Oy — l(za)- (a) 
1 (0? f(x, y) > 2F (x,y) 
+24 — Oo g—ay +222 z—aYvy-b 
ait ὂτΣ lea- ia ) andy need )(y — 8) 
P f(x, 1 1 
τ.» (- We zbopes t nb Re (6.492) 
Oy? — aa)- (a) 3! ni 
Here (a, b) is the center of expansion and Rp is the remainder. Sometimes one writes, e.g., instead of 
ofwy) the shorter expression of (Xo, Yo) - 
Or — lu) (roo) δε 





The terms of higher order in (6.49a) can be represented in a clear way with the help of operators: 


f) ο ο. 








(z,y)=(a,b) 


1 ð 832 
zc cuo (p Daa} f(a.y) 





(x,y)=(a,b) 
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+ Rn. (6.49b) 





1 3 
*tyl F FE Yenen t ub Tfl ων 


This symbolic form means that after using the binomial theorem the powers of the differential operators 





ὃ ; rr : SUN 
and δι represent the higher-order derivatives of the function f(x,y). Then the derivatives must 
y 


Ox 
be taken at the point (a, b). 
b) Second Form of the Representation: 

1/9, 9 τ. 
{αντ κ) -- Ταν) τη (5: h+ =) 145,9) τσ (5 h+ =) f(x,y) 


3! VOa ὃ n! Và: δι 


c) Remainder: The expression for the remainder is 


3 n 
m (s+ 5) Γον) στι στ : (z^ -2) f(x, y) + Rn. (6.496) 


R l 2 ia S um T Oh, y + Ok) (0<@<1) (6.49d) 
τη. 3 UN ; . 4 
(n 4-1)! V Ox Oy ` τ 
2. Taylor Formula for Functions of m Variables 
The analogous representation with differential operators is 
f(z-h, y +k,...,t+1) 








ὃ ὃ. Δ᾽ 
=f (Bites DIE x x +5) f(x,y,...,t) + Rn, (6.50a) 
where the remainder can be ices by the expression 
tL fo. 3 a 
Rn πι πα... f(x -Oh,y--Ok,...,t--Ol) 
(0 « O « 1). (6.50b) 


6.2.3 Rules of Differentiation for Functions of Several Variables 
6.2.3.1 Differentiation of Composite Functions 
1. Composite Function of One Independent Variable 








u— f(zi23,..,2.), T1 = 21E), x2 —cx»(£),..., Ln = ση(ξ) (6.51a) 
Ou Oudz, Ou dx Ou din 

ETE ; 51k 
ὃς ὅτι d£ + Ox dé ann, dé (6 5 j) 


2. Composite Function of Several Independent Variables 











u — f(z1,22,..., 0), 

&i-—un(e;m.;T)h à 4e Thes, Ba = tae NT) (6.52a) 
Ou — Ou Ox, , Ou σα Ou Orn 
0t Ox, OF | Oxy OF | | Oy OE’ 
Ou Ou Ox, , Ou Ox2 Ou Oxy 
dn Ox, On  ὅπδη Ox. δη) (6.52b) 
ðu _ Ou ‘Oa, Ou δα» l Qu Or, 














ðr Ox, OT | Oxy OF | Ox, ὅτ᾽ 
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6.2.3.2 Differentiation of Implicit Functions 
1. A Function y — f(x) of One Variable is given by the equation 


F(x,y) =0. (6.534 
Differentiating (6.53a) with respect to x with the help of (6.51b) one gets 
F, 
F, + Fyy' =0 (6.53b) and y'— -E (F, #0). (6.53c 
y 


Differentiation of (6.53b) yields in the same way 
Fea + 2Feyy' + Fyyly')? + Fy" = 0, (6.53d 
so considering (6.53b) one has 
y" c 2P.FVF. - GV Es = (Fo)? Eyy 





6.53 
ο ο. 
In an analogous way one can calculate the third derivative 
Fear + 3Frayy! + 3Feyy(y')? + Fyyy(y’)? + 3Feyy” + 3Fyyy'y” + Fyy” = 0, (6.53f 


from which y” can be expressed. 
2. A Function u — f(xi1,23,..., ti, ..., $4) of Several Variables is given by the equation 
πο 24,4) — 0. (6.54a 
The partial derivatives 
Ou Fy, Ou Pu Ou PF 
Ox, Ει᾽ Ότο E, n, T 
can be calculated similarly as it has been shown above but here the formulas (6.52b) are to be used. 
The higher-order derivatives can be calculated in the same way. 


3. Two Functions y = f(x) and z = (æ) of One Variable are given by the system o 
equations 








(6.54b 

















F(x,y,z)=0 and (x,y,z) =0. (6.55a 
Then differentiation of (6.55a) according to (6.51b) results in 
F, + Fy + Fiz =0, 6, +G,y + 6,2 =0, (6.55b 
P,&, — ®,F, F,®@, — Fy®, 
y px BEEN (6.55c 
Fo, —F4, Fj0, — F., 


The second derivatives y” and z” are calculated in the same way by differentiation of (6.55b) considering 
y’ and 2’. 
4. n Functions of One Variable Let the functions yy — f(r),ys — q(x),.... ya — v(x) be given 
by a system 

F(£,Y1,Y2,:--; Yn) =0, B(T, Y1, Y2,---,Yn) =0, .. M (m yii yas Yn) =0 (6.56a) 
of n equations. Differentiation of (6.56a) using (6.51b) results in 





PF, + Fy yj τ m ES, = 0, 
υπ (6.505) 

τ τ. Ε cB icd : =0, 

Wy Uy) yo ys — 0. 
Solving (6.56b) yields the derivatives yj, 5. ..., y, which are to be looking for. In the same way one 


can calculate the higher-order derivatives. 
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5. Two Functions u — f(z,y),v — q«(z,y) of Two Variables are given by the system of 
equations 

F(xr,y,u,v) 20 and dÓ(r,y,u,v) — 0. (6.57a) 
Then differentiation of (6.57a) with respect to x and y with the help of (6.52b) results in 




















OF OF Ou OF Ov 0 OF OFOu OF Ov 

ee ee e —— πω μμ 

Ór  OuOr Ov Ox ý ὃν  QuOy  OvOy . 

ðb bdu δδὂν Boh) 0b  0b0u , 000v (6.976) 

3z Ouüz  Ovür 7 ὃν OuOy v əy 

. Ou Ov Ou ðv . . 
Solving the system (6.57b) for —, — and the system (6.57c) for —,— give the first-order partial 
Ox’ Ox Oy’ Oy 


derivatives. The higher-order derivatives should be calculated in the same way. 
6. n Functions of m Variables Given by a System of n Equations The first-order and 
higher-order partial derivatives can be calculated in the same way as in the previous cases. 


6.2.4 Substitution of Variables in Differential Expressions and 


Coordinate Transformations 
6.2.4.1 Function of One Variable 


Suppose, given a function y(x) and a differential expression F containing the independent variable, the 
function, and its derivatives: 


dy y dı 
y = f(x), (6.584) F=F (« n τ 2 πο 2 ) (6.58b) 


If the variables are substituted, then the derivatives can be calculated in the following way: 
Case la: The variable x is replaced by the variable t, and they have the relation 











x = g(t). (6.59a 
Then holds 
dy 1 dy dy ι DU HPUGT 
dr —q(t)dt' da? [p(t] 0 dp * (0 dt χ᾽ .. 
d*y 1 / 2 dy / n d*y n 2 / In dy 
— = ρ (ἑ}}--5- t t)— t= t E ETE .99c 
Te = pape Po a 50 oct eO eo e tim) .... Gove 


Case 1b: If the relation between the variables is not explicit but it is given in implicit form 
(x,t) = 0, (6.60 
onu UU Py dy eee m 
then the derivatives di^ du quse calculated by the same formulas, but the derivatives p’ (t), y” (t), 
z^ dr x 
q" (t) must be calculated according to the rules for implicit functions. In this case it can happen tha 
the relation (6.58b) contains the variable x. To eliminate x, the relation (6.60) is used. 


Case 2: If the function y is replaced by a function u(x), and the relation between them is 








y — e(u), (6.61α 
then the calculation of the derivatives can be performed using the following formulas: 
dy , du dy E Pu i duy? . 
dr © (u) de? dx © (u) Tz Fé pu drj ’ om 
dy ,, .@u n du Pu " duV? 
2 = — — Ἔκ —]|,.... 61 
di ^ us vee uo da? T M) dr] ' Gls 
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Case 3: The variables x and y are replaced by the new variables t and u, and the relations between 
them are given by 














r—g(tu), y-w(tu) (6.622. 
For the calculation of the derivatives the following formulas are used: 
Ov | Op du 
dy | Qt  Qudt : 
de Op | Opdu' (6.62b 
Ot Ou dt 
dy | d (dy\ _ d | Bt " Budt| .. 1 d "9t ' Qu dt (6.62c 
dz? dx \dx dr |O , Op du Op , Apdu dt | Op Ov du |’ ` 
Ot ' ðu dt Ot ' Ou dt Ot  Oudt 
ld/A 1 dA dB 
5:3) 5 (e - AG) (6.624 
. . Ob Ov du |. Op | Opdu . 
with A= E + Bu dt (6.6209) and B— BE + Oud (6.62 


3, 
The determination of the third derivative qu can be done in an analogous way. 
gr 
Wi For the transformation from Cartesian coordinates into polar coordinates according to 
X -—pcosp, y=psing (6.634) 
the first and second derivatives should be calculated as follows: 
dy _ p'sing + pcosp 


dy _ p? 4- 2p? — pp" 
dr  ffcosp— psing ' 


.63b ' 
(5585) dx? — (p'cosq — psinq)? 


(6.63c) 


6.2.4.2 Function of Two Variables 


Suppose given a function w(x, y) and a differential expression F containing the independent variables, 
the function and its partial derivatives: 


2 2 2 
ðw ðw w Aw Aw ) (6.64b) 


w= f(x,y), (6.644) BaP (yun ge E a e 


If x and y are replaced by the new variables u and v given by the relations 
r= (u,v), y— (uv), (6.652) 
then the first-order partial derivatives can be expressed from the system of equations 


Ow  OwOp , δωδύψ ðw  OwOp ,OwOv 

















t ; Ξ ! 6.65} 
Ou  OrOu  OyOu' Ov  OrOv Oy Ov ( ) 
with the new functions A, B, C, and D of the new variables u and v 
Ow Ow Ow Ow Ow Ow 
=A B =C + D—. 6.65 
Ox 8u Bv! Oy Ou Ov (6:650) 
The second-order partial derivatives are calculated with the same formulas, only without using w in 
ὃ 
them but its partial derivatives = ὡς and —, e.g., 
Ox Oy 


Ow ὃ (=) ὃ ( Ow x) ^ (A55 Pw | 0A80w, aar] 


Ox? | Ox VOx Ox Ou Ov Ou? ðuðv ðu ðu du dv 
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Audo P a? ' ðv ðu ðv zit (6.66) 


The higher partial derivatives can be calculated in the same way. 


zi Pw Pw , 940v , 0BOw 


W The Laplace operator (see 13.2.6.5, p. 716) is to be expressed in polar coordinates (see 3.5.2.1, 2., 
p. 191): 














2 32 
Aw = = + au ; (6.672) Y —pcosQ, y-— psing. (6.67b) 
The calculations are: 
Ow Ow - ὃω á Ow Ow, " Ow e 
= 5 sin ç = sin "AS S 
Op Ox m ὃν a Op ag? in? ay? ws 
Ow Ow  sinpgOÓw Ow . ðw cosp dw 
COS ¢ — Sin Q— 4 ] 
Ox Ad" p Og ὃν oP Op p Og 
Pw eee ὃ της ðw sing Ow sing à e ðw  sinpOw 
— cos S sy- ‘ 
Ox? 79 5 Op p Oo p Op T Op p Op 
2w 
Similarly, —7 is calculated, so finally: 
Oy? 
2 18w 1 
Aga es Oe p A (6.67c) 


Op pg pp 
Remark: If functions of more than two variables should be substituted, then similar substitution 
formulas can be derived. 


6.2.5 Extreme Values of Functions of Several Variables 


6.2.5.1 Definition of a Relative Extreme Value 


A function u — f(x1,22,..., vj, ..., 14) hasa relative extreme value at a point Py(x19, 129, . .. , tio; ..., 
£no), if there is a number e such that for every point P(x1, 12, ..., 4) belonging to the domain z19— € « 
£1 < £10 +€, T20 — € < T2 < T20 + €,- .., Eno — E€ < Tn < Tno + € and to the domain of the function but 
different from Po, then for a maximum the inequality 


f(t, X3,... E < f (210, £20, ee 20) (6.68a) 
holds, and for a minimum the inequality 
f(1,22,...,2n) > f (X10, $20, ..., no) (6.68b) 


holds. Using the terminology of several dimensional spaces (see 2.18.1, p. 118) a function has a relative 
maximum or a relative minimum at a point if it is greater or smaller there than at the neighboring 
points. 


6.2.5.2 Geometric Representation 


In the case of a function of two variables, represented in a Cartesian coordinate system as a surface 
(see 2.18.1.2, p. 119), the relative extreme value geometrically means that the applicate (see 3.5.3.1, 3., 
p. 210) of the surface in the point A is greater or smaller than the applicate of the surface in any other 
point in a sufficiently small neighborhood of A (Fig. 6.16). 

If the surface has a relative extremum at the point P, which is an interior point of its domain, and 
if the surface has a tangent plane at this point, then the tangent plane is parallel to the x,y plane 
(Fig. 6.16a,b). This property is necessary but not sufficient for a maximum or minimum at a point 
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Figure 6.16 


Py. For example Fig. 6.16c shows a surface having a horizontal tangent plane at Po, but there is a 
saddle point here and not an extremum. 


6.2.5.3 Determination of Extreme Values of Differentiable Functions of 
Two Variables 


If u — f(x,y) is given, then one solves the system of equations f, = 0, fy = 0. The resulting pairs of 





values (21, y1), (12, ya), ... can be substituted into the second derivatives 
Of 03] Of 
Ao. pics τν. 6.69 
Ox?" που ᾽ Oy? (6:69) 
Depending on the expression 
AB : 
A= | Ρο AC — BP = [fox fy — (fay) le i-us (i— 1,2...) (6.70) 


it can be decided whether an extreme value exists and of what kind it is: 


1. In the case A > 0 the function f(r, y) has an extreme value at (z;, y;), and for f, « 0 it isa 
maximum, for fes > 0 it is a minimum (sufficient condition). 


2. In the case A < 0 the function f(x,y) does not have an extremum. 
3. In the case A = 0, one needs further investigation. 


6.2.5.4 Determination of the Extreme Values of a Function of n Variables 


If u = f (x1, %2, ..., 4) is given, then first it is to find a solution (x10, 720, ... , a0) of the system of the 
n equations 
Ja = 0; fon = 0 iren = 0; (6.71) 
because it is a necessary condition for an extreme value. (6.71) is not a sufficient condition. Therefore 
Pf 


Then it is to 





one prepares a matrix of the second-order partial derivatives such that aj; = Bons 
πιό] 

substitute a solution of the system of equations (6.71) into the terms, and to prepare the sequence of 

left upper subdeterminants (a11, 411422 — 12421, ...). Then there are the following cases: 


1. The signs of the subdeterminants follow the rule —,+,—,+, ..., then there is a maximum there. 








2. The signs of the subdeterminants follow the rule +,+,+,+, ..., then there is a minimum there. 

3. There are some zero values among the subdeterminants, but the signs of the non-zero subdetermi- 
nants coincide with the signs of the corresponding positions of one of the first two cases. Then fur- 
ther investigation is required: Usually one checks the values of the function in a close neighborhood of 
10, £20,--+5Uno- 

4. The signs of the subdeterminants does not follow the rules given in cases 1. and 2.: There is no 
extremum at that point. 
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The case of two variables is of course a special case of the case of n variables, (see [6.4]). 


6.2.5.5 Solution of Approximation Problems 


Several different approximation problems can be solved with the help of the determination of the ex- 
treme values of functions of several variables, e.g., fitting problems or mean squares problems. 


Problems to solve: 

e Determination of Fourier coefficients (see 7.4.1.2, p. 475, 19.6.4.1, p. 992). 

e Determination of the coefficients and parameters of approximation functions (see 19.6.2, p. 984 ff). 

e Determination of an approximate solution of an overdetermined linear system of equations (see 19.2.1.3, 
p. 958). 

Methods: For these problems the following methods are used: 

e Gaussian least squares method (see, e.g., 19.6.2, p. 984). 

e Least squares method (see 19.6.2.2, p. 985). 

e Approximation in mean square (continuous and discrete) (see, e.g., 19.6.2, p.984). 

e Calculus of observations (or fitting) (sce 19.6.2, p. 984) and regression (sce 16.3.4.2, 1., p. 841). 


6.2.5.6 Extreme Value Problem with Side Conditions 


To determine are the extreme values of a function u = f(z1,29,..., 2,4) of n variables with the side 
conditions 
Q(21, £3,..., 24) — 0, (x1, 22,..., 24) —0,... , X(z1,92,..., 24) — 0. (6.72a) 


Because of the conditions, the variables are not independent, and if the number of conditions is k, 
obviously k « n must hold. One possibility to determine the extreme values of u is to express k variables 
with the others from the system of equations of the conditions, to substitute them into the original 
function. Then the result is an extreme value problem without conditions for n— k variables. The other 
way is the Lagrange multiplier method. Introducing k undefined multipliers A, u,..., &, and composing 
the Lagrange function (Lagrangian) of n -- k variables z1, 23, ... 34, À Jt ..., & gives: 

D (Ri Coy acs Bigs Ap pees Be) 

= f (£1, £2, .. . , En) + À (£1, T2, . - -, En) - μψί(σι, 20,..., η) ἠ---: 

-κχίαι, 2, ..., t5). (6.72b) 
The necessary condition for an extremum of the function & is a system of n + k equations (6.71) with 
the unknowns 21, %2,.-.,%p,A, ly... ΑΝ: 

yp=0,~=0, ..., x=0, &, =0, &, =0,... ,O, =0 (6.72c) 
The necessary condition for an extremum of the function f at the point Po(x10, 29, .. ., 49) with the 
side conditions (6.72a) is that the system of values x19, 129, . .., 49 must fulfill the equations (6.72c). 
So it is to look for the extremum points of f among the solutions x49, 129, . .. , 49 of the system of 
equations (6.72c). To determine whether there are really extreme values at these points fulfilling the 
necessary conditions requires further investigations, for which the general rules are fairly complicated. 
Usually one uses some appropriate and individual calculations depending on the function f to prove if 
there is an extremum, or not. Often approximation calculations are helpful, i.e., the comparison with 
values of the function in the neighborhood of Pp. 
W The extreme value of the function u = f(x, y) with the side condition y(, y) = 0 will be determined 
from the three equations 


ὃ ὃ 
olay) =0, a flay) + Ava] =0, τ. ἰδία, ν) Ελα, υ)] -- 0. (6.73) 
There are three unknowns, x,y, 4. Since the three equations (6.73) are only necessary but not suffi- 
cient conditions for the existence of an extremum, further investigation is needed whether there is an 
extremum at the solution of this system. A mathematical criterion is rather complicated (see [6.3], 
[6.8]); comparisons of the values of the function at points in the close neighborhood are often useful. 





7 Infinite Series 


7.1 Sequences of Numbers 


7.1.1 Properties of Sequences of Numbers 


7.1.1.1 Definition of Sequence of Numbers 

An infinite sequence of numbers (hereafter briefly sequence of numbers) is an infinite system of numbers 
@1,42,.-.,4n,... Or briefly {an} with n = 1,2,..., (7.1) 

arranged in a given order. The numbers of the sequence of numbers are called the terms of the sequence. 

Among the terms of a sequence of numbers the same numbers can occur several times. A sequence is 

considered to be defined if the law of formation, i.e., a rule is given, by which any term of the sequence 

can be uniquely determined. Mostly there is a formula for the general term ay. 

Examples of Sequences of Numbers: 

















BA: a, =n: 1,2,3,4,5,.... MB: a, =4+3(n-—1): 4,7, 10,13, 16,.... 
δε, 353 3 3 
Qia-3[(—— Due ts E Mas D:ds-—(Cpeu qo qp... 
a C: o -3(75) ρα απο M Dici 
1 1.3,7 3 11 
2 dn = 3— e ad 2- — —). 
BE: a, =3 $e 052,32 3i (read 27 P 
1 1 —n-l 
BF: a,= 33 5 10 2 forodd n and 
1 2 πο 
απ = 33 tz 10 2 for even n: 3; 4; 3.3; 3.4; 3.33; 3.34; 3.333; 3.3345; .... 
1 1111 
4 mE EE. uer Men a = ntin = E 
EG: an= L PTE E H: o, — (-1) : 1, 22,3, 4,5, —6,. 
,+1 
H I: a, = E E for odd n and a, = 0 for even n: —1,0, —2,0, —3,0,—4,0,.... 
1 1.3 ..3 
W J: a, — 3— ———, for odd n and an = 13— z for even n: 1, 11,2, 12, 2l. 12-,2—,12- 
225 23 — 29) 4" 
7.1.1.2. Monotone Sequences of Numbers 
A sequence of numbers ay, @2,...,@n,... is monotone increasing if 
dj €a5€a3 €: Xa S, (7.2) 
is valid and it is monotone decreasing if 
@ 2 dg > 3 26+ 2 An 2°: (7.3) 


is valid. One talks about a strictly monotone increasing sequence of numbers or strictly monotone de- 
creasing sequence of numbers, if equality never holds in (7.2) or (7.3). 
Examples of Monotone Sequences of Numbers: 


W A: Among the sequences of numbers from A to J the sequences A, B, E are strictly monotone in- 
creasing. 


W B: The sequence of numbers G is strictly monotone decreasing. 
7.1.1.3. Bounded Sequences of Numbers 
A sequence of numbers is called bounded if for all terms 


las] « K (7.4) 
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is valid for a certain K > 0. If such a K (bound) does not exist, then the sequence of numbers is 
unbounded. 


W Among the sequences of numbers from A to J the sequences of numbers C with K = 4, D with 
K —2, E with K —3, F with K = 5, G with K — 2 and J with K = 13 are bounded. 


7.1.2 Limits of Sequences of Numbers 


1. Limit of a Sequence of Numbers 
An infinite sequence of numbers (7.1) has a limit A if for an unlimited increase of the index n the 
difference a, — A becomes arbitrarily small. Precisely defined this means: For an arbitrarily small € > 
0 there exists an index no(¢) such that for every n > no 

lan — Al <e. (7.5a) 
The sequence of numbers has the limit +-oo (—oo), if for arbitrary K > 0 there exists an index no(K’) 
such that for every n > no 

an>K (an <—K). (7.5b) 


2. Convergence of a Sequence of Numbers 

If a sequence of numbers (a; ) satisfies (7.5a), then one says it converges to A. This is denoted by 
lim a, — A or a, A. (7.6) 
n—-co 


W Among the sequences of numbers from A to J on the previous page, C with A = 0, E with A = 3, 
F with A= 3i , G with A = 0 are convergent. 


3. Divergence of a Sequence of Numbers 

Non-convergent sequences of numbers are called divergent. One talks about proper divergence in the 
case of (7.5b), i.e., if as n exceeds any value, a, exceeds any large positive number K (K > 0) so that 
it never goes below, or if as n exceeds any value, a, goes below any negative number — K (K > 0) with 
arbitrarily large magnitude and never increases above it, i-e., if it has the limit -too. Then one writes: 





lim Gz = 00 (dn > K Yn 7 ng) or lim a,— —oo (a, « —K Vn > ng). (7.7) 
n—oo n—-—oo 


Otherwise the sequence of numbers is called improperly divergent. 
Examples of Divergent Sequences of Numbers: 
W A: Among the sequences of numbers from A to J on the previous page, A and B tend +00, they 
are properly divergent. 
W B: Among the sequences of numbers D is improperly divergent. 
4. Theorems for Limits of Sequences of Numbers 
a) If the sequences of numbers (a) and (b) are convergent, then 
dri) o Re scr lim bas (7-8) rig (anbn) = (hima, an)( him, bn) (9) 


hold, and if bp, 4 0 for every n, and jim, bn # 0, then holds 


x 





i : (7.10) 

n—oo b, Jim. b, 

Remark: If lim bj = 0 and {an} is bounded, then jim (nbn) = 0 even if {an} does not have any 

finite limit. 

b) If lim a4 — lim b, = A is valid and at least, from an index n; and beyond, the inequality an < 
noo noo 

Cn < bp holds, then also holds 
lim cn = A. (7.11) 
noo 
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c) A monotone and bounded sequence of numbers has a finite limit. If a monotone increasing sequence 
of numbers ay < ag < ag < ... is bounded above, i-e., a, < Ky for all n, then it is convergent, and 
its limit is equal to its least upper bound which is the smallest possible value for Ky. If a monotone 
decreasing sequence of numbers a, > ag > a3 > ... is bounded below, i.e., a4, 7 K» for all n, then it is 
convergent, and its limit is equal to its greatest lower bound which is the largest possible value for K3. 


7.2 Number Series 

7.2.1 General Convergence Theorems 

7.2.1.1 Convergence and Divergence of Infinite Series 

1. Infinite Series and its Sum 

From the terms a, of an infinite sequence of numbers {a;} (see 7.1.1.1, p. 457) the formal expression 


a) +ag+++++an +++: => ax (7.12) 
k=1 


can be composed and this is called an infinite series (hereafter briefly series); a, is the general term of 
the series. The finite sums 


Me 


Si =a, S2 = a1 + Q2, ο ος ak (7.13) 


k 


1 
are called partial sums. 

2. Convergent and Divergent Series 

A series (7.12) is called convergent if the sequence of partial sums {Sn} is convergent. The limit 


S= lim S=) a (7.14) 


k=1 





is called the sum of the series. If the limit (7.14) does not exist or it is equal to +00, then the series is 
called divergent. In this case the partial sums are not bounded or they oscillate. So the determination 
of the convergence of an infinite series is reduced to the determination of the limit of a sequence {Sp}. 
E A: The geometric series (see 1.2.3, p. 19) 
dr edic e d 1 

ipod oq queue dese (7.15) 
is convergent with the sum S — 2 (see (1.54b), p. 19) with ag — 1,q — 1/2). 
B B: The harmonic series (7.16) and the series (7.17) and (7.18) 


1 1 1 
bega miik Ee (7.16) ο... (7.17) 
1—1-41----(-1?3 4. (7.18) 


are divergent. For the series (7.16) and (7.17) lim 5, — oo holds, (7.18) oscillates. 

3. Remainder - 

The remainder of a convergent series S = 5 dy is the difference between its sum $ and the partial sum 
k=1 


Sn: 





R,=8 = Σ Ak = An41 t An+2 EeswR (7.19) 


k=n+1 
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7.2.1.2. General Theorems about the Convergence of Series 
1. Necessary Criterion for the Convergence of a Series The sequence of terms of a convergent 
series is a null sequence: 


jim dn = 0. (7.20) 


This is only a necessary but not sufficient condition. 
W For the harmonic series (7.16) lim a, = 0 holds, but lim 5, = oo. 
noo n—oo 
2. Leaving out the Initial Terms If one leaves out finitely many initial terms of a series or one 
introduces finitely many further terms into it at the begin or if changing the order of finitely many 


terms, then its convergence behavior does not change. Exchange of the order of finitely many terms 
does not affect the sum if it exists. 


3. Multiplication of all Terms If all terms of a convergent series are multiplied by the same factor 
c, then the convergence of the series does not change; its sum is multiplied by the factor c. 


4. Termwise Addition or Subtraction By adding or subtracting two convergent series 
oo oo 
a taz+ às de γαι -- δι, (7.21a) bb bb e YO de = SQ (7.21b) 
k=1 
term by term, then the result is a convergent series, and its sum is 


(αι bı) + (az b2) Ted (an ΞΕ bn) H- = S1 E S2. (1.219) 




















7.2.2 Convergence Criteria for Series with Positive Terms 


7.2.2.1 Comparison Criterion 


Suppose there are two series 
ay + ag +++ + Gn +251 = απ (122a) by + bg te + bp Fe = Dn (7.22b) 


with only positive terms (a; > 0, b, 7 0). If a4 > by holds from a certain no, then the convergence of 
the series (7.22a) yields the convergence of the series (7.22b), and the divergence of the series (7.22b) 
yields the divergence of the series (7.22a). In the first case (7.22a) is called a convergent majorant and 
in the second case (7.22b) a divergent minorant . 

E A: Comparing the terms of the series 


1 1 1 
ἔπ... 7.234 
3 = n^ ( 
with the geometric series (7.15), the convergence of the series (7.23a) follows. From n = 2 the terms of 
the series (7.23a) are smaller than the terms of the convergent series (7.15): 


1 1 





nm gn (n 2 2). (7.23b) 
W B: From the comparison of the terms of the series 
1 1 1 
Lp te a t+ pet (7.24a) 


γα ws vn 


with the terms of the harmonic series (7.16) follows the divergence of the series (7.24a). For n > 1 the 
terms of the series (7.24a) are greater than those of the divergent series (7.16): 

P opm Ax) (7.24) 
—»-— (n ; .24b 
yn n 
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7.2.2.2 D'Alembert's Ratio Test 


If for the series 





oo 
a, tag +++ +Gn +++: = Yo an (7.25a) 
n=1 
all the ratios os are smaller than a number q < 1 from a certain ng onwards, then the series is 
convergent: . 
San, <q<1 (n> no). (7.25b) 
n 


If these ratios are greater than a number Q > 1 from a certain no onwards, then the series is divergent. 
From the two previous statements it follows that if the limit 


Qn+1 


lim 7€ =p (7.25¢ 





exists, then for p < 1 the series is convergent, for p > 1 it is divergent. In the case p = 1 the ratio tes 
gives no information whether the series is convergent or not. 














1. 2 

W A: The series atat t (7.26a 

is convergent, because 1 

14- 

Qof(ntlo nY , 4o . 
p= lim (Zr: E) = lim 2 = 5 <1 holds. (7.26b 
4 1 
E B: For the series ο ιβ (7.27a 
4 9 n? 


the ratio test does not give any result about whether the series is convergent or not, because 


] n+2 n+l 
p= lim (Z4 : ux] =i (7.27b 





7.2.2.3 Root Test of Cauchy 


If for a series 
ay + ag ++++ + On +++ = ᾱπ (7.28a) 
n=1 


from a certain ng onwards for every value v/a;, 


Va, «q«1 (7.28b) 
holds, then the series is convergent. If from a certain no every value ϕ/αμ is greater than a number Q 
where Q > 1 holds, then the series is divergent. 

From the previous statements it follows that if 


Jim, Van = p (7.28c) 


exists, in the case p « 1 the series is convergent, in the case p > 1 it is divergent. For p = 1 with this 
test one cannot tell anything about the convergence behavior of the series. 


usas ss didt: gsm n vU 
E The series 3+6) +(3) ee (AX) Tee (7.29a) 
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is convergent because 


: n? 1 1 
p= lim. (=) = lim |—| == <1. holds. (7.29b) 


1+=- 
n 





7.2.2.4 Integral Test of Cauchy 


1. Convergence Ifa series has the general term a, = f(n), and f(x) is a monotone decreasing 
function such that the improper integral 


EO dx (see 8.2.3.2, 1., p. 507) (7.30) 


exists (it is convergent), then the series is convergent. 

2. Divergence If the above integral (7.30) is divergent, then the series with the general term a, = 
f (n) is divergent, too. 

The lower limit c of the integral is almost arbitrary but it must be chosen so that for c < x < oo the 
function f(x) should be monotone decreasing. 

E The series (7.27a) is divergent because 








21-51 vegz-1 
m / x? 


dx = LE - | = 00. (7.31) 


tle 


f(a) = 


2 


7.2.8 Absolute and Conditional Convergence 


7.2.3.1 Definition 


Along with the series (7.12) whose terms can have different signs, one considers also the series 
oo 
[as | 4- [ao] - --- 4 lan| +--+ = X lanl, (7.32) 
n=1 


whose terms are the absolute values of the terms of the original sequence (7.12). If the series (7.32) 
is convergent, then the original one (7.12) is convergent, too. (This statement is valid also for series 
with complex terms.) In this case, the series (7.12) is called absolutely convergent. If the series (7.32) 
is divergent, then the series (7.12) can be either divergent or convergent. In the second case, the series 
(7.12) is called conditionally convergent. 


sina sin2a@ sinna 
m Ut gm 


where ais an arbitrarily constant number, is absolutely convergent, because the series of absolute values 


E A: The series 











ss (7.33a) 








is convergent. This is obvious by comparing it with the geometric series 





with the general term 











(7.15): 
sinna 1 
< —. á 
m |S J (7.33b) 
: T. ul 
B B: Theseries 1— 24-2 —--4-(-1) +- (7.34) 
2 3 n 
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is conditionally convergent, because it is convergent according to (7.36b), and the series made of the 


: ; : ' Qd 
absolute values of the terms is the divergent harmonic series (7.16) whose general term is — — |a; |. 
n 


7.2.3.2 Properties of Absolutely Convergent Series 


1. Exchange of Terms 

a) The terms of an absolutely convergent series can be exchanged with each other arbitrarily (even 
infinitely many of them) and the sum does not change. 

b) Exchanging an infinite number of terms of a conditionally convergent series can change the sum and 
even the convergence behavior. Theorem of Riemann: The terms of a conditionally convergent series 
can be rearranged so that the sum will be equal to any given value, even to coo. 

2. Addition and Subtraction 

Absolutely convergent series can be added and subtracted term-by-term; the result is absolutely con- 
vergent. 

3. Multiplication 

Multiplying a sum by a sum, the result is a sum of the products where every term of the first factor 
is multiplied by every term of the second one. These two-term products can be arranged in different 
ways. 'The most common way for this arrangement is made as if the series were power series, i.e.: 


(ay + dg ++++ +Gn + +++)(by ο e Bo e) 








= aby + dah) + aybg + agby + dgbe + ayb3 ++ ++ + anbi + an-1b2 +++- + iby +--+. (7.35a) 
s A a SS 
If two series are absolutely convergent, then their product is absolutely convergent, so it has the same 
sum in any arrangement. If Σ Qn = Sq and 5 bn = Sp hold, then the sum of the product is 
$ — S,5, (7.35b) 
a oo oo 
If two series ay + ag + +++ +@n +++: = YX an and bi + bg +--+ +b, +--+ = Y b, are convergent, and 
n=1 n=1 


E 
go 


at least one of them is absolutely convergent, then their product is also convergent, but not necessarily 


absolutely convergent. 
7.2.3.3 Alternating Series 
1. Leibniz Alternating Series Test (Theorem of Leibniz) 
For an alternating series 
ay — dg + a3 —++- +a, +: , (7.36a) 
where a, are positive numbers, a sufficient condition of convergence is if the following two relations 


hold: 
1. im an = 0 and 2. a, >a, >a3>°--:>a,>-°-. (7.36b) 








M@ The series (7.34) is convergent because of this criterion. 

2. Estimation of the Remainder of an Alternating Series 

Considering the first n terms of an alternating series, then the remainder R,, has the same sign as the 
first omitted term a,41, and the absolute value of R, is smaller than |an41|: 


sign Ry, —sign(a,41) with R,—$-— Sn, (7.37a) [S — S4| « Jana]. (7.37b) 
W Considering the series 
1.1 1 1 ; 3 1 
l—-z-c-z--.-c4eck-ces-dn2 (738a) theremainderis |ln2 — $,| « ——-.  (7.38b) 








2 3 4 n n+1 
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7.2.4 Some Special Series 
7.2.4.1 The Values of Some Important Number Series 













































































ΤῊ (Τ.39 
! pto i esi (7:40 
i ᾿ ᾿ ἔς; (7.41 
1+5 its 2 =2, (7.42 
exped + OF ped, (7.43 
fee ee a it a. ae (7.44 

3 5 7 9 2n —1 4 
DgtggtgqtU σον πι (7.45 
πα τας τση ο απ οι eee 
tees tee i Mae 
LI τῇ mt t4cpacp 3 F (7.48 
πα v Ππτηπτη Td I 
ΓΣ ascen aS ee a 
ee (7.51 
i ο : ΕΟ, 9: (7.52 

2 

ΠΤ ΒΕ gU. (7.53 
ltgtatgtotat -- (Τ.54 

T z 
F i x i x ae τ (7.56 





7.8 Number Series 465 























1 1 1 1 q2592k-1 

Ic 22k 32k ate gpk TT xk πάτο (2k)! By, (7.57) 
1 1 1 (0η ο - 

1 Jk gx κ Τ᾽ ae n (2k)! Br, (7.58) 
1 1 1 1 n (22 — 1) 

1 | Rd bes = Bg, 7.59 

tg t gm t uk (n — 19 2-() * d 

1 1 1 (0η πλέει 
1 32k+1 ! 52kk1 — T2kdl πο. (2n — 1)2k41 Te gei I (7.60) 


7.2.4.2 Bernoulli and Euler Numbers 


1. First Definition of the Bernoulli Numbers The Bernoulli numbers Bj occur in the power 
series expansion of some special functions, e.g., in the trigonometric functions tan x, cot x and csc x, 
also in the hyperbolic functions tanh x, coth z, and cosech x. The Bernoulli numbers Bx can be defined 
as follows 
2 4 2n 
x - q x € 
1 
=1 + By Bo +---+(-1)""'B 4 
e? —] 2 2! 4] ( " (2n)! 





Ee (|z| < 27) (7.61) 























and they can be calculated by the coefficient comparison method with respect to the powers of x. The 
first in this way calculated values are given in Table 7.1. 
Table 7.1 The first Bernoulli numbers 
1 1 7 174611 
T = 4 = = 1 
6 30 f 6 y 330 
1 5 3617 854513 
2 = = 11 
30 66 510 138 
1 691 43 867 
42 τ 2730 i 798 






































2. Second Definition of Bernoulli Numbers Some authors define the Bernoulli numbers in the 
following way: 
2 2n 








zr — zr στ T 
aa 1+ Bi + B2 7 bed Ban yi Eee (απ « 27). (1.62) 
So one gets the recursive formula 
Ῥει-(Β 11;  (k=1,2,3,...), (7.63) 


where after the application of the binomial theorem (see 1.1.6.4, 1., p. 12) it is to replace B” by B,, 
i.e., the exponent becomes the index. The first few numbers are: 











o 1 e 1 n jx ἢ 
Boa]. Bee Bee. . Base, 
t Οἱ ΓΈΡΑ gor πο 49 
= 1 — 5 Q4 60 . 7 
B Bee Bu = Bu e 7.64 
8 30° 10 66) 12 2730’ 14 6 ( ) 





* By are the Bernoulli numbers 
ΤΕΙ are the Euler numbers 
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Big ————,..., B; = B; = By =-:-=0. 
16 510" 3 5 7 
The following relation is valid: 
By =(-1)"*"By (k= 1,2,3,...). (7.65) 


3. First Definition of Euler Numbers The Euler numbers Ex occur in the power series expansion 
of some special functions, e.g., in the functions sec z and sech x. The EULER numbers Ex can be defined 


as follows 
2 4 2n 


x z x 

2300 r | Pet 
secr = 1 + E at Ez 1 FE, Gn)! 
and they can be calculated by coefficient comparison with respect to the powers of x. Their values are 
given in Table 7.2. 
4. Second Definition of Euler Numbers Analogously to (7.63) the Euler numbers can be defined 
with the recursive formula 

(E-1*-(E-1*-0  (k-12,3,..) (7.67) 
where after the application of the binomial theorem it is to replace E” by E,. For the first values one 
gets: 





bee (lal < 5) (7.66) 





Fy=—-1, Ey=5, EBe=—6l, Eg = 1385, 





Eio —50521, Ei = 2702765, Επι -- --199 560981, (7.68) 
Eye = 19391512 145,..., Ei = E = Es =---=0. 

The following relation is valid: 
E; = (—1) Ex = (k = 1,2,3,...). (7.69) 


Table 7.2 First Euler numbers 























k Ek k Ek 

1 1 5 50521 
2 5 6 2702 765 
3 61 7 199 360 981 
4 1385 





5. Relation Between the Euler and Bernoulli Numbers The relation between the Euler and 
Bernoulli numbers is: 


ght y OH 
a= (B: = 1) (k —1,2,...). (7.70) 
7.2.5 Estimation ofthe Remainder 

7.2.5.1 Estimation with Majorant 


In order to determine how well the n-th partial sum approximates the sum of a convergent series, the 
absolute value of the remainder 


X ak S x [az] (7.71) 








[5 — Snl = μι] ES 
k=n+1 k=n+1 
οο oo 
of the series Y ^ aj must be estimated. For this estimation one uses a majorantfor Y^ |a,|, usually a 
k=1 k=n+1 


geometric series or another series which is easy to sum or to estimate. 
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oo 

















α 
WB Estimate the remainder of the series e — V7 —. For the ratio m+ of two subsequent terms of 
Zn am 
] ] : Qm+1 m! 1 1 . 
this series with m > n + 1 holds: = < q < 1. So the remainder 
αι (m+1)! m-c-1^ n-2 
1 1 1 
Rn = + ——— + —— +.--- can be majorized by the geometric series (7.15) with the 


(31511. (12)  (n +3)! 














1 
quotient q — a and with the initial term a = (n3 i - E , and it yields: 
α 1 n42 1 n+2 1 
Ra < = < : 7.72 
1-ᾳ (n+1)!n+1 — nln?+2n  n-nl oe 





7.2.5.2 Alternating Convergent Series 


For a convergent alternating series, whose terms tend to zero with monotone decreasing absolute values, 
the easy estimation for the remainder is (see 7.2.3.3, 1., p. 463): 


[Ft] 5 |8 — S4] αφ] (7.73) 
7.2.5.3 Special Series 


For some special series, e.g., Taylor series, there are special formulas to estimate the remainder (see 


7.3.3.3, p. 471). 


7.3 Function Series 
7.3.1 Definitions 


1. Function Series is a series whose terms are functions of the same variable x: 

Λία) - fa(z) - [ι(α - -Σ Λι(α). (7.74) 
2. Partial Sum 5S,(z) is the sum of the first n terms of the series (7.74): 

Salat) = fal) + fale) e+ + fala =X fale (7.75) 


3. Domain of Convergence ofa function series (7.74) is the set of values of « = a for which all 
the functions f(x) are defined and the series of constant terms 


fila) + fala) +: + fala) +- -Σ fala (7.76) 


is convergent, i.e., for which the limit of the partial sums S,,(a) exists: 
Jim S4( = lim > fela (7.77) 
= 


4. The Sum of the Series (7.74) is the function S(x), and one says that the series converges to the 
function S(x). The values for x = a are the points of convergence. 

5. Remainder R,(«) is the difference between the sum S(x) of a convergent function series and its 
partial sum Sp (x): 


Ry (0) — S(z) = Sale) = fasil) + fatal) +++ H faima) H (7.78) 
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7.3.2 Uniform Convergence 
7.3.2.1 Definition, Weierstrass Theorem 


According to the definition of the limit of a sequence of numbers (see 7.1.2, p. 458 and 7.2.1.1, 2., 
p. 459) the series (7.74) converges for all values x of a domain to S(x) if for an arbitrary & 7 0 there is 
an integer N(z) such that |S(x) — S,(x)| « £ holds for every n > N(x). For function series there are 
to be distinguished between two cases: 
1. Uniformly Convergent Series 
Tf there is a number N such that for every x in the domain of convergence of the series (7.74), |S(x) — 
Sn (x)| < £ holds for every n > N, then the series is called uniformly convergent on this domain. 
2. Non-Uniform Convergence of Series 
If there is no such number N which holds for every value of z in the domain of convergence, i.e., there are 
such values of € for which there is at least one x in the domain of convergence such that |S (a) —5, («)| > € 
holds for arbitrarily large values of n, then the series is non-uniformly convergent. 
zo zr g^ 
M@ A: Theseries 1+—+—+4+-:-+—+---=e* (7.79a) 
1! 2! n! 
(see Table 21.5, p. 1059) is convergent for every value of x. The convergence is uniform for every 
bounded interval of x, and for every |x| < a using the remainder of the Maclaurin formula (see 7.3.3.3, 
2., p. 472) the inequality 
qnl » 
πετ’ < (n 1j (0 « O « 1) (7.79b 
is valid. Since (n + 1)! increases faster than a"** in n, the expression on the right-hand side of the 
inequality, which is independent of x, will be less than € for sufficiently large n. The series is not uni- 
formly convergent on the whole numerical axis: For any large n there will be a value of x such tha 


|S(x) ~ Sa(2)| = e| g 








+1 








qnl 
mE is greater than a previously given e. 
Bb DB: Theseries z--z(1—z) + e(l- r) + z(1— x) H, (7.80a 


is convergent for every x in [0, 1], because corresponding to the d’Alembert ratio test (see 7.2.2.2, p. 461 


Qn+1 





= |1 — x| < lis valid for 0 < x < 1 (for z = 0 S = 0 holds). (7.80b 


= lim 
P n—-oo 








The convergence is non-uniform, because 

S(x) — S, (zx) 2 x((1— x)" + (1—a2)?P E... 2 (1— x)" (Τ.806 
is valid and for every n there is an x such that (1 — «)"*? is close enough to 1, i.c., it is not smaller than 
c. In the interval a € z € 1 with 0 « a « 1 the series is uniformly convergent. 














3. Weierstrass Criterion for Uniform Convergence 
The series (7.81a) is uniformly convergent in a given domain if there is a convergent series of constant 
positive terms (7.81b) such that for every x in this domain the inequality (7.81c) is valid. 


fala) + fala) +- + fala) t (7.81a) € 6 eb e b b eee (7.81b) 
|fn(@)| < Cn (7.81c) 
(7.81b) is called a majorant of the series (7.81a). 
7.3.2.2 Properties of Uniformly Convergent Series 


1. Continuity 
Ifthe functions f(x), fa(x),---, fa(x),--- are continuous in a domain and the series f(x) 4- fa(x) 4 +--+ 
falx) +: -+ is uniformly convergent in this domain, then the sum S(x) is continuous in the same domain. 
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If the series is not uniformly convergent in a domain, then the sum S(x) may have discontinuities in 
this domain. 

W A: Thesum of the series (7.80a) is discontinuous: S(x) = 0 for x = 0 and S(x) = 1 for x > 0. 

E B: The sum of the series (7.79a) is a continuous function: The series is non-uniformly convergent 
on the whole numerical axis. But it is uniformly convergent in every finite interval. 

2. Integration and Differentiation of Uniformly Convergent Series 

In the domain [a, b] of uniform convergence it is allowed to integrate the series term-by-term. It is also 
allowed to differentiate term-by-term if the result is a uniformly convergent series. That is: 


J Y (dt — Y | io dt for 2,2 € (a,b), (7.82a) 

To n=1 n-lgg 

(È fa) = Y f.) for x € [a,b]. (7.82b) 
n=1 n=1 


7.3.3 Power series 
7.3.3.1 Definition, Convergence 
1. Definition 


The most important function series are the power series of the form 





ao + a£ + agt? +--+ anI” +- — 5 asc" or (7.83a) 
n=0 
ay + a(x — £0) + a(x — £0)? +--+ + an(£ — z0)" +- = X anl — £0)", (7.83b) 
n=0 


where the coefficients a; and the center of expansion zo are constant numbers. 


2. Absolute Convergence and Radius of Convergence 
A power series is convergent either only for x = 2p or for all values of x or there is a number r > 0, the 
radius of convergence, such that the series is absolutely convergent for | — zo| « r and divergent for 
|x — xo| 7 r (Fig. 7.1). The radius of convergence can be calculated by the formulas 

1 
ΤΞ-------- (7.84) 


dim, lan 


« απ 
r — lim 
noo 





An+1 








domain of convergence if the limits exist. If these limits do not exist, then one has to take 


΄ \ the limit superior (lim) instead of the usual limit (see [7.6], Vol. I). 





XE x XEF At the endpoints x = +r and x — —r for the series (7.83a) and 
0 0 0 : . 
z = £o +r and zx — xo — r for the series (7.83b) the series can be 
Figure 7.1 either convergent or divergent. 


3. Uniform Convergence 
A power series is uniformly convergent on every subinterval |z — zo| € ro < r of the domain of conver- 
gence (theorem of Abel). 





ro oz gn 1 +1 

E For the series 1+2 +2 +--+% +- holds = = lim =1, ie; r= 1. (7.85) 
1 2 n r no n 

Consequently the series is absolutely convergent in —1 < x < +1, for x = —1 it is conditionally 


convergent (see series (7.34), p. 462) and for x = 1 it is divergent (see the harmonic series (7.16), p. 459). 
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According to the theorem of Abel the series is uniformly convergent in every interval [—r1, +11], where 


ΤΊ is an arbitrary number between 0 and 1. 
7.3.3.2 Calculations with Power Series 
1. Sum and Product 


Convergent power series can be added, multiplied, and multiplied by a constant factor term-by-term 


inside of their common domain of convergence. The product of two power series is 


(x ana) . bs a") = aobo + (aobı - a1bo)s -- (agba -- a,b; + agbo)2? 
n=0 n=0 
+(aobz + aib + azbı + azbo)z? +-+- 

2. First Terms of Some Powers of Power Series: 

S — a+br + cr? + dr? + ert + fa? E... , 

S? = a? + 2abe + (0? + 2ac)x” + 2(ad + be)x? + (2 + 2ae + 2bd)a* 

+2(af + be + cd) ++, 
10Y, 


2 
=s- at [ities (55-22) (82-1 








2a 2a 8a? 2a 4a? 















































(1e Τά 1? 3c 5 bt FT (a > 0) 
"(2a 4d 8d 16d 12804)” | ἘΠΕ 
2 : : à 
f =S =a |1 δν ey ae x4 ie a? 
VS 2a 8a? 2a 4a? 2a 16a? 
n 30d 3c? le 150c, 35 ὥς (a 5 ϐ) 
Λα} δα 2a 16a? 128a i E } 
1 E “4 b LEONE [2c d p? 3 
S s - | a (3 aj^ "ig a a)” 
20d Ê æ Pe bt\ 4 
' ( a? a a 3 a i d T (a #0), 
1 2 _2 b i b? a pj? 3 
ορ S α | 277 H ὃ 2, 48 x 
bd be 
PE s. i (a #0). 
3. Quotient of Two Power Series 
Σ απ 2 
nc ao | + art + aT +: O aon ολ. 4 i55 2 
δ = ο h [1 + (ay — B1) *- (ag — eai -- F^ — Ba)x 
πο +(a3 — a28) — 0182 — b3 — bi? + afr? + 28162)a* +--+] (bo # 0). 





(7.86) 


(7.87) 


(7.88) 


(7.89) 


(7.90) 


(7.91) 


(7.92) 


(7.93) 


One gets this formula by considering the quotient (7.93) as a series with unknown coefficients, and after 


multiplying by the numerator the unknown coefficients follow by coefficient comparison. 
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4. Inverse of a Power Series 
If the series 

















y — f(x) — ax + br? + cr? + dat + ez” + fz +- (a #0) (7.94a) 
is given, then its inverse is the series 
z = p(y) = Ay + By? + Cy? + Dyt + Ey? + Fy®+---. (7.94b) 
Taking powers of y and comparing the coefficients yields 
1 b il hs 1 > : 
A T B w σ z (2b — ac), D= g Čabe a“d — 50°), 
1 y 
E = — (6a°bd + 3a7c? + 14b* — a’e — 21ab?c), (Τ.949) 
a 








1 : 
F π (Ta? be -- Ta?cd -- 84ab? c — a* f — 28a?! d — 28a?bc? — 42D). 
α 
The convergence of the inverse series must be checked in every case individually. 


7.3.3.3 "Taylor Series Expansion, Maclaurin Series 


There is a collection of power series expansions of the most important elementary functions in Ta- 
ble 21.5, p. 1057. Usually, one gets them by Taylor expansion. 

1. Taylor Series of Functions of One Variable 

If a function f (x) has all derivatives at x = a, then it can often be represented with the Taylor formula 
as a power series (see 6.1.4.5, p. 443). 

a) First Form of the Representation: 

(zx — a)* 


f'(a) +- ~ra) +. (7.95a) 


—a (x — a? 


f(z) - f(a) + 5 f) 5, 


This representation (7.95a) is correct only for the x values for which the remainder Rp = f(x) — S5 
tends to zero if n — oo. This notion of the remainder is not identical to the notion of the remainder 
given in 7.3.1, p. 467 in general, only in the case if the expressions (7.95b) can be used. 





There are the following formulas for the remainder: 


πο n+l 
Ry = arene) (a<€<«a or ©<€<a) (Lagrange formula), (7.95b) 
n ! 
Εμ E (a — t)^ fe» (t) at (Integral formula). (7.95c) 
n! 


b) Second Form of the Representation: 





h h? h” 
f(a -- h) — f(a) - (a) - P" (0) OOA (7.962) 
The expressions for the remainder are: 
hti 
Ry = no pi ο -Oh) (0<0<1), (7.965) 


1 


h 
R, = — / (h — t) fOD (a + t) dt. (7.966) 
ο 


3 


1 
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2. Maclaurin Series 
The power series expansion of a function f(x) is called a Maclaurin series if it is a special case of the 
Taylor series with a = 0. It has the form 





q a? gq" 
Fle) = $0) + FF) + FIO) +--+ SFO) $e (7.972) 
with the remainder 
ant 1 f n g(n4-1) 
as gi" (ex) (0<0<1), (7.97b) R= afe -APSO tdt. (7.970) 
! d 


The convergence of the Taylor series and Maclaurin series can be proven either by examining the re- 
mainder R,, or determining the radius of convergence (see 7.3.3.1, p. 469). In the second case it can 
happen that although the series is convergent, the sum S(x) is not equal to f(x). This holds for instance 
for the function f(a) = exp(- 4) for x # 0, and f(0) = 0. The terms of its Maclaurin series are all 
identically zero, e.g., S(x) = 0 # f(x). 


7.3.4 Approximation Formulas 


Considering only a neighborhood small enough of the center of the expansion, one can introduce ratio- 
nal approximation formulas for several functions with the help of the Taylor expansion. The first few 
terms of some functions are shown in Table 7.3. The data about accuracy are given by estimating the 
remainder. Further possibilities for approximate representation of functions, e.g., by interpolation and 
fitting polynomials or spline functions, can be found in 19.6, p. 982 and 19.7, p. 996. Important series 
expansions see Table21.5, p. 1057. 

7.3.5 Asymptotic Power Series 


Even divergent series can be useful for calculation of function values. Some asymptotic power series 


: 1 : ' 
with respect to — are considered next to calculate the values of functions for large values of ||. 
T 


7.3.5.1 Asymptotic Behavior 


Two functions f(x) and g(x), defined for xo < x < oo, are called asymptotically equal for x — oo if 





3) =i (7.98a) or f(x)-—g(x)--o(g(r) for x oo (7.98b) 


hold. Here, o(g(a)) is the Landau symbol “little o" (see 2.1.4.9, p. 57). If (7.98a) or (7.98b) is fulfilled, 
then one writes also f(x) ~ g(x). 
Wi Jr Xen απο, πω a. 
4931-5152 4a? 
7.3.5.2 Asymptotic Power Series 
1. Notion of Asymptotic Series 


7 Ων. ; : : τ 
A series Poco is called an asymptotic power series of the function f(x) defined for x > xo if 
x 


LM 1 
ja- ES «o() em 
1 
holds for every n — 0,1,2,.... Here, O (zu) is the Landau symbol *big O". For (7.99) one also 
x 
oo αν 


writes f(x) z& 5 ,—. 
Am 
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Table 7.3 Approximation formulas for some frequently used functions 





Tolerance interval for x with an error of 




































































Approximate Next 0.1% 1% 10% 
formula term from to from to from to 
d ye x —0.077 0.077 —0.245 0.245 —0.786 0.786 
ο. ~6 | -44° | 445 | -14.0° | 14.0° | -4505 | 4505 
; x a? —0.580 0.580 —1.005 1.005 —1.632 1.632 
sine & «-— — += 
6 120 | —33.2° 33.2° —57.6° 57.6" —93.5° 98.55 
ος η s? —0.045 0.045 —0.141 0.141 —0.415 0.415 
μα 2 —2.6° 2.65 —8.1° 8.15 —25.8° | 25.85 
πα a? "s —0.386 0.386 —0.662 0.662 —1.036 1.036 
2 24 —22.1° 22.19 —37.9° 37.9° —59.3° 59.3° 
— Q8 —0.054 0.054 —0.172 0.172 —0.517 0.517 
3 —3.1° 3.1° —9.8° 9.8° —29.6° 29.6° 
— a? d ut —0.293 0.293 —0.519 0.519 —0.895 0.895 
3 15 —16.8° 16.8° —29.7° 29.7° —51.3° 51.35 
: -ᾱ 
να Επ βία. o HS —0.085a? | 0.093a? | —0.247a? | 0.328a? | —0.607a? | 1.545a? 
l a +r 
= (« + i ) 
1 1 T 32? τ. 3 9 Eo.:9. i50 2 r20,42 
UIS N a^ 258 taa —0.051a^ | 0.052a^ | —0.157a^ | 0.166a^ | —0.488a^ | 0.530a 
1 1 gy x? 
R——-— +— | —0.031a |0.031a | —0.099a | 0.0994 | —0.301a | 0.301a 
a+r a a a3 
9 
ο ο] κα +5 —0.045 0.045 —0.134 0.148 —0.375 0.502 
22 
In(14- z)z x S — 0.002 0.002 —0.020 0.020 —0.176 0.230 




















2. Properties of Asymptotic Power Series 
a) Uniqueness: If for a function f(x) the asymptotic power series exists, then it is unique, but a func- 
tion is not uniquely determined by an asymptotic power series. 


b) Convergence: Convergence is not required for an asymptotic power series. 
8 1 ymp 


oo 











MA: ce 5 "τα Ὁ an asymptotic series, which is convergent for every x with |x| > £o (zo > 0). 
ον! 
v=0 T eost 

E B: The integral f(x) = [ tae dt (x > 0) is convergent for x > 0 and repeated partial inte- 

Jo 
. : ] 1 i 2 3 n — 1)! 
gration results in the representation f(x) — στα e te pyr! ) + Ry(x) 
yo T x g^ 
ith Rala) = (1P IT. ο ο (Rala) < © [e dt =” ret 
with R,(xr) — (—1) 5f xui d ecause of |, (x)| € 5S e | = aH one gets 
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1 
Rn(x) =O (=a): and with this estimation 


oo 


co eat yv 
1 L4 CU a (7.100) 


v=0 
The asymptotic power series (7.100) is divergent for eveiy x, because the absolute value of the quotient 


of the (n + 1)-th and of the n-th terms has the value 7 —. However, this divergent series can be used for 


a reasonably good approximation of f(x). For stancà, for x — 10 with the partial sums 54(10) and 
lot 


ΤΕ ; 4 « 0.09164 holds. 





55(10) the estimation 0.09152 < Li 


7.4 Fourier Series 
7.4.1 Trigonometric Sum and Fourier Series 
7.4.1.1 Basic Notions 


1. Fourier Representation of Periodic Functions 
Often it is necessary or useful to represent a given periodic function f(x) with period T exactly or 
approximatively by a sum of trigonometric functions of the form 
a 
Sp(x) = a + a COSwWT + az COS 2wT + -+ + an COS NWE 
+b; sinwx + bz sin 2wg + -- - + bn sin nws. (7.101) 


2π 
This is called the Fourier expansion. Here the frequency is w = Τ᾽ In the case T = 27 holds w = 1. 


One can get the best approximation of f(x) in the sense given on 7.4.1.2, p. 475 by an approximation 
function s,,(x), where the coefficients a, and by (k = 0,1,2,...,m) are the Fourier coefficients of the 
given function. They are determined with the Euler formulas 


91 ro4T » Tf? 
r= = fC) cos kwax dx = = MEO cos kwa da = a fu (9) 4- f(—2)| cos kox dv, (7.102a) 
0 z 0 
and 
ao+T a? 
ο. )sin kwx dx — — 7 fie )) sin kwg dz — — ΤΙ (f(x) (—x)]sinkwx dx, (7.102b) 


or approximatively with the help of methods of harmonic analysis (see 19.6.4, p. 992). 

2. Fourier Series 

If there is a system of x values such that the sequence of functions s,,(a) tends to a limit s(a) for n + oo, 
then for the given function exists a convergent Fourier series for these x values. This can be written in 
the form 


ao 
s(x) = ο + a1 COS WH + Az COS 2WH + +++ + Gp, cos nua +: 





+b; sinwax + bg sin 2wx +---+b,sinnwa+--- (7.103a) 
and also in the form 
s(x) = > + A; sin(wx + y1) + Ag sin(2wa + ye) +--+ + An sin(nwx + Φῃ) 1--:-, (7.103b) 


where in the second case: 


A, = y ak? + bj, tan Yk = . 3 (7.103c) 
k 
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3. Complex Representation of the Fourier Series 
In many cases the complex form is very useful: 


Too . 
Y eem, (7.104a) 


k-—oo 


1 
230 fork =0, 


3 


T 
-τ|τω ether dy = λίαν — ib)  fork>0, (Τ.1040) 
0 


1 
Pi Tdba) fok«0. 


7.4.1.2 Most Important Properties of the Fourier Series 


1. Least Mean Squares Error of a Function 


2π 
If a function f(x) over the interval [0, T] (τ = 2) is approximated by a trigonometric sum 
ω 


Sp(@) = S D x aj, cos kw + 5 by sin kw, (7.105a) 
k-i 


also called the Fourier sum, then the mean square error (see 19.6.2.1, p. 984, and 19.6.4.1, 2., p. 993) 


r- gute — sa (x)]? da (7.105b) 


is smallest if one defines ag and by as the Fourier coefficients (7.102a,b) of the given function. 
2. Convergence of a Function in the Mean, Parseval Equation 


. i . 2m, . : "E 
The Fourier series converges over the interval [0, T] (T = —) in mean to the given function, i.e., 
ω 


[use )-s.(z)Pdz—0 fo n-oo (7.1062) 


holds, if the function is bounded and in the interval 0 < x < T itis piecewise continuous. A consequence 
of the convergence in the mean is the Parseval equation: 


T qo 
| πώ} αν cuu Σι (ax? + by”). (7.106b) 
/ i 


3. Dirichlet Conditions 

If the function f(a) satisfies the Dirichlet conditions, i.e., 

a) the interval of definition can be decomposed into a finite number of intervals where the function f(x) 
is continuous and monotone, and 

b) at every point of discontinuity of f (x) the values f(x + 0) and f(x — 0) are defined, 

then the Fourier series of this function is convergent. At the points where f(x) is continuous the sum 
is equal to f(x), at the points of discontinuity the sum is equal to feos form ; 

4. Asymptotic Behavior of the Fourier Coefficients 

If a periodic function f(a) and its derivatives up to k-th order are continuous, then for n — oo both 
expressions a,n^*! and b,n**! tend to zero. 





476 7. Infinite Series 











ώς, f(x) 
0| τὶ Τ x ο τι τ x 
2 2 
Figure 7.2 Figure 7.3 Figure 7.4 


7.4.2 Determination of Coefficients for Symmetric Functions 
7.4.2.1 Different Kinds of Symmetries 


1. Symmetry of the First Kind 
If the periodic function f(x) with the period T is an even function, i.e., if f(x) = f(—a) (Fig. 7.2), 
then its Fourier coefficients are 
4 τρ 2 
i= = / f(x)cosk T dg, b 20  (k-0,12,..). (7.107) 
T à T 


2. Symmetry of the Second Kind 
If the periodic function f(x) with the period T is an odd function, i.e., if f(x) — —f(—z) (Fig. 7.3), 
then its Fourier coefficients are 
4 Ta 2 
üpcd, hem / f(a)sink dg — (k-0,1,2,...). (7.108) 
T i T 


3. Symmetry of the Third Kind 
If for a periodic function with the period T holds f(a + 7/2) = —f(«) (Fig. 7.4), then the Fourier 
coefficients are 


T/2 
4 3πα 
aua m / f(x) cosQk --1) 7 dm, aa — 0, (7.1092) 
0 
4 T/2 Sea 
b= / f(x) sin(2k + 1) de, bg =0 (k=0,1,2,...). (7.109b) 
ὅ 


4. Symmetry of the Fourth Kind 
If the periodic function f(x) with the period T is odd and also the symmetry of the third kind is satisfied 
(Fig. 7.5a), then the Fourier coefficients are 


T/4 
T baky = 2 J f(x) sin(2k + ye dr (k=0,1,2,...). (7.110) 
0 


If the function f(x) is even and also the symmetry of the third kind is satisfied (Fig.7.5b), then the 
Fourier coefficients are 


T/A 
2nr 
hoax —0, axi 7 / f(x)cosQk --1) 7 dm. (k—0,1,2,.... (7.111) 


0 
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Figure 7.6 | Figure 7.7 
7.4.2.2. Forms of the Expansion into a Fourier Series 


Every function f (z), satisfying the Dirichlet conditions in an interval 0 € x X 1 (see 7.4.1.2, 3., p. 475), 
can be expanded in this interval into a convergent series of the following forms: 


αρ  2nx 2πα 2πο 
1. fi(x) = a ay cos —— + dp con 2—— Tee Han cosn—= m +> 


3πα ὅπο 3πα 
+ bısin τ + be sin2-7* Tec b,sin n Teen (7.112a) 





The period of the function f;(x) is T = l; in the interval 0 < x < I the function f(x) coincides 
with the function f(x) at the points of continuity (Fig. 7.6). At the points of discontinuity fi(x) = 


1 
2 [f(x — 0) - f(x 4- 0)] holds. The coefficients of the expansion are determined with the Euler formulas 


(7.102a,b) for w = τη 








2. falx) B H aı cos "T az cos 275 +-+ | ο Een (7.112b) 


The period of the function f(x) is T = 21; in the interval 0 < x < / the function f(a) has asymmetry 
of the first kind and it coincides with the function f(a) (Fig. 7.7). The coefficients of the expansion 
of fo(a) are determined by the formulas for the case of symmetry of the first kind with T = 2. 


3. fs(x) = b) sin B + be sin27* Tek sinn = Te (7.112c) 
The period of the function f3(x) is T = 2l; in the interval 0 < x < / the function f3(2) has a symmetry 


of the second kind and it coincides with the function f(x) (Fig. 7.8). The coefficients of the expansion 
are determined by the formulas for the case of symmetry of the second kind with T = 21. 


7.4.3 Determination of the Fourier Coefficients with Numerical 
Methods 


If the periodic function f(x) is a complicated one or in the interval 0 < x < T its values are known 


kT 
only for a discrete system x, = N with k — 0,1,2,..., N — 1, then the Fourier coefficients have to be 
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approximated. Furthermore, e.g., also the number of measurements N can be a very large number. In 
these cases one uses the methods of numerical harmonic analysis (see 19.6.4, p. 992). 


7.4.4 Fourier Series and Fourier Integrals 


1. Fourier integral 

If the function f(x) satisfies the Dirichlet conditions (see 7.4.1.2, 3., p. 475) in an arbitrarily finite 
+00 

interval and, moreover, the integral j | f (x)| dx is convergent (see 8.2.3.2, 1., p. 507), then the following 
—oo0 


formula holds (FOURIER integral): 


1 +00 +00 1 oo +00 
f(a) = A / ci^* du / f (the! dt = JE / f(t) cos (t — x) dt. (7.113a) 
π π 
n —oo Ὁ co 
At the points of discontinuity can be used the substitution 
1 
f(x) = aU (x — 0) * f(x + 0). (7.113b) 


2. Limiting Case of a Non-Periodic Function 

The formula (7.113a) can be regarded as the expansion of a non-periodic function f(2) into a trigono- 
metric series in the interval (—1, +1) for | + oo. 

With Fourier series expansion a periodic function with period T is represented as the sum of harmonic 


vibrations with frequency wn = nm with n = 1,2,... and with amplitude A,. This representation is 
based on a discrete frequency spectrum. 


With the Fourier integral the non-periodic function f(x) is represented as the sum of infinitely many 
harmonic vibrations with continuously varying frequency w. The Fourier integral gives an expansion 
of the function f(r) into a continuous frequency spectrum. Here the frequency w corresponds to the 
density g(w) of the spectrum: 


+00 
1 ; 
gw) = = / fte dt. (7.113c) 
The Fourier integral has a simpler form if f(a) is either a) even or b) odd: 
9 oo oo 
a) f(x)= = f coswx à» [ f() cos wt dt, (7.114a) 
π 
0 0 


oo 


b) f(x) = = [sinuxda f f(t)simut dt (7.114b) 
0 


π 
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E The density of the spectrum of the even function f(x) = e~*! and the representation of this function 
are 
2 


2 [e 2 
g(w) — — i e * cosutdt — — 
T Jo π 


2 [99 cosur 
and e hl 
—q (4H) and e -:| aq de. (Τ1150) 





7.4.5 Remarks on the Table of Some Fourier Expansions 


In Table 21.6, p. 1062 there are given the Fourier expansions of some simple functions, which are 
defined in a certain interval and then they are periodically extended. The shapes of the curves of the 
expanded functions are graphically represented. 

1. Application of Coordinate Transformations 

Many of the simplest periodic functions can be reduced to a function represented in Table 21.6 either 
by changing the scale (unit of measure) of the coordinate axis or by translation of the origin. 

W A function f(x) = f(—«) defined by the relations 


T 
2 forüÜc zc, 


y= 
T T 
for = < t < > 
0 or 1 κας 9 
(Fig. 7.9), can be transformed into the form 5 given in Table 21.6, by substituting a — 1 and in- 
2πα 
troducing the new variables Y — y — 1 and X — T T z By the substitution of the variables in 
27r 20 
series 5, because sin(2n 4- 1) (+ + 5) — (—1)" cos(2n — DT one gets for the function (7.116a) 


(7.1162) 


T 2 
the expression 
4 πε 1 2T 1 2πα 

1 - 1 2 (co T7 — 10053 + = coss ++). 7.116b 

π ᾿ΕΝ; T ) 
2. Using the Series Expansion of Complex Functions 
Many of the formulas given in Table 21.6 for the expansion of functions into trigonometric series can 
be derived from power series expansion of functions of a complex variable. 
W The expansion of the function 





ing beoe (Iz] « 1) (7.117) 
yields for 
z = ac? (7.118) 


after separating the real and imaginary parts 


1 — acos y 
1 cos a? cos 2 ee ? cos m. m LM 
ώση οσα DE 1— 2acos y + a? ’ 


a sin +a? sin 2y +- a^ sinn +- = E ui for |a| « 1. (7.119) 





8 Integral Calculus 


1. Integral Calculus and Indefinite Integrals Integration represents the inverse operation of 
differentiation in the following sense: While differentiation calculates the derivative function f'(x) of a 
given function f(x), integration determines a function whose derivative f'(x) is previously given. This 
process does not have a unique result, so it leads to the notion of an indefinite integral. 

2. Definite Integral By starting with the graphical problem of the integral calculus, determining 
the area between the curve of y = f(x) and the x-axis, and for this purpose approximating it with a 
set of thin rectangles (Fig. 8.1), the notion of the definite integral can be introduced. 

3. Connection Between Definite and Indefinite Integrals The relation between these two types 
of integral is the fundamental theorem of calculus (see 8.2.1.2, 1., p. 495). 


8.1 Indefinite Integrals 


8.1.1 Primitive Function or Antiderivative 


1. Definition 
Consider a function y = f(x) given on an interval [a,b]. F(x) is called a primitive function or an- 
tiderivative of f (x) if F(x) is differentiable everywhere on fa, b] and its derivative is f(x): 

F(z) = f(a). (8.1) 
Because under differentiation of F(x) + C (C const) the additive constant disappears, a function has 
infinitely many primitive functions, if it has any. The difference of two primitive function is a constant. 
So, the graphs of all primitive functions F\ (x), F)(x),..., F,(a) can be got by parallel translation of a 
particular primitive function in the direction of the ordinate axis (Fig. 8.2). 


Y 




















* » 
las a 
0 azxy XX X446 Xx Xy XnıX =b X 


Figure 8.1 











Figure 8.2 Figure 8.3 


2. Existence 


Every function continuous on an interval [a,b] has a primitive function on this interval. If there are 
some discontinuities, then one decomposes the interval into subintervals in which the original function 
is continuous (Fig. 8.3). The given function y = f(x) is in the upper part of the figure; the function 
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y = F(x) in the lower part is a primitive function of it on the considered intervals. 
8.1.1.1 Indefinite Integrals 
The indefinite integral of a given function f(a) is the set of primitive functions 


F(2)+C = / f(x) dz. (8.2) 


The function f(x) under the integral sign / is called the integrand, x is the integration variable, and C 


is the integration constant. It is also a usual notation, especially in physics, to put the differential dx 
right after the integral sign and so before the function f(x). 


Table 8.1 Basic Integrals 









































Powers Exponential Functions 
qnl 
Je dx = STI (n z —1) Je dx = e* 
JZ =m] fea (a>0,a41) 
Trigonometric Functions Hyperbolic Functions 
/ sina dx = —cosx / sinh z dr = cosh x 
/ cos x dx = sin x / cosh x dx = sinh x 
fia xdz = — ln | cos z| feme dx = In | cosh z| 
[cot x dx — ln |sinx| foon x dx = ln | sinh z| 
dx / dx 
= tanz — z = tanhg 
J cos? x MS J cosh* x IST 
dx da 
——— — —cotr — = -cothz 
J sin? x SUUS J sinh* x πο 
Fractional Rational Functions Irrational Functions 
EC = arctan Ë -i£ = arcsin Ë (|z| < a,a > 0) 
a* rax a g Va? — x? a ᾿ 
ας... Λε, r 1 a+r dx ia - — 5 
dog gArtanh z — g;lu | ES z| Ize y7 Arsinh | = In | + Va? +a | 
(|x| < a,a > 0) (a > 0) 
dr Nae ae 1 r—a dx ελ ο ασ κής, 22 52 
zu gi qAreoth | = gq ln | E 2 Jz —À Arcosh | = In |z + V2? —a | 
(|z| > a,a > 0) (|z| >a,a > 0) 














8.1.1.2 Integrals of Elementary Functions 


1. Basic Integrals 

The integration of elementary functions in analytic form is reduced to a sequence of basic integrals. 
These basic integrals can be got from the derivatives of well-known elementary functions, since indefi- 
nite integration means the determination of a primitive function F(x) of the function f(x). 

The collection of integrals given in Table 8.1 comes from reversing the differentiation formulas in Ta- 
ble 6.1, p. 434 (Derivatives of elementary functions). The integration constant C is omitted. 
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2. General Case 

For the solution of integration problems, one tries to reduce the given integral by algebraic and trigono- 
metric transformations, or by using the integration rules to basic integrals. The integration methods 
given in section 8.1.2 make it possible in many cases to integrate those functions which have an el- 
ementary primitive function. The results of some integrations are collected in Table 21.7, p. 1065 
(Indefinite integrals). The following remarks are very useful in integration: 

a) The integration constant is mostly omitted. Exceptions are some integrals, which in different forms 
can be represented with different arbitrary constants. 

b) If in the primitive function there is an expression containing In f (x), then one has to consider always 
In |f (2)| instead of it. 

c) If the primitive function is given by a power series, then the function cannot be integrated in an 
elementary fashion. 

A wide collection of integrals and their solutions are given in [8.1] and [8.2]. 


8.1.2 Rules of Integration 

The integral of an integrand of arbitrary elementary functions is not usually an elementary function. 
In some special cases it is possible to use some tricks, and by practice one can gain some knowledge of 
how to integrate. Today the calculation of integrals is mostly left to computers. 

The most important rules of integration, which are finally discussed here, are collected in Table 8.2. 


1. Integrand with a Constant Factor 
A constant factor a in the integrand can be factored out in front of the integral sign (constant multiple 
rule): 


a f(x) dr = a | Κο dx. (8.3) 


2. Integration of aSum or Difference 
The integral of a sum or difference can be reduced to the integration of the separate terms if one can 
tell their integrals separately (sum rule): 


f (nodum [uds f vds- [ was. (8.4) 
The variables u, v, w are functions of x. 
3 4 


E 
E f 696 as - f (2t +62? +101 + 62 +9) dx = = + Sa + Da4 3a +90 +0. 


3. Transformation of the Integrand 
The integration of a complicated integrand can sometimes be reduced to a simpler integral by algebraic 
or trigonometric transformations. 


1 
E [sin 2r cos x dz — / 3 (sin 3a + sina) dx. 
4. Linear Transformation in the Argument 


If / f(x) dz = F(x) is known, e.g., from an integral table, then one gets: 


| fax) dz = *F (a2) +O, (8.5a) ns c 6)dx — F(x 4b) - C, (8.5b) 
ICE +) dx = “F(a £540 (8.5c) 


1 1 
B A: [sm ax dx = ——cosaxz + C. B b: jg dz = -et 4 C. 
. à à a 
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HC: Hie — arctan(z 4- a) 4- C. 


Table 8.2 Important Rules of Calculation of Indefinite Integrals 











Rule Formula for Integration 

Integration constant / f(z)dz — F(x)--C (C const) 
Integration and "pro βρε 

differentiation F'(z)- dz f(z) 





Constant multiple rule / af(x)dx =a / f(a) dx (α const) 
Sum rule fno + v(x)| dx = fuo) dx + EOD απ 
Partial integration ΠΠ dx = u(z)v(x) — | ὑ(ϑυ(α) dx 

















x=u(t) or t=v(2); 
u und v are inverse functions of each other : 
Substitution rule / f(a) dx = fi (u(t))u'(t) dt or 


Γι) 
EO dz = v(u(t)) dt 


1: / πε dr —lnn|f(x)| -- C (logarithmic integration) 
$ 


2. f Pleads =e) +0 


u und v are inverse functions of each other : 
Integration of the no dx — ru(r)— F(u(x)) J-C1 with 
inverse function 

F(x) = EO dr--C$ (Ci, C2 const) 











Special form of 
the integrand 

















5. Power and Logarithmic Integration 


a) If the integrand has the form of a fraction such that the numerator is the derivative of the denomi- 
nator, then the integral is the m of the absolute value of the denominator: 








PO an= fF mr c. (8.6) 


2 
ΒΑ: IE τ z dr — In [z? € 3r - 5| C. 
x? + 3a 


b) If the integr m is a product of a power of a function multiplied by the derivative of the function, 
and the power is not equal to —1, then holds 


[ raras - [ ooa) - P703 


a+l1 


2x+3 1 
B: le lx = FC. 
- (x? + 3a — 5)8 a (—2)(a? + 3a — 5)? * G 





(a const, a # —1). (8.7) 
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6. Substitution Method 
If x = u(t) where t = v(x) is the inverse function of x = u(t), then according to the chain rule of 
differentiation follows 

















9) 
ο) ἀν - f u(t)]u(t) dt. or i d c 
f fas fa æ fro) PICO (8.8) 
ec -—1 au dx 1 : D 
W A: / w^ Substituting x = lnt, (t > 0), EE then taking the decomposition into 
. 6 
partial fractions gives: 
εἳ --1 t — 1dt 2 1 
dz zi z) it —21n(& 41) - 2 4 C. 
jm m dL HEEL A [EY 4. ΗΕ 
It x dx 
E B: TE . Substituting 1+ 2? = t, ο εξ 2x, then one gets αμα -f5- = T 14-α3) ἠ- 
1+ da J 142? 


C. 


7. Partial Integration 
Reversing the rule for the differentiation of a product gives 


frewa) dx = u(x) v(x) — fiw v(x) dz, (8.9) 
where u(x) and v(x) have continuous derivatives. 
W Tho integral / xe” dx can be calculated by partial integration choosing u = x and v’ = e”. Then 
u = 1 and v = e? , therefore Ja dr = te — fe dz — (x — 1)e7+C. 


8. Non-Elementary Integrals 

Integrals of elementary functions are not always elementary functions. These integrals are calculated 
mostly in the following three ways, where the primitive function will be approximated by a given accu- 
racy: 

1. Table of Values The integrals which have a particular theoretical or practical importance but 
cannot be expressed by elementary functions can be given by a table of values. (Of course, the table 
lists values of one particular primitive function.) Such special functions usually have special names. 
Examples are: 


W A: Integral Logarithm (see 8.2.5, 3., p. 513): 


z dt 
T Li(z). a 
hun (80) 
W D: Elliptic integral of the first kind (see 8.1.4.3, p. 490): 
sin p dx 
f SIG gh, (8.11) 
Jo 


y (1 — 22)(1 — K222) 


W C: Error function (see 8.2.5, 5., p. 514): 


S I e^? dt — erf(z). (8.12) 


2. Integration by Series Expansion Using the series expansion of the integrand, and if it is uni- 
formly convergent, then it can be integrated term-by-term. 





ΒΑ: / pem dx, (see also Sine integral p. 513). 
J g 


οἵ 
E B: / — dx, (see also Exponential integral p. 514). 
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3. Graphical integration is the third approximation method, which is discussed in 8.2.1.4, 5., 
p. 499. 
8.1.3 Integration of Rational Functions 
Integrals of rational functions can always be expressed by elementary functions. 
8.1.3.1 Integrals of Integer Rational Functions (Polynomials) 
Integrals of integer rational functions are calculated directly by term-by-term integration: 

n Ant” -- as 42" 7| -- ---  aqx -- ag) dr 

απ 


E ΤΝ 4 Sey am H aox + C. (8.13) 
n n 





8.1.3.2 Integrals of Fractional Rational Functions 
P(x 
The integrand of an integral of a fractional rational function / πο dx, where P(x) and Q(x) are 
T 
polynomials with degree m and n, respectively, can be transformed algebraically into a form which is 
easy to integrate. There are the following steps: 
1. Simplifying the fraction by the greatest common divisor, so P(x) and Q(x) have no common factor. 


2. Separating the integer and rational parts of the expression. If m > n , then P(x) is divided by Q(x). 
Then a polynomial and a proper fraction should be integrated. 





3. Decomposing the denominator Q(x) into linear and quadratic factors (see 1.6.3.2, p. 44): 


Q(x) = an(a — a)*(a — B)! +++ (x? + px +9)" (a? +p'e+q) --- (8.14a) 
2 /2 
with s —q«0, - ~q <0,.... (8.14b) 


4. Factoring out the constant coefficient a, in front of the integral sign. 


5. Decomposing the fraction into a sum of partial fractions: The proper fraction which is obtained 
after the division, cannot be further simplified and its denominator is decomposed into a product of 
irreducible factors. Then it can be decomposed into a sum of partial fractions (see 1.1.7.3, p. 15), each 
of them is easy to integrate. 


8.1.3.3 Four Cases of Partial Fraction Decomposition 


1. Case: All Roots of the Denominator are Real and Single 














Q(x) = (x — a)(x — B)---(x— A) (8.15a) 
a) Forming the decomposition: 

P(x) A, B, L 

Qa) r-a r6 : d (8.15b) 

P(a) Ρ(8) P(A) 
with A= x 3B ο δω Ξξ : 8.15c 
aay P-qu Qj TUS 

b) The numbers A, B, C,..., L can also be calculated by the method of undetermined coefficients (see 


1.1.7.3, 4., p. 17). 
c) Integration by the formula 





i Adit — Aln|z — al. (8.15d) 


ο πα 
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" if (22 4-3)dx — 2x --3 A B C P(0) ( 2x -3 ) 
z—0 


I = A= - 
z3 + x? — 2x z(x — 1)(x +2) ατα τη 252 Q'(0) 3r? + 27 — 2 
1 
δ’ 


9 25 1-83 5 21 4-89 
95 dB 2 ΠΗ _ 
9) a a 87 e (x22: 3)... 


-3 3 -i 3 5 1 
I f («ue zhJe- ginlel +3 ile 1| ginle 2 c Ci — In 














ο. 15/9 
x3/2 (x + 2)1/6 














2. Case: All Roots of the Denominator are Real, Some of them with a Higher Mul- 
tiplicity 








Q(z) — (x — o) (x — By"... (8.164) 
a) Forming the decomposition: 
P(x) = Αι 4 A» i ' ΑΙ 
Ql) (z-a) (z-a) * (x — o) 
Bı B B 
I t Fasta ew 8.16b 
(α-β) (x—Bp (x— By ( ) 
b) Calculation of the constants A1, Ao, ..., Aj, B, Ba, ..., Bj, ... by the method of undetermined co- 


efficients (see 1.1.7.3, 4., p. 17). 
c) Integration by the rule 








A, dx Ay dx Ax 
- Ajln|z / - ΕΝ à 
/ r-a 11n |e — al, (x — a)* (k — 1)(x — o)*-1 i (8106) 
a? 41 z? +1 A B B B. . 
i ;-— 1 m ae qu m EE τς | z = + E στρ t e πῃ . The method of undetermined 


coefficients yields A + B1 = 1, —3A — 2B; + Bə = 0, 3A + Bı — B2 + B3 = 0, -A = 1; A 1, By 

2, B3 — 1, B3 — 2. The result of the integration is 
1 2 1 2 1 1 

Jj / i dr — —In|s| -21n|z — 1 -C 

: | zx r—1 πα cm i n [zl als | 3 B 

(x — 1)? 

















T 


=e (x — 1)? 


tC. 














3. Case: Some Roots of the Denominator are Single Complex 


Suppose all coefficients of the denominator Q(x) are real. Then, with a single complex root of Q(x) its 
conjugate complex number is a root too and one can compose them into a quadratic polynomial. 


Q(x) — (x — o) (x — B)" ... (a? - px - q)(a? - pa -- q)... (8.172) 
2 Q2 
with E <q, 2 x dau (8.17b) 


because the quadratic polynomials have no real zeros. 
a) Forming the decomposition: 
P(x) Αι A» ντ. Αι Bı Bz Bm 
Q(r  xr—o (r—-a)- "(r-oa) α-β (x — yr 
σα-ε D Εα- Ε 


G-p} 
(8.17c) 
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b) Calculation of the constants by the method of undetermined coefficients (see 1.1.7.3, 4., p. 17). 


Cr+ D 


c) Integration of the expression — Es by the formula 
x 





























ρα" τα 
(Cr--D)dr Ο 2 D — Cp/2 x + p/2 
ln |z" + pr + q| Jj arctan ——————. 8.17d 
r? +pr+q 2 a Ps [q — p?/4 [q— p?/4 ( ) 
4dx 4 A x+D 
π’- / d = H ος . The method of undetermined coefficients yields the 
x? + 4a we+4e 4 9314 

equations A + C = 0, D = 0, 4A v A=1,C 1, D=0. 

1 MEI 
yz / (- gi xs Z) dx = ln |z| — = jn T 4) -In|Ci| 2 In JG , Where in this particular case 








the term arctan is missing. 


4. Case: Some Roots of the Denominator are Complex with a Higher Multiplicity 








Q(x) — (x — a)*(x — 8) ... (a? -F px - g)" (a? -- pz - q')^.... (8.18a) 
a) Forming the decomposition: 
Pm A A A B B. B 
σῶ “a8 Ga ae a ee 
— Oe +Dy Cytt+Do | Q0 Cnt t Ds 
αλ γα γα (Q-prcqy αδ Ένα ο” 
Εια} ΕΙ Boa Εν Επο 4 F, 





| H bes 4 S 8.18b 
αν ρασ-α (a? pr gy (x? + pla + η ( ) 
b) Calculation of the constants by the method of undetermined coefficients. 
Cnt + Dm 
(£? + pz + q)” 
a) Transformation of the numerator into the form 


C, C 
Cuz Da "(2a + p) + (Dn = ey, (8.185) 
B) Decomposition of the integrand into the sum of two summands, where the first one can be integrated 


directly: 


c) Integration of the expression for m > 1 in the following steps: 

















J (2x + p) dx Cm 1 -. (8.184) 
2 oo - 2(m — 1) (a? 4- pr 4 q)n-i 
y) The bs one will be integrated by the following recursion formula, not considering its coefficient: 
dx a+ p/2 
/ (z?--pr--q)" | 2(m — 1) (q — p?/4) (a? ἠ- px + q)™! 
2m — 3 dx 
| 2(m — 1) (q — p?/4) J. (x2 - pr - q)n-1* (55 
Er- Qa? +24 +13 dis 2z? +2r+13 A Cix + Dı Cr+ Də 
(x — 2) (x2 + 1)? (x—2)(x? +1} 2-2 23451 (a? + 1)? 


The method of undetermined coefficients results in the following system of equations: 
A+C; =0, —2C,4+D, — 0, 2A44-C41—2D4 C3 — 2, —2014-D1 205 4- D — 2, A-2D1—2Ds — 13; 
the coefficients are A — 1, Ci 1, Di 2, C2 3, Do 4, 


1 2 1-2 92 1-4 
=] —— — —— — M —— | da. 
J \a-2 a+4+1 — (2241) 
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' dx T 
According to (8.18e) / NS ip 7 ia τς 8 5 s wa = a $3 
3—4x à 11 (x — 2)? 
ee 
2(32--1) 2  a?41 


1 
| 3 arctan z follows, and 
finally the result is J = — Aarctan x 4- C. 


Table 8.3 Substitutions for Integration of Irrational Functions I 





Integral * Substitution 

















where r is the lowest common multiple 
of the numbers m, n,.... 


[^ R(: +ba+ Vax" + ba +c) da: One of the three Euler substitutions: 
1. δα Vax? bx 4- c — t — ax 
2. Fore» 0 Vax? + br +c = xt νο 


3. If the the polynomial az? + bz + c 
has different real roots: 
ax? + bx +c = a(x — a)(x — p) ax? +br +c=t(x—a) 











* The symbol R denotes a rational function of the expressions in parentheses. 

The numbers n,m, ... are integers. 

1 Ifa <0, and the polynomial aa? + ba +c has complex roots, then the integrand is not defined 
for any value of x, since Vax? + bx + c is imaginary for every real value of x. In this case the in- 
tegral is meaningless. 











8.1.4 Integration of Irrational Functions 


8.1.4.1 Substitution to Reduce to Integration of Rational Functions 


Irrational functions cannot always be integrated in an elementary way. Table 21.7, p. 1065 contains 
a wide collection of integrals of irrational functions. In the simplest cases one can introduce substitu- 
tions, as in Table 8.3, such that the integral can be reduced to an integral of a rational function. 


The integral J R (x, Vaz? + ba + c) dx can be reduced to one of the following three forms 


8t. Va? + a?) dz, (8.19a) [RG va? — a2) dz, (8.19b) 


[Ro Va? — x?) dx, (8.19c) 


because the quadratic polynomial ax? + bx + c can always be written as the sum or as the difference of 
two complete squares. Then, one can use the substitutions given in Table 8.4. 


1 115 113 
WB A: 4-167417 — T +4r+4+ 1) =4 ΠΣ (5) | =4 [a+ (5) Jon =24+2. 
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9 5 329 (v5y Ay 3 
. y2 yr = 72 ert—-—-—-=[r#+-— = | = 2g2—[|-— ith z; -z--. 
E B: r? +3r+1=7r ui duin 1 (s 5) ($) 2 9 with x Tt 


B C: 24-2: —1—2?42:—121?—(z—1? 2  — 22 with x4 — x — 1. 











Tabelle 8.4 Substitutions for Integration of Irrational Functions II 





| Integral Substitution 





— 


R (x (x, Va? + a?) dx x=asinht or x«=atant 
R(x v, Va? el r-acosht or r-asect 
R (x, va a) dx x =asint or x=acost 





SS ος) 








8.1.4.2 Integration of Binomial Integrands 
An expression of the form 

x" (a 4- bz")? (8.20) 
is called a binomial integrand, where a and b are arbitrary real numbers, and m, n, p are arbitrary pos- 
itive or negative rational numbers. The theorem of Chebyshev tells that the integral 

[oma + ba”)? dx (8.21) 
can be expressed by elementary functions only in the following three cases: 
Case 1: If pis an integer, then the expression (a + ba")? can be expanded by the binomial theorem, so 
the integrand after eliminating the parentheses will be a sum of terms in the form cx^, which are easy 


to integrate. 


Case 2: If Hcr 





is an integer, then the integral (8.21) can be reduced to the integral of a rational 


n 
function by substituting t = Va + bx”, where r is the denominator of the fraction p. 


Case 3: If Rel 





+ pis an integer, then the integral (8.21) can be reduced to the integral of a rational 





: sas | @ + ba” . . ] 
function by substituting t — 5 — ; where r is the denominator of the fraction p. 
Yo oc 





EE 1 1 1 1 
πλ ο [οί (1+: Dor dz; ο... = = 2, (Case 2): 


‘av n 





4 
Substitution t= 1+ Ya, « = (t? — 1)*, dz — 120 (0 — 1)*dt; = E pc = "I — t?) dt 


= ΕΠΙ. -Ὁ ο. 











a? dr 3 32-1 1 m+1 4 m+1 19 
. = qe T -1/Α. 7 q . | $ 
HB: / VII z"(14- c) iom —3,m-—3,Dp p Ἔα. . p 12 


Because none of the three conditions is fulfilled, the integral is not an elementary function. 
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8.1.4.3 Elliptic Integrals 


1. Indefinite Elliptic Integrals 
Elliptic integrals are integrals of the form 


Je (x, V ax? + ba? + cx + e) da, [^ (9, 


Usually they cannot be expressed by elementary functions; if it is still possible, the integral is called 
pseudoelliptic. The name of this type of integral originates from the fact that the first application of 
them was to calculate the perimeter of the ellipse (see 8.2.2.2, 2., p. 502). The inverses of elliptic inte- 
grals are the elliptic functions (see 14.6.1, p. 762). Integrals of the types (8.22), which are not integrable 
in elementary terms, can be reduced by a sequence of transformations into elementary functions and 
integrals of the following three types (see [21.1], [21.2], [21.7]): 


(1 — kt?) dt 
(1 — #2)(1 — kt?) 





axt + br? + cr? + ex 4- f) dx. (8.22) 


(0<k<1), (8.23a) J (0<k<1), (8.23b) 


/ (1 — #2)(1 — #24?) 


dt 
fo ος ποσο (0 « k « 1). (8.23c) 
(1 nt2)/ (1 — 12)(1 — K2t?) 
Concerning the parameter n in (8.23c) one has to distinguish certain cases (see [14.1]). 
By the substitution t = siny (0 << 5) the integrals (8.23a,b,c) can be transformed into the 


Legendre form: 


d 
Elliptic Integral of the First Kind: / LL ci (8.244) 
5 J1- E?sin?g 
Elliptic Integral of the Second Kind: I V1 — E? sin? o dp. (8.24b) 
1 
Elliptic Integral of the Third Kind: j τ (8.246) 
(1+ nsin? y)\/1 — k? sin? p 


2. Definite Elliptic Integrals 
Definite integrals with zero as the lower bound corresponding to the indefinite elliptic integrals are de- 
noted by 


e 
| —— = F(k (8.25a) / 1— K'sin? y dy — E(k, o), (8.25b) 
ὁ γι- RE sin?v ) 

7 dw 

| ———— =I[l(n,k,y) (for all three integrals 0 < k « 1 holds). (8.25c) 


5 (1o nsin? )u/1— K2 sin? b 
These integrals are called incomplete elliptic integrals of the first, second, and third kind. For y = E 


the first two integrals are called complete elliptic integrals, and they are denoted by 


3 
K=F (k 5) = J — (8.26a) 
2 à yl- E sin? 


8.1 Indefinite Integrals 491 





: 
Ε-- Ε (r, 5) = / 1— Ki gin? y dii (8.26b) 
; 


Tables 21.9.1, 2, 3 contain the values for incomplete and complete elliptic integrals of the first and 
second kind F, E and also K and E. The series expansions of K and E see 8.2.5, 7. p. 515. 


W The calculation of the perimeter of the ellipse leads to a complete elliptic integral of the second kind 
as a function of the numerical eccentricity e (see 8.2.2.2, 2., p. 502). For a = 1.5, b = 1 it follows that 
e = 0.74. Since e = k = 0.74 holds, one gets from Table 21.9.3, p. 1103: sina — 0.74, i.e., a e 48° 


and E(k, 5) = E(0.74) — 1.3238. It follows that U — 4aE(0.74) © 4aB(a = 48°) = 4-1.3238a e 7.94. 


Numerical integration yields the better approximation 7.932711. 


8.1.5 Integration of Trigonometric Functions 


8.1.5.1 Substitution 
With the substitution 





x 2dt 2t 1-9 
t=tan =, ie, dx= I E’ sing = τ ον cos T = IFE’ (8.27) 
an integral of the form 
f Gin a cos) dz (8.28) 


can be transformed into an integral of a rational function, where R denotes a rational function of its 
arguments. 


2t 2 
[qst e 2 
l-csinr ( τα]. 1 1 t 1 
a / dx = [ a= > f(t 2 z) ο ο 
i T S 5. + Fr d "ER * 3ln|tl 


sin «(1 + cos a i 1-8 
1-82 1+??? 


2 
tan = 














x 
tan? — : 
= 2 in ee 
4 2 2 





+ C. In some special cases more simple substitutions can be applied. 








If the integrand in (8.28) contains only odd powers of the functions sin x and cos x, then by the substi- 
tution t = tan z a rational function can be obtained in a simpler way. 


8.1.5.2 Simplified Methods 


Case 1: Jz (sin x) cos x da. Substitution t= singz, cosgxdr = dt. (8.29) 
Case 2: [5 (cos x) sin x dz. Substitution t= cosx, sinadx = —dt. (8.30) 
Case 3: E qd: (8.31a) 


a) n = 2m + 1, odd: 
for 2 ἀφ -- /α — cos? x)” sin z dz = — fo — £)" qt with t= cosg. (8.31b) 
b) n — 2m, even: 


EM su. 1 Pu 1 .4ym : - τη, 
[sin zdz- f [5(1 — eos22) dx = = | (1- cost) dt with t= 22. (8.31c) 
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The power is halved in this way. After removing the parentheses in (1 — cost)" follows integration 
term-by-term. 


Case 4: [ cos" x dx. (8.32a) 
a)n= 2m + 1, odd: 

fos zdr = /α — sin? z)" cosa dz — fc —- 8y qt with t= sinz. (8.32b) 
b) n = 2m, even: 

fos xdr = 1 [50 + cos n) dx = m fo + cost)” dt with t= 27. (8.32c) 
The power is halved in this way. After removing the parentheses follows integration term-by-term. 
Case 5: / sin” x cos” x dx. (8.33a) 
a) One of the numbers m or n is odd: Reducing it to the cases 1 or 2. 


ΒΑ: [ sin? xcos® zdr = EE (1 — sin? x)? cos z dz = fea — t) dt+C with t-sinz. 





It 
HB: I το with t= cosg. 


b) The numbers m and n are both even: Reducing it to the cases 3 or 4 by halving the powers using 


the trigonometric formulas 
sin 2x ; 1 — cos 2a” 1 + cos 2” 
E E sin? x = Pe E cos? x = e n (8.33b) 


1 1 
Oo n x cost x dz = [osm x cosa)? cos? x da = 8 J sin? 2x(1 + cos 2x) dz = 3 n 2x cos 2x dx + 





sin x cos 7 = 


1 1 1 1 
3g _[ (1 - cose) de = g” 2s + ig? T g tO. 


Case 6: fon" zde = fia 7? v (sec? x — 1) dr — n 7? y (tan x)' dy — [ezar 


tan”! x 
= -f tan"? y dz. (8.344) 
^n-1 
By repeating this process one decreases the power and depending on whether n is even or odd one finally 
gets the integral 


fa =r or [tan x dx = — ln | cos z| (8.34b) 
respectively. 
Case 7: [ cot” v dz. (8.35) 


The solution is similar to case 6. 
Remark: Table 21.7, p. 1065 contains several integrals with trigonometric functions. 


8.1.6 Integration of Further Transcendental Functions 
8.1.6.1 Integrals with Exponential Functions 


Integrals with exponential functions can be reduced to integrals of rational functions if it is given in the 
form 


[5 (e εν μες dg, (8.364) 


$.2 Definite Integrals 498 





where m, n, ...,p are rational numbers. Necessary are two substitutions to calculate the integral: 
1. Substitution of t = e” results in an integral 


1 
/; R(t^,t^,...,() dt. (8.36b) 


2. Substitution of z — Vt, where r is the lowest common multiple of the denominators of the fractions 
m,n,...,p, results in an integral of a rational function. 

8.1.6.2 Integrals with Hyperbolic Functions 

Integrals with hyperbolic functions, i.e., containing the functions sinh z, cosh x, tanh x and coth xv in 
the integrand, can be calculated as integrals with exponential functions, if the hyperbolic functions 


are replaced by the corresponding exponential functions. The most often occurring cases / sinh" z dz , 


1 cosh” x dz , 1 sinh” x cosh” a dx can be integrated in a similar way to the trigonometric functions 
(see 8.1.5, . p. 491). 
8.1.6.3 Application of Integration by Parts 


If the integrand is a logarithm, inverse trigonometric function, inverse hyperbolic function or a product 
of zx" with Inz, eù, sinas or cosas or their inverses, then the solution can be got by a single or 
repeated integration by parts. 

In some cases the repeated partial integration results in an integral of the same type as the original 
integral. In this case one has to solve an algebraic equation with respect to this expression. One can 
calculate in this way, e.g., the integrals 1 e°" cos bz dz, / e“ sin bx dx, where one needs integration by 
parts twice. The same type of function should be chosen for the factor u in both steps, either the 
exponential or the trigonometric function. 


One also uses integration by parts if there are integrals in the forms / P (x)e^* dx, | P (a) sin bx dx 


and / P (x) cos bx dx, where P (x) is a polynomial. (Choosing u = P (x) the degree of the polynomial 
will be decreased at every step.) 


8.1.6.4 Integrals of Transcendental Functions 


The Table 21.7, p. 1065, contains many integrals of transcendental functions. 


8.2 Definite Integrals 


8.2.1 Basic Notions, Rules and Theorems 
8.2.1.1 Definition and Existence of the Definite Integral 
1. Definition of the Definite Integral 


The definite integral of a bounded function y = f (x) defined on a finite closed interval [a, b] is a number, 
which is defined as a limit of a sum, where either a < b can hold (case A) or a > b can hold (case B). 
In a generalization of the notion of the definite integral (see 8.2.3, p. 506) in the following sections also 
functions are considered, which are defined on an arbitrary connected domain of the real line, e.g., on 
an open or half-open interval, on a half-axis or on the whole numerical axis, or on a domain which is 
only piecewise connected, i.e., everywhere, except finitely many points. These types of integrals belong 
to improper integrals (see 8.2.3, 1., p. 506). 

2. Definite Integral as the Limit of a Sum 

The notion of the definite integral is obtained by the limit given in the following procedure (see Fig. 8.1, 
p. 480): 
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1. Step: The interval [a, b] is decomposed into n subintervals by the choice of n — 1 arbitrary points 


11, 22,..., 051 S0 that one of the following cases occurs: 
a = to < T1 < T2 <L -+ < Li L- < En-1 L Tn =b (case A) or (8.37a) 
a = to > T1 > T2 >+: > Li >- > En > Tn =b (case B). (8.37b) 


2. Step: A point €; is chosen in the inside or on the boundary of each subinterval as in Fig. 8.4: 
vio <& <a (incase A) or vi-1 > & >a; (incase B). (8.37c) 


Ax, Ax, Ax, Δ) 





Figure 8.4 


3. Step: The value f (£;) of the function f (x) at the chosen point is multiplied by the corresponding 
difference Axj_1 = x; — 2-1, i-e., by the length of the subinterval taken with a positive sign in case A 
and taken with negative sign in case B. This step is represented in Fig. 8.1, p. 480 for the case A. 

4. Step: Then all the n products f (£;) Av; 1 are added. 


5. Step: The limit of the obtained integral approximation sum or Riemann sum 


Y f (&) Avia (8.38) 


il 
is calculated if the length of each subinterval Az; 4 tends to zero and consequently their number n 
tends to oo. Based on this, one can also denote Az; as an infinitesimal quantity. 


If this limit exists independently of the choice of the numbers x; and £;, then it is called the definite 
Riemann integral of the considered function on the given interval. One writes 





b n 
ο im, Y f) Ana (8.39) 


The endpoints of the interval are called limits of integration and the interval [a,b] is the integration 
interval; a is the lower limit, bis the upper limit of integration; x is called the integration variable and 
f(x) is called the integrand. 


3. Existence of the Definite Integral 


The definite integral of a continuous function on [a,b] is always defined, i.e., the limit (8.39) always 
exists and is independent of the choice of the numbers x; and €;. Also for a bounded function having 
only a finite number of discontinuities on the interval [a, b] the definite integral exists. The function 
whose definite integral exists on a given interval is called an integrable function on this interval. 


8.2.1.2 Properties of Definite Integrals 


'The most important properties of definite integrals explained in the following section are presented in 
Table 8.5, p. 496. 
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1. Fundamental Theorem of Integral Calculus 
If the integrand f(2) is continuous on the interval [a, b], and F (x) is a primitive function, then 


fi (x) dx = / F'(z)dz — F(z)[ — F (5) — F (a) (8.40) 


holds, i.e., the calculation of a definite integral is reduced to the calculation of the corresponding indef- 
inite integral, to the determination of the antiderivative: 


= | Ε{9) 45. C. (8.41) 
Remark: There are integrable functions which do not have any primitive function, but one will see 
that, if a function is continuous, it has a primitive function. 
2. Geometric Interpretation and Rule of Signs 
1. Area under a Curve Let f (x) > 0 for all x in [a, 0]. Then the sum (8.38) can be considered as the 
total area of the rectangles (Fig. 8.1), p. 480, which approximate the area under the curve y = f (x). 


Therefore the limit of this sum and together with it the definite integral is equal to the area of the region 
A, which is bounded by the curve y = f (x), the x-axis, and the parallel lines x = a and x = b: 


A= fro z)dz (a«b and f(z) 0 fora z « 5). (8.42) 


2. Sign Rule Ifa function y = f(x) is piecewise positive or negative in the integration interval 
(Fig. 8.5), then the integrals over the corresponding subintervals, that is, the area parts, have pos- 
itive or negative values, so the integration over the total interval yields the sum of signed areas. 
In Fig. 8.5a—d four cases are represented with the different possibilities of the sign of the area. 


id la BB 


f(x)>0, a<b f(x)>0, a>b f(x)«0, a<b f(x)«0, a»b 
a) b) c) d) 


Figure 8.5 


W A: Í sinrdx (read: Integral from x = 0 to x = 7) — (—cosz|g — (— coss + cos 0) = 2. 
Jo 








2π 
B Db: [ sinzdr (read: Integral from x = 0 to x = 27) = (— cos |2™ = (— cos 27 + cos 0) = 0. 
Jo 


3. Variable Upper Limit 


1. Particular Integral If the upper limit is considered as variable (Fig. 8.6, region ABC D), then 
there is an area function in the form 


- / f(t)dt (a«b and f(z) » 0 for z » a). (8.43) 


This integral is called a particular integral. 
To avoid accidently confusing the variable upper limit x with the variable of the integrand, often the 
integration variable is denoted by t instead of x as in (8.43). 
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y=f(x) 
M 
ZZ, (x) f(x) 
- N, 
x 0 | x 
Figure 8.6 Figure 8.7 Figure 8.8 


2. Differentiation ofthe Definite Integral with Respect to the Upper Limit A definite integral 
T 
with a variable upper limit | f(t) dt, if this integral exists, is a continuous function F(x) of the upper 
Ja 


limit. If f(a) is continuous, then F(x) is differentiable with respect to x, i.e., it is a primitive function 
of the integrand. So, if f(x) is continuous on [a,b], and x € (a,b) holds 


F(z)-f(z) or 2 roa ro. (8.44) 


The geometrical meaning of this theorem is that the derivative of the variable area S(x) is equal to the 
length of the segment NM (Fig. 8.7). Here, the area, just as the length of the segment, is considered 
according to the sign rule (Fig. 8.5). 


4. Decomposition of the Integration Interval 


The interval of integration [a, b] can be decomposed into subintervals. The value of the definite integral 
over the complete interval is 


[re ax = | πω a | κο dx. (8.45) 


This is called the interval rule. If the integrand has finitely many jumps, then the interval can be 
decomposed into subintervals such that on these subintervals the integrand will already be continuous. 
Then the integral can be calculated according to the above formula as the sum of the integrals on the 
subintervals. 

At the endpoints of the subintervals the function must be defined by its corresponding left or right-sided 
limit, if it exists. If it does not, then the integral is an improper integral (see 8.2.3.3, 1., p. 509). 
Remark: The formula above is valid also in the case if c is outside of the interval [a, 0] if one can suppose 
that the integrals on the right-hand side exist. 


8.2.1.3 Further Theorems about the Limits of Integration 


1. Independence of the Notation of the Integration Variable 


The value of a definite integral is independent of the notation of the integration variable: 


b b b 


/ f(x) dr = EO du = [ro dt. (8.46) 
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Table 8.5 Important Properties of Definite Integrals 














Property Formula 
b 
Fundamental theorem of / f@\es =F (ο) = F(b)— F(a) with 
the integral calculus lc a 
(f (x) is continuous) F(x) = n dr +C or F'(z) — f(z) 
b a 
Interchange rule / f(a) dx = — / f(a) dx 
a b 
Equal integration limits Í f(x) dx =0 





Interval rule jio ‘dx = jro f(x) dz jro f(a 


b 


Independence of the notation 
of the integration variable / f(x) dz = fu (u) du = [κ f(t)dt 





a a 


T 
Differentiation with respect to d . 
Ghewoner inii dz fi (t) dt = f(x) with f(x) continuous 
a 











of the integral calculus 











Mean value theorem Jro ρε ο B. (gere 





2. Equal Integration Limits 


If the lower and upper limits are equal, then the value of the integral is equal to zero: 
a 
EO dz — 0. (8.47) 
a 


3. Interchange of the Integration Limits 
After interchanging the limits, the integral changes the sign (interchange rule): 


] io απ -- ως. (8.48) 
a b 


4. Mean Value Theorem and Mean Value 


1. Mean Value Theorem [fa function f(x) is continuous on the interval |a, b], then there is at least 
one value € in this interval such that in the case A with a < € < band in the case B with a > € > b (see 
8.2.1.1, 2., p. 494) 


b 
[ f@ ae =(b- @) F() (8.49) 


is valid. 
The geometric meaning of this theorem is that between the points a and b there exists at least one point 
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€ such that the area of the figure ABC D is equal to the area of the rectangle AB'C"D in Fig. 8.8. The 
value 





m= 


— f to) du (8.50) 


is called the mean value or the arithmetic average of the function f (x) in the interval [a, b] . 


2. Generalized Mean Value Theorem If the functions f(x) and (x) are continuous on the closed 
interval [a, b], and y(x) does not change its sign in this interval, then there exists at least one point £ 
such that 


/ f(zye(z) dz — f(£) [Ῥω ρία (a « € « b) (8.51) 


is valid. 


5. Estimation of the Definite Integral 

The value of a definite integral lies between the values of the products of 
the infimum m and the supremum M of the function on the interval [a, 0] 
multiplied by the length of the interval: 


m(b — a) € EO dx < M(b—a) (a«b,f(x) 20). (8.52) 





If f is continuous, then m is the minimum and M is the maximum of the 
function. It is easy to recognize the geometrical interpretation of this the- 
orem in Fig. 8.9. Figure 8.9 


8.2.1.4 Evaluation of the Definite Integral 


1. Principal Method 

The principal method of calculating a definite integral is based on the fundamental theorem of integral 
calculus, i.e., the calculation of the indefinite integral (see 8.2.1.2, 1., p. 495), e.g., using Table 21.7 p. 
1065. Before substituting the limits it must be ckecked if there is an improper integral. 

Nowadays computer algebra systems can be used to determine analytically the indefinite and definite 
integrals (see Chapter 20). 

2. Transformation of Definite Integrals 

In many cases, definite integrals can be calculated by appropriate transformations, with the help of the 
substitution method or partial integration. 


E A: Using the substitution method for I = [ Va? — r? dz. 
` T 
2 
a arcsin 1 51 
Then7= f væ-dr= f a?4/ 1 — sin? tcost dt — a? / cos? t dt — a? I 5 (1 + cos 2t) dt. 
0 0 0 2 


arcsin 0 





First one substitutes: x = y(t) = asint, t = y(x) = arcsin = y v(0) — 0, v(a) — 
z 








With the further substitution t — q(z) a z = (t) = 2t, (0) = 0, φ) π the value 
PT 2 2 2 
I= S 1|Σ + τ coszdz — L + "n sin z|o = T is obtained. 


1 1 
W B: Method of partial integration: / zc? da — [ve?]] — f e* dy 2e—(e—1)-1. 
0 0 
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3. Method for Calculation of More Difficult Integrals 

If the determination of an indefinite integral is too difficult and complicated, or it is not possible to 
express it in terms of elementary functions, then there are still some further ideas to determine the value 
of the integral in several cases. Here the integration of functions with complex variables is mentioned 
(see the examples on p. 754-757) or the theorem about the differentiation of an integral with respect 
to a parameter (see 8.2.4, p. 512): 


dj f 0 f(z,t) 
af fe) dx [τοι 9 (8.53) 





!Ir-l ; 1gt—-]1 
BJ = [ dx. Introducing the parameter t: F(t) = [ — 
o lur Jo Ine 


νην : .dF κ. z5—1 . pes " [5 I 1 aul 1 
Using (8.53) for F(t): d d. srl mz dx το da: h? da τετ bI 


dt 
T+1 
4. Integration by Series Expansion 
If the integrand f(x) can be expanded into a uniformly convergent series 


f(x) = pila) + palt) t + pala) + (8.54) 
in the integration interval [a, b], then the integral can be written in the form 


[1 dx = [49 dx + l palx) dz +--- + / n(x) da +--+. (8.55) 


In this way the definite integral can be represented as a convergent numerical series: 


dr; F(0) — 0; F(1) = I. 

















In(r + 1)|5 — In(t - 1). Result: I — F(1) 2 n2. 








t 
Integration: F(t) — F(0) i 
Jo 











b b b b 
EO dx = [ 49 dx + ] (p(x) da + +++ + / n(x) da +---. (8.56) 
a a a a 
When the functions p(x) are easy to integrate, if, e.g., f(x) can be expanded in a power series, which 
b 
is uniformly convergent in the interval [a, b], then the integral / f (x) dx can be calculated to arbitrary 
accuracy. 
3 1/2 42 : ; 2 r2 
WB Calculate the integral J = [ e dx with an accuracy of 0.0001. The series e~™ = 1 — AT πρ 
Jo : 


zt x8 αδ 














OF grap is uniformly convergent in any finite interval according to the Abel theorem (see 
2 x xt zê x8 
7.3.3.1, p. 469), so fe? dr —z|l 1-3 | 2-5 3-7 | aeg holds. With this result it 
1/2 2 
: — 2-2" 7, — 1 1 1 il 1 n 
follows that 1 — ff e dx Σα 311.3 * 94.91-8 3.31.7 ^ 98. 41-9 ) 


= 1 (1 — 4 + Ti . "m + sion -»' ). To achieve the accuracy 0.0001 for the calculation of the 


integral it is enough to consider the first four terms, according to the theorem of Leibniz about alter- 
nating series (see 7.2.3.3, 1., p. 463): 


1/2 0, 
~ 4(1— 0.08333 + 0.00625 — 0.00037) = 4 - 0.92255 = 0.46127, Í ο" de = 0.4613. 


5. Graphical Integration 
Graphical integration is a graphical method to integrate a function y = f(x) which is given by a curve 


b 
AB (Fig. 8.10), i.e., to calculate graphically the integral / f(a) dx, the area of the region Mp ABN: 
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1. The interval ΜΟΝ is divided by the points 
11/2, 01, 13/2, 2, --. Zn 1, £n—1/2 into 2n equal parts, where the 
result is more accurate if there are more points of division. 

2. At the points of division 11/5, 13/5, .. . &n—1/2 One draws ver- 
tical lines intersecting the curve. The ordinate values of the seg- 
ments are denoted on the y-axis by O41, O4», ..., OA. 

3. A segment OP of arbitrary length is placed on the negative 
a-axis, and P is connected with the points A4, A», .. ., Aj. 

4. Through the point Mp a line segment is drawn parallel to P A 
to the intersection point with the line x = 2; this is the segment 
MoM,. Through the point M; the segment MM» is drawn paral- 
lel to P As to the intersection with the line x = zr», etc., until the 
last point M, is reached with the abscissa xp. 

The integral is numerically equal to the product of the length of 
OP and the length of N M,: 





























b 
Figure 8.10 I f(£) dx = OP - NM. (8.57) 


By a suitable choice of the arbitrary segment OP the extent of the graph can be influenced; the smaller 
ET qe b ει... 
the graph is wanted, the longer the segment OP should be chosen. I OP — 1, then | f(a) dz = NM), 
Ja 
and the broken line Mo, M1, Mo,..., Mn represents approximately the graph of a primitive function of 


f(x), i.e., one of the functions given by the indefinite integral / f(a) da. 


6. Planimeter and Integraph 
A planimeter is a tool to find the area bounded by a closed plane curve, thus also to compute a definite 
integral of a function y = f(x) given by the curve. Special types of planimeter can evaluate not only 
fy dx, but also [9 dx and [9 dx. 


T 
An integraph is a device which can be used to draw the graph of a primitive function Y = Í f(t) dt if 
Ja 
the graph of a function y — f(x) is given (see [19.30]). 
7. Numerical Integration 


If the integrand of a definite integral is too complicated, or the corresponding indefinite integral cannot 
be expressed by elementary functions, or the values of the function are known only at discrete points, 
e.g., from a table of values, then the so-called quadrature formulas or other methods of numerical math- 
ematics are used (see 19.3.1, p. 963). 


8.2.2 Applications of Definite Integrals 
8.2.2.1 General Principle for Applications of the Definite Integral 


1. The quantity A to be determined is decomposed into a large number of very small quantities, i.e., 
into infinitesimal quantities: 

A= a αρ s. (8.58) 
2. Every one of these infinitesimal quantities a; is replaced by a quantity à;, which differs only very 


slightly in value from a;, but which can be integrated by known formulas. Here the error a; — a; — à; 
should be an infinitesimal quantity of higher order than a; and à;. 


3. Representation of @; by a variable x and a function f(x) so that à; has the form f(x;) Az;. 
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4. Evaluation of the desired quantity as the limit of the sum 


n n b 
A= jm Y = Jim Y: Gn = EO dz, (8.59) 


where Ax; > 0 holds for every 7. The lower and upper limit for x is denoted by a and b. 

W Evaluating the volume V of a pyramid with base area S and height H (Fig. 8.11a-c): 

a) Decomposition of the required volume V by plane sections into frustums (Fig. 8.11a): V — v4 
3 d + Up. 

b) Replacing every frustum by a prism, whose volume is 6;, with the same height and with a base area 
of the top base of the frustum (Fig. 8.11b). The difference of their volumes is an infinitesimal quantity 
of higher order than v;. 


c) Representation of the volume 9; in the form 0; = S; Ah; , 
















Λ where A; (Fig. 8.11c) is the distance of the top surface 
Αν N " < from the vertex of the pyramid. Since S; : S = h? : H? 
2 

H uo ] Ah; one can write: 0; — 2 Ahi. 

EA a? d) Calculation of the limit of the sum 

3 " Γ Sh? n S 

ml aan ο τι ο) 2 H 
Figure 8.11 uu x s Lo n 
=. = 0 


8.2.2.2 Applications in Geometry 


1. Area of Planar Figures 


1. Area of a Curvilinear Trapezoid Between B and C (Fig. 8.12a) if the curve is given by an 
equation in explicit form (y = f(x) and a € x € b) or in parametric form (x = x(t), y = y(t), tı < 
i < 13): 


b tg 
Sanon = f f(x) de = f y(t)a'(t) dt (f(x) > 052(5) = a; (ta) = b; y(t) 2 0). (8.602) 
a ty 


2. Area ofa Curvilinear Trapezoid Between G and H (Fig. 8.12b) if the curve is given by an 
equation in explicit form (x = g(y) and a € y € £8) or in parametric form (x = a(t), y = y(t), tı < 
t < 13): 
B ty 
Seron = [ su) dy = [ «ov dt (g(y) > 0; y(tı) = a; y(t2) = 8; x(t) > 0). (8.602) 
à i 


3. Area of a Curvilinear Sector (Fig. 8.12c), bounded by a curve between K and L, which is 
given by an equation in polar coordinates (p = p(q), «1 € 9 € v): 


1 2 
Sox, 7 5 / p? dg. (8.60c) 
v1 


Areas of more complicated figures can be calculated by partition of the area into simple parts, or by 
line integrals (see 8.3, p. 515) or by double integrals (see 8.4.1, p. 524). 
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a) 0 9 99, Φ; 5 


Figure 8.12 


2. Arc-length of Plane Curves 


1. Arc-length of a Curve Between Two Points (I) A and B, given in explicit form (y = f(a) or 
x = g(y)) or in parametric form (r — x(t), y = y(t)) (Fig. 8.13a) can be calculated by the integrals: 


b β to 
- J l-c[f'(z)f dz = / [g'(y)]? + 1 dy = J [x (t)]2 + [y'(t)]? dt. (8.61a) 


With the differential of the arc-length dl one gets 
L= fa with dl? = dx? + dy’. (8.61b) 


W The perimeter of the ellipse with the help of (8.61a): With the substitutions x — «(t) = asint, y = 


y(t) = bcost it follows that Lo E ya? — (a? — b?) sin? tdt = ΤΝ V1 — e?sin? t dt, where 


e = Va? — D? [a is the numerical aum s of the ellipse. 
Since r — 0, y — band x — a, y = 0 the limits of the ue > the first quadrant are tı = 0 and 


t2 = T/2 so, for the perimeter of the ellipse holds Be E uf” V1-— esin?tdt — a E(k 5) with 
k — e. 'The value of the integral E(k, Di is given in aula 21.9 (see example in 8.1.4.3, p. ia 


2. Arc-length of a Curve Between Two 


Points (II) C and D, given in polar coordinates 
(p = ply) (Fig. 8.13b): ΤΩ͂Ι 
La = sof p? +( (861€) 9 


With the dicont of the Je dl one gets a) 0 M 0; 
pe / dl with dl? = p2dy? + dp?. (8.614) 


το 





Figure 8.13 


3. Surface Area of a Body of Revolution (see also First Guldin Rule, p. 506) 


1. The area of the surface of a body given by rotating the graph of the function y — f(x) > 0 around 
the z-axis (Fig. 8.14a) is: 


b b 2 
dy 
=2r | ydl= ia OU) gy. > 
S an f yal 2m uz) 1+ (2) dx. (8.62a) 


a 
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2. The area of the surface of a body given by rotating « = f(y) > 0 around the y-axis (Fig. 8.14b) is: 


B B dx\? 
S = 2r fea — 2m Γεω (: + τ) dy. (8.62b) 
j i dy 


o o 
3. To calculate the area of more complicated surfaces see the application of double integrals in 8.4.1.3, 
p. 527 and the application of surface integrals of the first kind, 8.5.1.3, p. 535. General formulas for 
the calculation of surface areas with double integrals are given in Table 8.9 (Applications of double 
integrals), p. 528. 
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Figure 8.14 Figure 8.15 


4. Volume (sce also Second Guldin Rule, p. 506) 
1. The volume of a rotationally symmetric body given by rotation around the x-axis (Fig. 8.14a) is: 


b 
V2m [9 dx. (8.63a) 


2. The volume of a rotationally symmetric body given by rotation around the y-axis (Fig. 8.14b) is: 
b 
V2m Je dy. (8.63b) 
a 
3. The volume of a body, whose section perpendicular to the x-axis (Fig. 8.15) has an area given by 
the function $ — f(x), is: 


V= / f(a) dx. (8.64a) 


W Calculation of the volume of an ellipsoid of revolution centered at the origin. Since the ellipsoid 
of revolution x?/a? + y?/b? + 22/c? = 1 (see (3.412), 3.5.3.13,2., p. 224 and Fig.8.16) with a = ο 
arises by rotating the ellipse y?/b? + 2?/c? = 1 about the y-axis, the area of the circle shaped cross- 
sections being parallel to the z, z-plane is S — f(y) = mz? = re(1 — y?/b*), so by integration V = 
οπο ρα — y? /b?)dy = (4/3)rbe. 

4. Cavalieri Principle If in the interval [a, b] there exists in addition to a cross-sectional area func- 
tion S — f(x) a second cross-sectional area function S — f(x) which has the same value for every x as 


f(x), then the volumes V according to (8.64a) and V according to (8.64b) are equal: 


b 
V= / f(z) - V. (8.64b) 
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Figure 8.16 Figure 8.17 Figure 8.18 


The original Cavalieri’s principle tells: If two bodies are bounded below and above by two parallel 
planes and in these planes and in every plane being parallel to these ones their cross-sectional areas are 
equal to each other, then the volumes of these bodies are also equal (Fig.8.17). 

5. Tables General formulas to calculate volumes with multiple integrals are given in Table 8.9 
(Applications of double integrals, see p. 528) and Table 8.11 (Applications of triple integrals, see 
p. 533). 


8.2.2.3 Applications in Mechanics and Physics 
1. Distance Traveled by a Point 


The distance traveled by a moving point during the time from tọ until T with a time-dependent velocity 


v= f(t) is 


T 
s= foa (8.65) 
lo 
2. Work 


The work in moving a body in a force field depends of the direction of motion. Here it is supposed that 
the direction of the field and the direction of the movement are constant and coincide along the x-axis. 
If the magnitude of the force Fis changing, i.e., F | = f(x), then the work W necessary to move the 
body from the point x — a to the point x — b along the z-axis is 





b 
W - f f) dz. (8.66) 


In the general case, when the direction of the force field and the direction of the movement are not 
coincident, the work is calculated as a line integral (see (8.130), p. 522) of the scalar product of the 
force and the variation of the position vector at every point of r along the given path. 


3. Gravitational and Lateral Pressure 
In a fluid at rest in the gravitational field of the Earth with a density o and gravitational acceleration 
g (see Table 21.2, p. 1053) the gravitational pressure p and the lateral pressure p, are distinguished. 
The gravitational pressure p at a depth x under the surface of the fluid (sce Fig.8.18) is 

τοσα. (8.672) 
The lateral pressure p, acting e.g. on a cover plate (surface area A) of a side opening of a container (see 
Fig.8.18), is caused by the pressure force F' acting in all directions. The differential pressure force dF’ 
acting perpendicular on a side surface-element dA in depth x under the surface of the fluid is 


dF =ogudA=ogxy(u)da. (8.67b) 
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Integration and division by A yields 
h2 
09 
Ds = B [πω — ag (x)) dx — og zo. (8.67c) 
hi 


Functions y;(r) and ya(x) describe the left and right boundaries of the cover plate and xc is the x- 
coordinate of its center of mass (see center of gravity of planar figures paragraph 5., p. 505). 
Remark: The center of mass of the cover plate usually does not coincide with the point of impact of 
pressure force F since the lateral pressure force is proportional to x. 

4. Moments of Inertia 


1. Moment of Inertia of an Arc The moment of inertia of a homogeneous curve segment y = f(x) 
with constant density p in the interval [a, b] with respect to the y-axis (Fig. 8.19a) is: 


b b 
Lp Je dico fey 4 (y? dz. (8.68) 


If the density is a function o(z), then its analytic expression is in the integrand. 


2. Moment ofInertia of a Planar Figure The 


y^ y moment of inertia of a planar figure with a homo- 
geneous density o with respect to the y-axis, where 
dy Sy y is the length of the cut parallel to the y-axis 

X 1 d (Fig. 8.19b), is: 

y Pas 
x+—| I= efe y da. (8.69) 

à dx bx οἱ a 
b) 


a dx b x 
a) 

















(See also Table 8.9, (Applications of the Double 
Integral), p. 528.) If the density is position depen- 























Figure 8.19 dent, then its analytic expression must be in the 
integrand. 
ya y^ yA 
L B B 
Yc C Yc 
7 
01a Xc b X 0 Xc X 

a) b) ο) 





Figure 8.20 


5. Center of Gravity, Guldin Rules 


1. Center of Gravity of an Arc Segment The center of gravity C of an arc segment of a homo- 
geneous plane curve y — f(x) in the interval [a,b] with a length L (Fig. 8.20a) considering (8.61a), 
p. 502, has the coordinates: 


b 
[^^ + y? dx 


L 


b 
fw 1 +y? dr 


i (8.70) 


Tc yc 
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2. Center of Gravity of a Closed Curve The center of gravity C of a closed curve y = f(x) 
(Fig. 8.20b) with the equations y; — fi(x) for the upper part and yo — f(x) for the lower part, and 
with a length L has the coordinates: 


[ 1G)? e (99) dz [OVE P va O89) a 


a 


wc = 











Ί 8.71 
Τ yc L (8.71) 
3. First Guldin Rule Suppose a plane curve segment is rotated around an axis which lies in the 
plane of the curve and does not intersect the curve. Choosing it as the z-axis the surface area S,o4 of 
the body generated by the rotated curve segment is the product of the perimeter of the circle drawn by 
the center of gravity at a distance rc from the axis of rotation, i.e., 277r¢, and the length of the curve 
segment L: 

Six — L-2nrg. (8.72) 


4. Center of Gravity ofa Trapezoid The center of gravity C of a homogeneous trapezoid bounded 
above by a curve segment between the points of the curve A and B (Fig. 8.20c), with an area S of the 
trapezoid, and with the equation y — f(x) of the curve segment AB, has the coordinates: 


b b 
/ vy dx i I y? dr 
ποπ =y Yo = —3— (8.73) 


5. Center of Gravity of an Arbitrary Planar Figure The center of gravity C of an arbitrary 
planar figure (Fig. 8.20d) with area S, bounded above and below by the curve segments with the 
equations y; = fi (a) and y2 = fo(x), has the coordinates 
b i 
fom — ya) d [αἱ — yà) dex 
EE a 
S 


a 


ποστ ‘i 
σ S yc 





(8.74) 


Formulas to calculate the center of gravity with multiple integrals are given in Table 8.9 (Applications 
of double integrals, p. 528) and in Table 8.11 (Applications of triple integrals, p. 533). 

6. Second Guldin Rule Suppose a plane figure is rotated around an axis which is in the plane of 
the figure and does not intersect it. Choosing it as the z-axis the volume V of the body generated by 
the rotated figure is equal to the product of the perimeter of the circle drawn by the center of gravity 
under the rotation, i.e., 27rc, and the area of the figure 5: 


Vrot = S * 2πτσο. (8.75) 


8.2.3 Improper Integrals, Stieltjes and Lebesgue Integrals 


8.2.3.1 Generalization of the Notion of the Integral 

The notion of the definite integral (see 8.2.1.1, p. 493), as a Riemann integral (see 8.2.1.1, 2., p. 494), 
was introduced under the assumptions that the function f(a) is bounded, and the interval [a, b] is closed 
and finite. Both assumptions can be relaxed in the generalizations of the Riemann integral. Some of 
them are mentioned in the following section. 

1. Improper Integrals 

These are the generalization of the integral to unbounded functions and to unbounded intervals. In the 
next paragraphs integrals with infinite integration limits and integrals with unbounded integrands are 
discussed. 

2. Stieltjes Integral for Functions of One Variable 

Considering two finite functions f(x) and g(x) defined on the finite interval [a, b] and decomposing the 
interval into subintervals, just as with the Riemann integral, but instead of the Riemann sum (8.38) 
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the following sum is constructed: 
D FED lole) = gli). (8.76) 
il 


If the limit of (8.76) exists, when the length of the subintervals tends to zero, and it is independent of 
the choice of the points x; and &;, then this limit is called a definite Stieltjes integral (see also [8.8]). 

E For g(x) = x the Stieltjes integral becomes the Riemann integral. 

3. Lebesgue Integral 

Another generalization of the integral notion is connected with measure theory (see 12.9, p. 693), where 
the measure of a set, measure spaces, and measurable functions are introduced. In functional analysis 
the Lebesgue integral is defined (see 12.9.3.2, p. 696) based on these notions (see [8.6]). The generaliza- 
tion with comparison to the Riemann integral is, e.g., the domain of integration can be a rather general 
subset of R” and it is partitioned into measurable subsets. 


There are different notations for the generalizations of the integrals (see [8.8]). 
8.2.3.2 Integrals with Infinite Integration Limits 
1. Definitions 


a) If the integration domain is the closed half-axis [a, +00), and if the integrand is defined there, then 
the integral is by definition 


Tro dx = jm f fo dx. (8.77) 


If the limit exists, then the integral is called a convergent improper integral. If the limit does not exist, 
then the improper integral (8.77) is divergent. 
b) If the domain of a function is the closed half-axis (—oo, b] or the whole real axis (—o0, +00), then 
one defines analogously the improper integrals 


i f(a) dx = lm Jio es (8.78a) fre dz = Jim fios (8.78b) 


c) At the limits of (8.78b) the numbers A and B tend to infinity independently of each other. If the 
limit (8.78b) does not exist, but the limit 


+A 
lim / f(z) dz, (8.78c) 
x 


exists, then this limit (8.78c) is called the principal value of the improper integral, or Cauchy’s principal 
value. 
Remark: An obviously necessary but not sufficient condition for the convergence of the integral (8.77) 
is lim f(x) — 0. 

z—00 
2. Geometrical Meaning of Integrals with Infinite Limits 
The ea (8.77), (8.78a) and (8.78b) give the area of the figures represented in Fig. 8.21. 


Ba fE 


rum 
dx B dx 1 1 1 

HB: L3 lim = lim (5 ) 3 (convergent). 
x? 


Boo J/g 1; Booo \2 B 


απ 
lim — = lim ln B = ow (divergent). 
Boo /1 x Booo 
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to dg B. dg 
B cC: / T = lim / oF _ = lim [arctan B — arctan A] = 5-2) — m (convergent). 
Je xz ΠΕΣΏΝ: 2-3 
Y 
0 x 
y 











Figure 8.21 


3. Sufficient Criteria for Convergence 


If the direct calculation of the limits (8.77), (8.78a) and (8.78b) is complicated, or if only the convergence 
or divergence of an improper integral is the question, then one of the following sufficient criteria can be 
used. Here, only the integral (8.77) is considered. The integral (8.78a) can be transformed into (8.77) 
by substitution of x by —a: 


/ f(x) dx = [res dx. (8.79) 


The integral (8.78b) can be decomposed into the sum of two integrals of type (8.77) and (8.78a): 


+00 c +00 
/ f(z)dz — i f(a) dx + 1 μοι (8.80) 


where c is an arbitrary number. 
Criterion 1: If f is integrable in any finite interval of [a, oo) , and if the integral 


+00 
/ | f(x) | dz (8.81) 
a 

is convergent, then also the integral (8.77) is convergent. In this case the integral (8.77) is called abso- 

lutely convergent, and the function f(x) is absolute integrable on the half-axis [a, +00). 

Criterion 2: If for the functions f(x) and y(x) 


f(x) >0, v(x) >0 and f(x) < p(x) for a< x « «oo (8.82a) 
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hold, then from the convergence of the integral 


+00 +00 
/ φ(α){ (8.820) the convergence of the integral / f(a) dx (8.82c) 


follows, and conversely, from the divergence of the integral (8.82c) the divergence of the integral (8.82b) 
follows. 


Criterion 3: Substituting (8.83a) and considering that for a > 0, a > 1 the integral (8.83b) 
+00 
1 dx 1 
plz) =-=, (8-83a) E oos ded nd (8.83b) 


TO um EET 
£ J z%  (a—l)a* 


is convergent and has the value of the right-hand side, and the integral of the left-hand side is divergent 
for a < 1, then one can deduce a further convergence criterion from the second one: 
If f(x) ina € x « oo is a positive function, and there exists a number œ > 1 such that for all x large 
enough 

f(x)zr* « k «oo. (k » 0, const) (8.83c) 


holds, then the integral (8.77) is convergent; if f (v) is positive and there exists a number a € 1 such 
that 


f(x)z* »c2»0 (c2 0, const) (8.83d) 
holds from a certain point x, then the integral (8.77) is divergent. 
[ n n . Substituting a = ; , gives E gl? — a — 1. The integral is divergent. 
4. Relations Between Improper Integrals and Infinite Series 
If 21, %2,...,%p,... is an arbitrary, unlimited increasing infinite sequence, i.e., if 
a < Tı < Ta <- <In <: with „im, In = 00, (8.84a) 


and if the function f(x) is positive for a < x < oo, then the problem of convergence of the integral 
(8.77) can be reduced to the problem of convergence of the series 


[re dx + [ro dr +--+ / f(a)da +--+. (8.84b) 


If the series (8.84b) is convergent, then the integral (8.77) is also convergent, and it is equal to the 
sum of the series (8.84b). If the series (8.84b) is divergent, then the integral (8.77) is also divergent. 
So the convergence criteria for series can be used for improper integrals, and conversely, in the integral 
criterion for series (see 7.2.2.4, p. 462) one can use the improper integrals to investigate the convergence 
of infinite series. 


8.2.3.3 Integrals with Unbounded Integrand 
1. Definitions 


1. Right Open Interval For a function f(x), which has a domain fa, b) open on the right, and at 
the point b it has the improper limit lim. f(x) — oo, the definition of the improper integral is the 
r—b— 
following: 
b 


[1 dx = lim, f io dz. (8.85) 


a 
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If this limit exists and is finite, then the improper integral (8.85) exists, and is called a convergent 
improper integral. If the limit does not exist or it is not finite, then the integral is called a divergent 
improper integral. 

2. Left Open Interval For a function f(x), which has a domain open on the left (a, b], and at the 
point a it has the limit lm g f(x) — oo, the definition of the improper integral analogously to (8.85) is 


b b 

1 f(z) dx = Jim, / f(x) dz. (8.86) 
a ate 

3. Two Half-Open Continuous Intervals For a function f(x), which is defined on the interval 

(a, b] except at an interior point c with a < c < b, i.e., for a function f(x) defined on the half-open 

intervals [a, c) and (c, b], or is defined on the interval [a, b], but at the interior point c it has an infinite 

limit at least from one side im, f(x) = or Him, f(x) = œ, the definition of the improper integral 


is 


jio )dr — lim "E ) dz 4- lim | 1 ) ἄν. (8.872) 


To 


Here the numbers € and ó tend to zero independently of each other. If the limit (8.87a) does not exist, 
but the limit 


im fro ) da + / f(x D (8.87b) 


cte 
does, then the limit (8.87b) is called the principal value of the improper integral or Cauchy's principal 
value. 
2. Geometrical Meaning 
The geometrical meaning of the integrals of unbounded functions (8.85), (8.86), and (8.87a) is to find 
the area of the figures bounded, e.g., from one side by a vertical asymptote as represented in Fig.8.22. 


At 


b dx 
ΒΑ: [ =: Case (8.86), singular point at x — 0. 
Jo Jax 











b) s. (8.86) 
Figure 8.22 


b dx b dx 
—= li — — lim (2Vb — = rgent). 
dcm im f Ja Hm Vb —2/e) =2Vb (convergent) 


π/ὰ π 
B D: [ tanadx: Case (8.85), singular point at « = 3 
Jo 


π/2 π/2-ε π 
[ tanxdx = lim [ tanadx = lim |lncos0 — In cos ( - ε)] — oo (divergent). 
Jo 0 £240 2 


£—H0, 
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HC: Le ==: Case (8.87a), singular point at z — 0. 
8 i . dx _ 3) 9/3 » 9 
| c = : iE 
Lim = Jim, | Va vum [5 σε = jim, παν 1+, lin EC -δ””) - 5 (convergent). 
2 23 ^ d: a 
B D: / i 1 : Case (8.87), singular point at r — +1. 
J-222— 
2 2a dx -1-ε 1-v 2 
/ 3 = lim J + lim 1 + lim / 
-243"—]1 εθν-ε $o10J-148 73H90 Jp 
— lim In|z? — 1||-1* +--+ = lim [In|1 + 2e +e? — 1] -—In3]+---= 00 (divergent). 
e>+0 e>+0 


3. The Application of the Fundamental Theorem of Integral Calculus 
1. Warning The calculation of improper integrals of type (8.87a) with the incorrect use of the formula 


/ f(x)dz— F(z)| with F"(x) = f(x) (8.88) 


(see 8.2.1.2,1., p. 495) usually results in mistakes if the singular points in the interval [a, b] are not taken 
into consideration. 
W E: Using formally the fundamental theorem one gets for the example D 








2 2rdx 
/ : T dp 1| 252 1n3 - In3 — 0, 


-2 £? — 
though this integral is divergent. 
2. General Rule The fundamental theorem of integral calculus can be used for (8.87a) only if the 
primitive function of f(x) can be defined to be continuous at the singular point. 
E F: In example D the function In |z? — 1| is discontinuous at « = +1, so the conditions are not 


ae f 3 . NOM : 1 
fulfilled. Considering example C, the function y — 5 x/3 is such a primitive function of yz on the 
T 





intervals [a, 0) and (0, b] which can be defined continuously at  — 0, so the fundamental theorem can 
be used in example C: 
9 


8 dy 3 2/9 uuo 
25 


ayg& 3 


4. Sufficient Conditions for the Convergence of an Improper Integral with 
Unbounded Integrand lim | f(x) — oo 


5(8^ - (-1)%) = 


b b 
1. If the improper integral 1 |f(x)| dx converges, then the improper integral 1 f(x)dx also con- 
a a 


verges. In this case it is called an absolutely convergent integral and the function f(x) is an absolutely 
integrable function on the considered interval. 


2. Ifthe function f(x) is positive in the interval [a,b), and there is a number a < 1 such that for the 
values of all x close enough to b 

f(x) (b— x)* « oo (8.894) 
holds, then the integral (8.87a) is convergent. But, if the function f(x) is positive in the interval [a, b), 
and there is a number a > 1 such that for the values of x close enough to b 

f(x)(b— x)" »c20 (cconst) (8.89b) 
holds, then the integral (8.87a) is divergent. 
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8.2.4 Parametric Integrals 


8.2.4.1 Definition of Parametric Integrals 
The definite integral 


/ f(z,y) dz — Fly) (8.90) 


is a function of the variable y considered here as a parameter. In several cases the function F(y) is 
no longer elementary, even if f(x,y) is an elementary function of x and y. The integral (8.90) can be 
an ordinary integral, or an convergent improper integral with infinite limits or unbounded integrand 
f(x,y). 
For theoretical discussions about the convergence of improper integrals depending on a parameter see, 
e.g.,[8.3]. 
B Gamma Function or Euler Integral of the Second Kind (see 8.2.5, 6., p. 514): 
oo 
Γ(υ) -- ΤΕ dx (convergent for y > 0). (8.91) 
0 


8.2.4.2 Differentiation Under the Symbol of Integration 

1. Theorem lfthe function (8.90) is defined in the interval c € y € e, and the function f(x,y) is 
continuous on the rectangle a € x € b, c € y € e and it has here a continuous partial derivative with 
respect to y, then for arbitrary y in the interval [c, e] holds 


b b 
1 Of (v, 
τ I f(x,y) dx = / δία, ν) dx. (8.92) 
dy a a ὃν 
This is called differentiation under the symbol of integration. 
rdr 1 y? 


E For arbit >0 E tan 5 d. [2 tan) d f ihn 
or arpıtrar pe arctan axr = arctan © = = -m * 
ΤΝ dy Jo y ο ὃν y o 324-2 2 1-43? 
y 


1.1 d d. 3 P 1 2 
Checking: ré arctan ` y T dz = arctan y +> 5 yin Τ a 25 du (arctan 7 H 5 yln 1 T 7) 5 In 1 T J 














For y = 0 the condition of ον for : (x,y) is not fulfilled, and there exists no derivative. 

2. Generalization for Limits of Integration Depending on Parameters The formula (8.92) 
can be generalized, if with the same assumptions as have been made for (8.92) the functions a(y) and 
B(y) are defined in the interval [c, e], they are continuous and differentiable there, and the curves x = 
a(y), x = (y) do not leave the rectangle a < x < b, c < y < e: 


80) "0 f(x. 9) y) 
e uJ, f(v,y) de = T are de  B'Qy) f (8.9) — a(y) F (ay) 3) (8.93) 
aly) ὃν 
8.2.4.3 Integration Under the Symbol of Integration 


If the function f(x,y) is continuous on the rectangle a < x < b, c € y € e, then the function (8.90) is 
defined in the interval [c, e], and 


j / FG, y) i dy = 1 | / f(x,y) - de (8.94) 


is valid. This case of commutability of the order of integration is called integration under the symbol of 
integration. 
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W A: Integration of the function f(x,y) — xz" on the rectangle 0 € z € 1, a € y € b. The function 





x” is discontinuous at x = 0, y = 0, for a > 0 it is continuous. So one can change the order of 
bp pl 1f pb è dy tp 

integration: / | | x" η dy — / Í x” dy| dx. On the left-hand side one gets / — — — |n ; 
a 190 0 Ja al+y l+a 


1 4b 
: 2 g= er: m 

on the right-hand side 1 dx. The corresponding indefinite integral cannot be expressed by 

o lng 

elementary functions. Anyway, the definite integral is known, so now follows: 


14b. χα 14b 
/ ne @ = ing (0<a<b). 








ae 
E B: Integration of the function f(x,y) = GV over the rectangle 0 € r € 1, 0 € y € 1. The 
a? y 


function is discontinuous at the point (0, 0), so the formula (8.94) cannot be used. Checking it yields: 

















NEP NN. p 1 alt d 
y B y T αξι. $ ων ο ρα: 
/ (+ yp απ S [250 rg / ity arctan y |= 2 
1 1 ορ 
y- r |y yl Y. dv ο a 
Í Gr yn dy ry [oo ml! Í Pal arctan x |o= e 
8.2.5 Integration by Series Expansion, Special Non-Elementary 
Functions 


It is not always possible to express an integral by elementary functions, even if the integrand is an ele- 
mentary function. In many cases one can express these non-elementary integrals by series expansions. 
If the integrand can be expanded into a uniformly convergent series in the interval (a, b], then one gets 


T 
also a uniformly convergent series for the integral Ϊ f(t) dt by integrating it term by term. 
Ja 


1. Sine Integral (|z| < cv, see also 14.4.3.2, 2., p. 756) 


fein τ i 
= fF nt ο ΤΠ -f5 ΠΕ 

t deg t m 
0 m 


αφ T (—1)" rt 














egal aal Fnr eed * (8:95) 
2. Cosine Integral (0 < x « oo) 
T cost f 1— cost 
Ci (z) = -f5 dt=C+ine— [at 
z 0 
x αἱ (-1)"2?" . 
Ξ ο Για 2.931 * 4.4 der) 2n- uj eo with (8.964) 
ο τος 7 e 'Intdt = 0.577 215665... (Euler constant). (8.96b) 
0 


3. Integral Logarithm (0 < x < 1, for 1 < x < oo as Cauchy Principal Value) 


2 f dt (Ina)? (In a)” 
Li (x) int C In |Inz| - Inz 4 gm wem 
J ! ! 


(8.97) 
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4. Exponential Integral (—oo « x < 0, for 0 « x < œo as Cauchy Principal Value) 


; fe x? g" 
πιῳ)- | ο ο ορ πο, (8.982) 
Fi (ln x) = Li (x). (8.98b) 


5. Gauss Error Integral and Error Function 
The Gauss error integral is defined for the domain |v| « oo and it is denoted by 9. The following 
definitions and relations are valid: 


T Bg 
P(x) = = / e 2 dt, (8.99a) jim (1) =1, (8.99b) 


1 7 8 1 
dolz) = — fe 3dt- 9()- 5. 8.99¢ 
9 ία ) Vor | e (x ) 9 ( c) 
The function d(z) is the distribution function of the standard normal distribution (see 16.2.4.2, p. 819) 


and its values are tabulated in Table 21.17, p. 1133. 


The error function erf (x), often used in statistics (see also 16.2.4.2, p. 819), has a strong relation with 
the Gauss error integral: 








2f 
erf (x) = a je di — 26g(x v2), (8.1002) Jim. erf (z) — 1, (8.100b) 
2 x a? (πα 
rf (x) = T | pus s «il 
Ὁ ^ Πα δια " πιο 1)" ) (8.1006) 
f 1 lerf(x) 2 
[5 (t) dt — x erf (x) + Vm (e? - 1) , (8.100d) $ στο = UU. (8.1000) 
6. Gamma Function and Factorial T(x) 
1. Definition The gamma function, the Euler integral of the 5 
second kind (8.91), is an extension of the notion of factorial 4 
for arbitrary numbers x, even complex numbers, except zero 3 
and the negative integers. The curve of the function I(x) is 2 
represented in Fig. 8.23. Its values are given in Table 21.10, 1 
p. 1105. It can be defined in two ways: 
oo -5f\4 -3 -2 -1 4 x 


I(r) = joer dt («>0) or (8.101a) 
0 
I(x) = lim pee S ἢ 


ΠΕ a + D + 2) Lone (x + n) 
(x A 0,—1,—2,...). (8.101b) 





Figure 8.23 
2. Properties of the Gamma Function 


Γ(α -- 1) -- «Γ(α), (8.102a) [(n+1)=n! (n=0,1,2,...),  (8.102b) 
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π 





r(x) r(1-— z) = 








0,1, 2,...), (8.102c) 





sin 7x 


1 T p 
r (5) -2f[e*dt- v, (8.1024) 
0 


1 (2n)! yT f 
r (n 5) σας METL Zen), (8.102e) 


—1)"n!22"\/rr 
r(-n+5) = ae (n =0,1,2,...). (8.102ῃ 


The formulas (8.102a) and (8.102c) are also valid for complex arguments z, but only if Re (z) 7 0 holds. 


3. Generalization of the Notion of Factorial The notion of factorial, defined until now only for 
positive integers n (see 1.1.6.4, 3., p. 13), leads to the function 





x! — Γ(α-ε 1) (8.103a) 
as its extension for arbitrary real numbers. The following equalities are valid: 
For positive integers s: x!— 1-2-3--..r, (8.103b) forr —0: 0!— (1)- 1, (8.103c) 
for negative integers z: zr! = too, (8.103d) for x = f : (5): =. (5) = B , (8.103e) 
1 1 1 3 3 1 
rg=—-:(|—-)!= =) = g=—-: |[—-]!= — m o 
for x= —5 af ;) r(;) νπ, (8103) rz -—5: ( 5) r( ;) 2/5 .(8.103g) 


An approximate determination of a factorial can be performed for numbers > 10, also for fractions n 
with the Stirling formula: 





πλ” 1 
nlz[(—) Von a : 
D i "m ( * qm * 2888) * ) l a) 
In(n!) = (n + 5) Inn — n 4 In v2r. (8.1031) 


7. Elliptic Integrals 
For the complete elliptic integrals (see 8.1.4.3, 2., p. 490) the following series expansions are valid: 


Ξ 

αὐ s 133 1.32 1-3-5)? 

πο πα 2 ( ) ( ΣΣ; -| , k? <1, (8.104 
lana ;| do τ 2.4 T 2-4-6 T ( ) 


3 212 2 14 2 1.6 
= pa ty ai π (1) k (2:2) k (s) k - 
E / 1—K?sin^ 0d) sh 2] 1 2.4) 3 2.4.6) 5 i 


k? <1. (8.105) 


K 














The numerical values of the elliptic integrals are given in Table 21.9, p. 1103. 


8.3 Line Integrals 

The notion of the integral can be generalized in different ways. While the domain of an ordinary definite 
integral is an interval on the numerical axis, for a line integral, the domain of integration is a segment 
of a planar or space curve. The curve, i.e., the path of integration can also be closed; it is called also 
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circuit integral and it gives the circulation of the function along the curve. There are distinguished line 
integrals of the first type, of the second type, or of general type. 


8.3.1 Line Integrals of the First Type yA A=B 
8.3.1.1 Definitions 


The line integral of the first type or integral over an arcis the definite 
integral 


f feas. (8.106) 
(C) 


where f(x, y) is a function of two variables defined on a connected 


domain and the integration is performed over an arc C =AB of 
a plane curve given by its equation. The considered arc is in the 
same domain, and it is called the path of integration. The numerical 
value of the line integral of the first type can be determined in the 
following way (Fig. 8.24): Figure 8.24 





1. Decomposing the rectifiable arc segment AB into n elementary parts by points Aj, Ag,...,An—1 
chosen arbitrarily, starting at the initial point A = Apo and finishing at the endpoint B = An. 





2. Choosing arbitrary points P; inside or at the end of the elementary arcs A; 1A;, with coordinates 
ĉi and ni. 

3. Multiplying the values of the function f(ĉ;, ni) at the chosen points with the arc-length A;—14A;= 
As;i—ı which should be taken positive. (Since the arc is rectifiable, As;_ is finite.) 

4. Adding the n products f(£;, ;) ^s; 4. 

5. Evaluating the limit of the sum 


SHE mAs (8.107a) 
il 


as the arc-length of every elementary curve segment As; ; tends to zero, while n obviously tends to oo. 
If the limit of (8.107a) exists and is independent of the choice of the points A; and P;, then this limit is 
called the line integral of the first type, and the function f (c, y) is called integrable along the curve C: 


/ f(v,y) ds — Jim, = Fléi ni) Asi- (8.107b) 
©) ndo i= 


Analogously the line integral of the first type can be defined for a function f(x, y, ο) of three variables, 
whose path of integration is a curve segment of a space curve: 


/ f(a,y, 2) ds = jim, 2, f ms G) Asi. (8.107c) 
©) nico i= 


8.3.1.2 Existence Theorem 

The line integral of the first type (8.107b) or (8.107c) exists if the function f(x, y) or f(x, y, z) is con- 
tinuous along the continuous arc segment C’, and the curve has a tangent which varies continuously. In 
other words: The above limits exist and are independent of the choice of A; and P;. In this case, the 
functions f(x,y) or f(x,y,z) are called integrable along the curve C. 


8.3.1.3 Evaluation of the Line Integral of the First Type 


The calculation of the line integral of the first type can be done by reducing it to a definite integral. 
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1. The Equation of the Path of Integration is Given in Parametric Form 
If the defining equations of the path are x = z(t) and y = y(t), then 


[ feuds = [ fitt) OY OP - OP at (8.1084) 
(C) to 


holds, and in the case of a space curve x = x(t), y = y(t), and z = z(t) 
τ 
f 1e Ms — f eO, vO lye OP + WOP + OP ae, (8.108b) 
(C) fo 


where to is the value of the parameter t at the point A and T is the parameter value at B. The points 
A and B are chosen so that tọ < T holds. 


2. The Equation of the Path of Integration is Given in Explicit Form y =y (x) 
Substituting t = x one gets from (8.108a) for the planar case 


/ f(x,y)ds — / Fl v(2)]/ 1 [y (x)? dx, (8.1092) 
(C) s 


and from (8.108b) for the three dimensional case 
b 
f fes — f[ issu) YA e I COP e EGO de. (8.109) 
(C) @ 


Here a and 5 are the abscissae of the points A and B , where the relation a « b must be fulfilled. Then 
one considers x as a parameter if every point corresponds to exactly one point on the projection of the 
curve segment C onto the x-axis, i.e., every point of the curve is uniquely determined by the value of its 
abscissa. If this condition does not hold, then the curve segment has to be partitioned into subsegments 
having this property. The line integral along the whole segment is equal to the sum of the line integrals 
along the subsegments. 


8.3.1.4 Application ofthe Line Integral of the First Type 


Some applications of the line integral of the first type are given in Table 8.6. The curve elements ds 
needed for the calculations of the line integrals are given for different coordinate systems in Table 8.7. 
8.3.2 Line Integrals ofthe Second Type 

8.3.2.1 Definitions 


A line integral of the second type or an integral over a projection onto the x-, y- or z-axis is e.g., the 
definite integral 


I f(x,y) dx (8.110a) or / J (v, y, z) do, (8.110b) 
©) ©) 


where f(x,y) or f(x,y,z) are two or three variable functions defined on a connected domain, and the 
integration is done over a projection of a plane or space curve C —AB (given by its equation) onto the 
x-, y-, or z-axis. The path of integration is in the same domain. 

The line integral of the second type one gets similarly to the line integral of the first type, but in the 
third step the values of the function f(£;, n;) or f (£;, ni, G;) are not multiplied by the arc-length of the 


elementary curve segments A; ; A;, but by its projections onto a coordinate axis (Fig. 8.25). 
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Table 8.6 Line Integrals of the First Type 





























Length of a curve segment C | L = J ds 
7 I (C) 

Mass of an inhomogeneous . . 
curve segment C M= | ods (o= f(x,y,z) density function) 

(C) 

. . 1 1 1 
Center of gravity coordinates | zo = L / rods, yo = L / yods, σσ Z / zods 
(C) (C) (C) 

Moments of inertia of a plane 2 Beas 
curve in the x, y plane l = / rods, l,— / y ods 

(C) (C) 

L= [@ +2°)ods, Iy= Je +2")ods, 

Moments of inertia of a (C) (C) 
space curve with respect 3 2 
to the coordinate axes l= / (x* + y")ods 

(C) 
In the case of homogeneous curves o — 1 is substituted. 





Table 8.7 Curve 


Elements 








Cartesian coordinates x, y = y(x 
Plane curve in 
the x, y plane 


Polar coordinates y, p = p(y) 
x = p(p) cos p, y = p(p) sin p 


ds — 4/1 4 [y (x)]?dx 
ds = y P? (p) * [p'Ge)Pde 

















Parametric form in Cartesian ds = \/ |x (t)? + [y (t)]2dt 
coordinates « = x(t), y = y(t) 
Space curve Parametric form in Cartesian ds = ο Πο ο. 
coordinates x = x(t), y = y(t), z = z(t) 
1. Projection onto the z- Axis y 
With Pr, A; 4A;2 z; — z;:1 — Av;.i one gets (8.111) y, A, 
n Yap "A PG m) 
/ f(a,y) de = lim | M f(&i. m) Avi, (8.112a) Ar 
Azj 10 4 ! i 
(C) noo i=l A | | 
ο ο ο ο ο πμ xix 
©) cda 


2. 


Projection onto the y-Axis 


f Fav) dy = Jim, fon) Ava, 
©) n356o i=l 


EOD 2) dy = „lim, ŽS f&n m G) Ayi- 
nooo i=l 


(C) 








Figure 8.25 


(8.113a) 


(8.113b) 
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3. Projection onto the z-Axis 


| renad im, Y (6G) Ass (8.114) 
©) ΙΙ 


8.3.2.2 Existence Theorem 

The line integral of the second type in the form (8.112a), (8.113a), (8.112b), (8.113b) or (8.114) exists 
if the function f(x,y) or f(x,y,z) and also the curve are continuous along the arc segment C, and the 
curve has a continuously varying tangent there. 

8.3.2.3 Calculation of the Line Integral of the Second Type 

The calculation of the line integral of the second type can be done by reducing it to a definite integral. 
1. The Path of Integration is Given in Parametric Form 

With the parametric equations of the path of integration 








z=x(t), y=y(t) and (for a space curve) z= z(t) (8.115 
we get the following formulas: 
T 
For (8.112a) 1 f(z,y) dz — / fix (t), y(6)]a (t) d. (8.116 
(c) to 
T 
For (8.113a) [ fey) dy = [ fe, vol dt. (8.116b 
[ο h 
T 
For (8.112b) f fov. z)dr — f FEO, vO), (O00 at. (8.116c 
(0) to 
T 
For (8.113b) J tendi f fixtt) v. OW dt. (8.116d 
© A 
T 
For (8.114) f fev. z)dz — [ fe),vO, 20 00 dt. (8.116e 
(0) to 


Here, to and T are the values of the parameter t for the initial point A and the endpoint B of the arc 
segment. In contrast to the line integral of the first type, here we do not require the inequality to < T. 
Remark: In the case of reversion of the path of the integral, i.e., interchanging the points A and B, 
the sign of the integral changes. 
2. The Path of Integration is Given in Explicit Form 
In the case of a plane or space curve with the equations 

ν-νωὶ ο:  y=y(2), z= z(a) (8.117) 
as the path of integration, with the abscissae a and b of the points A and B, where the condition a « bis 
no longer necessary, the abscissa x takes the place of the parameter t in the formulas (8.112a) — (8.114). 


8.3.3 Line Integrals of General Type 
8.3.3.1 Definition 


A line integral of general type is the sum of the integrals of the second type along all the projections of 
a curve. If two functions P(x, y) and Q(x, y) of two variables, or three functions P(r, y, z), Q(v, y, z), 
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and R(x, y, z) of three variables, are given along the given curve segment C, and the corresponding line 
integrals of the second type exist, then the following formulas are valid for a planar or for a space curve. 


1. Planar Curve 


nu +Qdy) = 1 Pdz + / Q dy. (8.1182) 
(C) (C) (C) 


2. Space Curve 


f Pas Qdy + Raz) = [ Pac+ 
(C) 


[9 [ Raz. (8.118b) 
(C) (C) 


(C) 


The vector representation of the line integral of general type and an application of it in mechanics will 
be discussed in the chapter about vector analysis (see 13.3.1.1, p. 719). 


B 
8.3.3.2 Properties of the Line Integral of General Type κ᾽ 
Μ 


1. The Decomposition of the Path of the Integral i 
er dd 


by a point M, which is on the curve, and it can even be outside of AB 


(Fig. 8.26), results in the decomposition of the integral into two parts: A M 
[ (6a 94) - [ (Pax + Qay) $ [ (Pav + Qdy)- (8.119) 5 B 


AB AM MB . 
Figure 8.26 


2. The Reverse of the Sense of the Path of Integration 
changes the sign of the integral: 


[ (Pav + Qay) == [ (Pav + Qay)- (8.120) 
AB BA 


3. Dependence on the Path 


In general, the value of the line integral is dependent not only on the initial and B 
endpoints but also on the path of integration (Fig. 8.27): A 
/ (P dz +Q dy) # / (Pdx + Qdy).* (8.121) D 
A A Figure 8.27 
AMB ADB 


BA: /= f e dx + yz dy + zx dz), where C is one turn of the helix x — acost, y — asint,z = bt 
(5) 
(see Helix on p. 260) from tg — 0 to T' — 2m: 
on πα’) 
T= | (—à? sin? t cos t 4- a?bt sin t cos t 4- al?t cost) dt = B 
0 
HB: J = 1 ly? dz + (xy — z’) dy], where C is the arc of the parabola y? — 9x between the points 
(€) 





*Similar formulas are valid for the three-variable case. 
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9 9 8 
8.3.3.3 Integral Along a Closed Curve 


1. Notion of the Integral Along a Closed Curve A circuit integral or the circulation along a 
curve is a line integral along a closed path of integration C , i.e., the initial point A and the end point 
B coincide. The following notation is used: 
jo dx + Q dy) or jo dx + Q dy + Rdz). (8.122) 
(C) (€) 


3 aye 4 
A(0,0) and B(1,3): 1 — [ lo^ + (5 = τ) dicic. 
JO 


In general, this integral differs from zero. But it is equal to zero if the conditions (8.127) are satisfied, 
or if the integration is performed in a conservative field (see 13.3.1.6, p. 721). (See also zero-valued 
circulation, 13.3.1.6, p. 721.) 

2. The Calculation of the Area S of a Plane Figure is a typical example of the application of 
the integral along a closed curve in the form 


1 
ο 5 fe dy — y dx), (8.123) 
(2 

where C is the boundary curve of the plane figure. The integral is positive if the path is oriented 
counterclockwise. 
8.3.4 Independence ofthe Line Integral ofthe Path of Integration 
The condition for independence of a line integral of the path of integration is also called integrability of 
the total differential. 
8.3.4.1 Two-Dimensional Case 
If the line integral 

[(P@y) de + Qa,y) dy (8.124) 

(C) 


with continuous functions P and Q defined on a simple connected domain depends only on the initial 
point A and the endpoint B of the path of integration, and does not depend on the curve connecting 
these points, i.e., for arbitrary A and B and arbitrary paths of integration ACB and ADB (Fig. 8.27) 
the equality 


/ (Pdr +Q dy) = / (P dx + Q dy) (8.125) 
ACB ADB 


holds, then it is a necessary and sufficient condition for the existence of a function U (x, y) of two vari- 
ables, whose total differential is the integrand of the line integral: 
oU oU 
P=—, Q=—. 
Ox Oy 
The function U (x,y) is a primitive function of the total differential (8.126a). In physics, the primitive 
function U (x, y) means the potential in a vector field (see 13.3.1.6, 4., p. 722). 


Pdz +Q dy = dU, (8.126a) ie., Q (8.126b) 


8.3.4.2 Existence of a Primitive Function 
A necessary and sufficient criterion for the existence of the primitive function, the integrability condition 
for the expression P dx + Q dy, is the equality of the partial derivatives 

OP  0Q 


πω (8.127) 
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where also the continuity of the partial derivatives is required. 
8.3.4.3 Three-Dimensional Case 
The condition of independence of the line integral 
[(Pl.y,2) de + Qlx,y,2) dy + R(w,y, 2) di] (8.128) 


of the path of integration analogously to the two-dimensional case is the existence of a primitive function 
U (x, y, z) for which 


P dx -- Qdy -- Rdz — dU, (8.1292) 
holds, i.e., 
ου ου ου 
paw B E. 8.1291 
am LT’ dz (3:129Þ) 


The integrability condition is now that the three equalities for the partial derivatives 
ὃῳ OR OR OP OP OQ 
Oz Oy’ Ox Oz’? Oy Ox 


should be simultaneously satisfied, provided that the partial derivatives are continuous. 











(8.129c) 


E The work W (see also 8.2.2.3, 2., p. 504) is defined as the scalar product of force F (r) and displace- 
ment s. In a conservative field the work depends only on the place 7’, but not on the velocity v. With 
F = Pë, + Qé, + Ré, = gradV and ds = dxē, + dyé, + dzé, the relations (8.129a), (8.129b) are 
satisfied for the potential V (7), and (8.129c) is valid. Independently of the path between the points P, 
and P, holds: 


W= i F(?) ds = [ina + Qdy + Rdz] = V(Py) - V (P). (8.130) 


1 


8.3.4.4 Determination of the Primitive Function 


1. Two-Dimensional Case (Fig.8.28) 

If the integrability condition (8.127) is satisfied, then along an arbitrary path of integration connecting 
an arbitrary fixed point A(xo, yo) with the variable point P(x, y) and passing through the domain where 
(8.127) is valid, the primitive function U(x, y) is equal to the line integral 


v= [pa 4 Qdy). (8.131) 


AP 


In practice, it is convenient to choose a path of integration parallel to the coordinate axes, i.e., one 
of the segments AK P or ALP, if they are inside the domain where (8.127) is valid. There exist two 
formulas for the calculation of the primitive function U (x, y) and the total differential P dx + Q dy: 


z y 

U — Un; go) -- Í «f -C« f PE m) d+ f Qle,n)dn, (8.132a) 
AK KP To Yo 
y T 

U = U (zo, yo) + [+ i =C+ [ Qo. an [PE y) ἀξ. (8.132b) 
AL LP Yo To 


Here C is an arbitrary constant. 
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P(x, y, 2) 
A(X, Yor Zo) 





Figure 8.28 Figure 8.29 


2. Three-Dimensional Case (Fig. 8.29) 
Ifthe condition (8.129c) is satisfied, the primitive function can be calculated for the path of integration 
AKLP with the formulas 


U =U(z0,yo,20) + f + f+ f 


AK KL LP 


-J P(£, yo, zo) dÊ + Jo z,n, z0) dn + n z,y,£) d£ -- C (C arbitrary constant). (8.133) 
Yo 20 
For the me five possibilities of a path of integration with the segments being parallel to the coordinate 
axes one gets five further formulas. 


x + 
Application of the formula (8.132b) and the substitution of ry = 0, yo = 1 (zo = 0, yo = 0 
may not be chosen since the functions P and Q are discontinuous at the point (0,0)) resulting in 


"0-4 d x 
᾿ 3 se a 1. gu ee s + U(0, 1) = — arctan E +C = arctan : +C. 


: Pd Z 1 1 ο. T irélaii 
E B: Pdr + Qdy+ Rdz = z (2, a =) dx + n dy 4 (= Jp 5) dz. The relations 
(8.129c) are satisfied. Application of the formula (8.133) and substitution of xo = 1, yo — 1, 29 — 0 


] x y NH x 1 PA zo. 
result inv =f 0-dc f 0-dn4 1 (ss =) ασ arctan = iv C. 
8.3.4.5 Zero-Valued Integral Along a Closed Curve 


The integral along a closed curve, i.e., the line integral P dx + Q dy is equal to zero, if the relation 


(8.127) is satisfied, and if there is no point inside the curve where even one of the functions P, Q, a 


0Q 


or ΠΕ is discontinuous or not defined. 


: s y dx x dy I m tet, OP 899. p-r 
W A: Pdr-Qdy — gt. αν The condition (8.129c) is satisfied: [RM ο η, 

















Remark: The value of the integral can be equal to zero also without this conditions, but then one gets 
this value only after performing the corresponding calculations. 


8.4 Multiple Integrals 


The notion of the integral can be extended to higher dimensions. If the domain of integration is a region 
in the plane or on a surface in space, then the integral is called a surface integral, if the domain is a part 
of space, then it is called a volume integral. Furthermore, for the different special applications there are 
other special notations. 
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8.4.1 Double Integrals 

8.4.1.1 Notion of the Double Integral 

Y| Pogy) 1. Definition 

The double integral of a function of two variables u — f (c, y) over a planar 
domain S in the z, y plane is denoted by 


[ie 48 — f/f fix. y) dua. (8.134) 
S S 


It isa number, ifit exists, and it is defined in the following way (Fig. 8.30): 
1. Decomposition of the domain S into n elementary domains AS;. 





2. Choosing an arbitrary point P;(2;, y;) in the interior or on the boundary 
Figure 8.30 of every elementary domain. 


3. Multiplication of the value of the function u = f(2;,y;) at this point by the area AS; of the corre- 
sponding elementary domain. 

4. Summation of these products f(x, y:)AS;. 

5. Calculation of the limit of the sum 


5 F (ti v) A5; (8.135a) 
i=l 


as the diameter of the elementary domains tends to zero, consequently AS; tends to zero, and so n 
tends to oo. (The diameter of a set of points is the supremum of the distances between the points of 
the set.) The requirement AS tends to zero is not enough, because, e.g., in the case of a rectangle the 
area can be close to zero also if only one side is small and the other is not, so the considered points 
could be far from each other. If this limit exists independently of the partition of the domain S into 
elementary domains and also of the choice of the points P;(x;, y;), then it is called the double integral 
of the function u — f(x, y) over the domain S, the domain of integration, and one writes: 


n 
/ f(x,y) dS = lim Y^ fGri y) AS. (8.135b) 
$ nso i=l 
2. Existence Theorem 

If the function f(x, y) is continuous on the domain of integration including the boundary, then the dou- 
ble integral (8.135b) exists. (This condition is sufficient but not neces- 
sary.) 

3. Geometrical Meaning 

'The geometrical meaning of the double integral is the volume of a solid 
whose base is the domain in the x, y plane, whose side is a cylindrical sur- 
face with generators parallel to the z-axis, and it is bounded above by the 
surface defined by u = f(x,y) (Fig. 8.31). Every term f(x, y;) AS; of 
the sum (8.135b) corresponds to an elementary cell of a prism with base 


u=f(x,y) 


T 


SAD 





x AS, Pio γι) AS; and with altitude f (x;, y;). The sign of the volume is positive or neg- 
ative, according to whether the considered part of the surface u = f(x,y) 
Figure 8.31 is above or under the x, y plane. If the surface intersects the x, y plane, 


then the volume is the algebraic sum of the positive and negative parts. 
If the value of the function is identically 1 (f(a, y) = 1), then the volume has the numerical value of 
the area of the domain $ in the z, y plane. 


8.4.1.2 Evaluation of the Double Integral 


The evaluation of the double integral is reduced to the evaluation of a repeated integral, i.e., to the 
evaluation of two consecutive integrals. 
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1. Evaluation in Cartesian Coordinates 

If the double integral exists, then one can consider any type of partition of the domain of integration, 
such as a partition into rectangles. After dividing the domain of integration into infinitesimal rectan- 
gles by coordinate lines (Fig. 8.32a) it follows the calculation of the sum of all differentials f(a, y)dS 
starting with all the rectangles along every vertical stripe, then along every horizontal stripe. (The 
interior sum is an integral approximation sum with respect to the variable y, the exterior one with re- 
spect to x.) If the integrand is continuous, then this repeated integral is equal to the double integral 
on this domain. The analytic notation is: 


b | ex(z) b p2(£) 
EIC y)dS — / | f(x,y)dy| dz = / / f(x,y) dy dz. (8.136a) 
s à dpi) a gila) 
Here y = p(x) and y = yı (x) are the equations of the upper and lower boundary curves (AB) pove and 


(AB)pelow Of the surface region S (p1 < p2; Y1, P2 continuous). Here a and b are the abscissae of the 
points of the curves to the very left and to the very right. The elementary area in Cartesian coordinates 
is 






























































































































































dS = dz dy. (8.136b) 
(The area of the rectangle is Ax Ay independently of the value of x.) For the first integration x is 
yA yA 
g D 
NYL B | —. Ss 
y B A x ZO 
dy H dy πας Β EEN" 
AREO E AS 
4A α C ΝΥ JESSE OY 
of a dx bx 0 dx X 
a) b) 
Figure 8.32 Figure 8.33 Figure 8.34 


handled as a constant. The square brackets in (8.136a) can be omitted, since according to the notation 
the interior integral is referred to the interior integration variable, the exterior integral is referred to 
the second variable. In (8.136a) the differential signs dx and dy are at the end of the integrand. It is 
also usual to put these signs right after the corresponding integral signs, in front of the integrand. 
The summation can be performed in reversed order, too, (Fig. 8.32b). If the integrand is continuous, 
then it results also in the double integral: 
B va(y) 
[fe y) dS = i. I Λία, y) dz dy. (8.136c) 
5 a n (y) 


W Calculation of A — Í zy? dS , where S is the surface region between the parabola y — a? and the 
Js 


2 [2r 2 y? ap 32 
line y = 2x (Fig. 8.33) as A= Í is vy? dy dx — Í tdr |>| => [ (8x4 — 27) dx = = or 
Jo Ja Jo ὃ 1 3 Jo 5 


4 [NU 2 zl) ips ἢ 2 
a=f / xy? de dy = | y? dy e = sf y? y— y dy — m 
Jo Jy/2 Jo 2 y/2 2 Jo 4 5 


2. Evaluation in Polar Coordinates 
The integration domain is divided by coordinate lines into elementary parts bounded by the arcs of 
two concentric circles and two segments of rays issuing from the pole (Fig. 8.34). The area of the 
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elementary domain in polar coordinates has the form 

dS = p dp dọ. (8.137a) 
(The area of an elementary part determined by the same Ap and Ay is obviously smaller being close 
to the origin, and larger far from it.) With an integrand given in polar coordinates w = f(p,y) a 
summation is to be performed first along each sector, then with respect to all sectors: 


p2 p2(y) 


i Fods = ᾗ Πορ) ράράρ. (8.1376) 
φι ρι() 


Here p — pilo) and p — pa() are the equations of the interior and the exterior boundary curves 


(AmB ) and (AnB ) of the surface S and v and v» are the infimum and supremum of the polar angles 
of the points x the domain. The reverse order of integration is seldom used. 


W Calculation of the special integral A — / p sin? q dS, where S is the surface of the half-circle with 
Js 


p=3cosp (0 < y < 7/2) (see Fig. 8. Pal 


x π/2 p3cosp p 
A-[ Í P sin? y dp dy = κ sin? o d 
0 (3, 0) 3 


, WC ge ok 6 
Figure 8.35 = of sin“ p cos” y dy = 5 
Jo 


3. Evaluation with Arbitrary Curvilinear Coordinates u and v 


3coso 


0 


The coordinates are defined by the relations —Const, 
x= x(u,v), y= y(u, v) (8.138) K zu 
(see 3.6.3.1, p. 261). The domain of integration is partitioned 
by coordinate lines u — const and v — const into infinitesimal 
surface elements (Fig. 8.36) and the integrand is expressed by 
the coordinates u and v. Performing the summation along one 
strip, e.g., along v = const, then over all strips, yields 
ug v2(u) B | 
[tev v) dS = / / f (u, v)| D| dv du. (8.139) Σ 


u vı(u) 











const. 
B 


u= 
Q 
Un 

5 


Here v = vi(u) and v — vo(u) are the equations of the boundary Figure 8.36 


curves AmB and AnB of the surface S. uy and ug denote the infimum and the supremum of the values 
of u of the points belonging to the surface δ. |D| denotes the absolute value of the Jacobian determinant 
(functional determinant) 





Ox Ox 
Βία.) _| Ju ðv 
D= =| ĝu Qv. 140a. 
D(u,v) - |0y ὃν (8.140a) 
Ou Ov 
The area of the elementary domain dS in curvilinear coordinates can be easily expressed: 
— |D| dv du. (8.140b) 


The formula (8.137b) is a special case of (8.139) for the polar coordinates z = pcos p, y — psinq. The 
functional determinant here is D — p. 

The curvilinear coordinates are chosen so that the limits of integration in the formula (8.139) are as 
simple as possible, and also the integrand is not very complicated. 
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Wi Calculation of A = / f(x, y) dS for the case when S is the inte- 
5 


rior of an asteroid (see 2.13.4, p. 104), with x — acos?t, y — asin?t 
(Fig. 8.37). First the curvilinear coordinates u and v as £x = 
ucos?v, y — usin? v are introduced whose coordinate lines u — ci 
represents a family of similar asteroids with equations x = cı cos? v 
and y — c, sin? v. The coordinate lines v — c3 are rays with the equa- 
tions y = kx, where k = tan? cz holds. This gives 

cos? v —3u cos? v sin v 
sin?v  3usin?vcosv 


= 3u sin? v cos? v, 











a pn 
A= 1 / f (x(u, v), y(u, v)) 3usin? v cos? v dv du. Figure 8.37 
o Jo 


Tabelle 8.8 Plane Elements of Area 

















Coordinates Element of Area | 
Cartesian coordinates x, y dS — dy dx | 
Polar coordinates p, o dS = pdp dp | 
Arbitrary curvilinear coordinates u,v | dS = |D|dudv (D Jacobian determinant) | 





8.4.1.3 Applications of the Double Integral 


Some applications of the double integral are collected in Table 8.9, p. 528. The required areas of 
elementary domains in Cartesian and polar coordinates are given in Table 8.8. 


8.4.2 Triple Integrals 

The triple integral is an extension of the notion of the integral into three-dimensional domains. It also 
is called volume integral. 

8.4.2.1 Notion of the Triple Integral 

1. Definition 


One defines the triple integral of a function f(x, y, z) of three variables over a three-dimensional domain 
V analogously to the definition of the double integral. One writes: 


f tenaa = fff fey, 2)dzdyae. (8.141) 





The volume V (Fig. 8.38) is partitioned into elementary volumes AV;. Then the products f (£i, Yi, zi) 
AV; are formed, where the point P;(x;, y;, z;) is inside the elementary volume or it is on the boundary. 
The triple integral is the limit of the sum of these products with all the elementary volumes in which the 
volume V is partitioned, then the diameter of every elementary volume tends to zero, i.e., their number 
tends to oo. The triple integral exists only if the limit is independent of the partition into elementary 
volumes and the choice of the points P;(zx;, y;, z;). Then holds: 


J Kenad = jim, πα (8.142) 
V 


noo i—l 
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Table 8.9 Applications of the Double Integral 





General Formula 


Cartesian Coordinates 


Polar Coordinates 





1. Area of a plane 


figure: 





8- [as 
$ 


= [font 





2. Surface: 





dS 





cos y 


s=f 
5 


ΠΟΙ 


OzV fOzW 
_ 2.1.2 
eT p ctp (3) + (5) dp di 





3. Volume of a cylinder: 





peu 


= f vex 


= [[ :odoae 





4. Moment of inertia of a plane figure, with respect to the x-axis: 





I= [yds 
5 


= / y? dy dx 


= / p? sin? y dp dọ 





5. Moment of inertia of a plane figure, with respect to the pole 0: 





l = [7 dS 
5 


= nic 4 y?) dy dx 


= [f è apap 





6. Mass of a plane 


figure with the density function 


0: 





M το 


= / ody dx 


= |f epdpag 





7. Coordinates of the center of gravity of a homogeneous plane figure: 





I «dS 








n x dy dx 


j dy dx 
i y dy dx 








wg = 5 S 
[yas 
Yo = £ S 


| i dy dx 


/ p cos yp dp dy 


/ p dp do 


I p? sin q dp di 





/ p dp d 





2. Existence The 


The existence theorem for the triple integral is a perfect analogue of the existence theorem for the 


double integral. 


orem 
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8.4.2.2 Evaluation of the Triple Integral 
The evaluation of triple integrals is reduced to repeated evaluation of three ordinary integrals. If the 
triple integral exists, then one can consider any partition of the domain of integration. 








z-Np(x, y) 
ANN 
z-y, y) 





> 
y 




















Figure 8.38 Figure 8.39 


1. Evaluation in Cartesian Coordinates 
The domain of integration can be considered as a volume V here. A decomposition of the domain 
is formed by coordinate surfaces, in this case by planes, into infinitesimal parallelepipeds, i.e., their 
diameter is an infinitesimal quantity (Fig. 8.39). Then one performs the summation of all the products 
f(x,y, z) dV, starting the summation along the vertical columns, i.e., summation with respect to z, then 
in all columns of one slice, i.e., summation with respect to y, and finally in all such slices, i.e., summation 
with respect to x. Every single sum for any column is an approximation sum of an integral, and if the 
diameter of the parallelepipeds tends to zero, then the sums tend to the corresponding integrals, and 
if the integrand is continuous, then this repeated integral is equal to the triple integral. Analytically: 
b ( p2(x) | p2(z,y) 

f(x, y, 2) dz| dy ¢ da 


| fo av 
V a kia) biev) 
b p2(x) ψα(α,υ) 
= if | I f(x,y, 2) dz dy dx. (8.143a) 
a nz) va (ay) 
Here z — v4 (x, y) and z 2 v»(x, y) arethe equations ofthe lower and upper part of the surface bounding 
the domain of integration V (see limiting curve [ in Fig. 8.39); da dy dz is the elementary volume in 
the Cartesian coordinate system. y = yı (x) and y = (x) are the functions describing the lower and 
upper part of the curve C which is the boundary line of the projection of the volume onto the x, y plane, 
and z = a and x = b are the extreme values of the x coordinates of the points of the volume under 
consideration (and also the projection under consideration). There are the following postulates for 
the domain of integration: The functions ọı(x) and y2(x) are defined and continuous in the interval 
a < x < b, and they satisfy the inequality yı(x) < p2(x). The functions v(r, y) and v»(r, y) are 
defined and continuous on the domain a < x < b, yı(x) < y < p(x), and also vi(z,y) X we(v, y) 
holds. In this way, every point (x,y, z) in V satisfies the relations 
a<r<b, pi(x) € y € vo(x), y(x, y) € z < palz, y). (8.143b) 
Just as with double integrals, the order of integration can be changed, then the limiting functions will 
change in the same sense. (Formally: the limits of the outermost integral must be constants, and any 
limit may contain variables only of exterior integrals.) 
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E Calculate the integral J = / (y? ἠ- 27) dV. for a pyramid bounded by the coordinate planes and the 
V 
plane x +y +z = 1: 


1 fl- fl-z-y T 1-α 1-α-ῃ 1 
1 =f / 1 (y? + 27) dz dy dx — T {/ ή (y2 + 2?) dz] dy} Pp 
0 J0 0 0 0 0 30 








Figure 8.40 Figure 8.41 


2. Evaluation in Cylindrical Coordinates 

The domain of integration is decomposed into infinitesimal elementary cells by coordinate surfaces 
p = const, p = const, z = const (Fig. 8.40). The volume of an elementary domain in cylindrical 
coordinates (Table 8.10, p. 532) is 


dV = pdzdpdy. (8.144a) 
After defining the integrand by cylindrical coordinates f(p, y, z) the integral is: 
v2 pz) za(p,e) 
/ f(p.«,z)dV — / | I f (p.«, z) pdz dp dq. (8.144b) 
V 


#1 pr(y) zi (ps) 


W Calculate the integral / — / dV for a solid (Fig. 8.41) bounded by the z, y plane, the x, z plane, 
V 


the cylindrical surface x? + y? = ax and the sphere x? + y? 4-22 5 a2: z; 20, z; 5 Ja?$— 32 —? — 
T π/2 pacosp py/a?—p? 
γαρ; pr =0, p= acosy, 1 =0, = 5-1 = / [ f pdz dpd = 
π/2 acos p V a2—p? 3 
1 μ ή «| Z dy = a (3m — 4). Since f(p,y, z) = 1, the integral is equal to 
0 0 0 


the volume of the solid. 
3. Evaluation in Spherical Coordinates 
The domain of integration is decomposed into infinitesimal elementary cells by coordinate surfaces 
r = const, yp = const, J = const (Fig. 8.42). The volume of an elementary domain in spherical 
coordinates (see Table 8.10, p. 532) is 

dV = r° sin ù dr dò dọ. (8.145a) 
For the integrand f(r, p, V) in spherical coordinates, the integral is: 

φα ϑο(φ) τα(ϑ.ρ) 
I f(r,g,0) dV = J / / f(r, ¢,0) r? sin 0 dr dd dy. (8.145b) 
V 91 01(9) ri (9,9) 
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Figure 8.42 Figure 8.43 


cos Ü 





dV for a cone whose vertex is at the origin, and its symmetry axis 


W Calculate the integral  — [3 
V 


is the z-axis. The angle at the vertex is 2a, the altitude of the cone is h (Fig. 8.43). Consequently 


h 
holds: rı = 0, r2 = sj Üi1—0,05— 0o; yı =0, p2 = 2r. 
COS 


2n h/ cos 0 h/ cos? 
1 =f I / cost? ? sin dr dÜ dp — a {[ cos Ü sin Ü n «| «| αφ 
0 


— 2m h (1— cosa). 








4. Evaluation in Arbitrary Curvilinear Coordinates u, v, w 
The coordinates are defined by the equations 

z = z(u,v, w), y = y(u, v, w), z = z(u,v, w) (8.146) 
(see 3.6.3.1, p. 261). The domain of integration is decomposed into infinitesimal elementary cells by the 


coordinate surfaces u = const, v = const, w = const. The volume of an elementary domain in arbitrary 
coordinates (see Table 8.10, p. 532) is: 


Ox Ox Ox 
Ou Ov Ow 
-|D|dudvdw, with D= ou ou Du (8.1472) 
Oz Oz Oz 
Ou Ov Ow 


i.e., D is the Jacobian determinant. For the integrand f(u, v, w) in curvilinear coordinates u, v, w, the 
integral is: 
uz v2(u) w2 (u,v) 
f fessa =] j / f (u, v, w) | D| dw dv du. (8.147b) 
V 


ùa v, (u) udu) 


Remark: The formulas (8.144b) and (8.145b) are special cases of (8.147b). 

For cylindrical coordinates D — p holds, for spherical coordinates D — r? sin is valid. 

If the integrand is continuous, then one can change the order of integration in any coordinate system. 
A curvilinear coordinate system is chosen such that the determination of the limits of the integral 
(8.1475), and also the calculation of the integral, should be as easy as possible. 


8.4.2.3 Applications of the Triple Integral 


Some applications of the triple integral are collected in Table 8.11, p. 533. The elementary areas 
corresponding to different coordinates are given in Table 8.8, p. 527. The elementary volumes corre- 
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sponding to different coordinates are given in Table 8.10. 


Table 8.10 Elementary volumes 























Coordinates Elementary Volume | 
Cartesian coordinates x, y, z dV = dz dy dz | 
Cylindrical coordinates p, vy , z dV = pdp dọ dz | 
Spherical coordinates r , 9 , dV = r?° sin V dr dô dy | 
Arbitrary curvilinear coordinates u, v, w | dV — |D| dudv dw | (D Jacobian determinant) | 





8.5 SurfaceIntegrals 
There are distinguished surface integrals of the first type, of the second type, and of general type, 
analogously to the three different line integrals (see 8.3, p. 515). 


8.5.1 Surface Integral of the First Type 
The surface integral or integral over a surface in space is the generalization of the double integral, sim- 
ilarly as the line integral of the first type (see 8.3.1, p. 516) is a generalization of the ordinary integral. 


8.5.1.1 Notion of the Surface Integral of the First Type 


1. Definition 
The surface integral of the first type of a function u = f(x,y,z) of three variables defined in a connected 


domain is the integral 
[ tew2) ds, (8.148a) 
5 


over a region S of a surface. The numerical value of the surface integral of the 

first kind is defined in the following way (see Fig. 8.44): 

1. Decomposition of the region S in an arbitrary way into n elementary regions 
Figure 8.44 AS; . 

2. Choosing an arbitrary point P;(x;, Yi, zi) inside or on the boundary of each elementary region AS; . 

3. Multiplication of the value f (x;, yi, z;) of the function at this point by the area AS; of the corre- 

sponding elementary region. 

4. Summation of the products f (xi, Yi, zi) ASi- 

5. Determination of the limit of the sum 


n 

Σ f (a, Vi; zi) AS; (8.148b) 

i=1 
as the diameter of each elementary region tends to zero, so AS; tends to zero, hence, their number n 
tends to oo (see 8.4.1.1, 1., p. 524). 
If this limit exists and is independent of the particular decomposition of the region S into elementary 
regions and also of the choice of the points P;(x;, Yi, zi), then it is called the surface integral of the first 
type of the function u = f(x,y,z) over the region S, and one writes: 


ή f(a,y,z) dS = Jim, M Mui vi zi) ASi. (8.148c) 
S 





noo i—l 
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2. Existence Theorem 


If the function f(x,y,z) is continuous on the domain, and the functions defining the surface have con- 
tinuous derivatives here, the surface integral of the first type exists. 


Table 8.11 Applications of the Triple Integral 





General 
Formula 


Cartesian 
Coordinates 


Cylindrical 
Coordinates 


Spherical 
Coordinates 





1. Volume of a solid 





v-fav- 
V 


1 T / dz dy dx 


[|| az aoae 


III r? sin 0dr dd dp 





2. Axial momen 


t of inertia of a solid with respect to the z-axis 





pa | eV 
V 


J (2° + y’) dz dy dx 


/ p? dz dp dip 


if r^ sin? 9 dr dó ἀφ 





3. Mass of a solid with the density function o 





M = [sav - 
V 


lll odz dy dx 


I, op dz dp dq 


/ / or” sin 0 dr dd dy 





4. Coordinates of the center of a homogeneous solid 





1 rdV 











ME = 
te V 

f» dV 
yc — * = 

few 

V . 
20Ο = ν = 








JI x dz dy dx 


/ p? cos o dp dip dz 


If r? sin? 0 cos y dr dd dy 





/ / / dz dy dx 
J) y dz dy dx 





I If edpaeae 


/ p? sin q dp d dz 


nii r? sin V dr dô dy 


If r? sin? 0 sin o dr d dip 





Ji dz dy dx 
[| z dz dy dx 





nii pdp dp dz 
I pz dp dọ dz 





ii dz dy dx 


n pdp dọ dz 





// r?° sin V dr dô dp 


J 1? sin Ü cos ó dr d dip 
// 1? sin J dr di dp 





8.5.1.2 Evaluation of the Surface Integral of the First Type 


The evaluation of the surface integral of the 


over a planar domair 


1 (see 8.4.1, p. 524). 





first type is reduced to the evaluation of a double integra 


1. Explicit Representation of the Surface 


If the surface S is given by the equation 


z = z(x,y) 





(8.149) 
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in explicit form, then 
f fev 38 — [f flv zo Ya e P P acay, (8.1502) 
5 ο 


is valid, where 5" is the projection of S onto the x,y plane and p and q are the partial derivatives 
, Oz Oz 
P= n° 1~ Oy 
point in S’ in the x, y plane, i.e., the points of the surface are defined uniquely by their coordinates. If 
it does not hold, one decomposes S into several parts each of which satisfies the condition. Then the 
integral on the total surface can be calculated as the algebraic sum of the integrals over these parts of 
S 


The equation (8.150a) can be written in the form 


/ f(v.y, 2) dS — n fle, y, z(e, y) E m ; (8.150b) 
s Sey 


. Here one assumes that to every point of the surface S there corresponds a unique 





cos y 


X-r Y- Z-— 
since the equation of the surface normal of (8.149) has the form = f= D 
p q = 
ble 3.29, p. 264), since for the angle between the direction of the normal and the z-axis, cosy = 
1 


Vit P+? 
as an acute angle, so cosy > 0 always holds. 
2. Parametric Representation of the Surface 











(see Ta- 


holds. In evaluating a surface integral of the first type, this angle 7 is always considered 


If the surface S is given in parametric form by the equations 
r=o(u,v), y=yluv),  2=2(u,0), (8.151a) 
(Fig. 8.45), then 


ΣΙ 

5 

= n f [x(u, v), y(u, v), z(u, v) V EG — F? du dv, (8.151b) 
A 


where the functions E, F, and G are the quantities given in 3.6.3.3, 1., 
p. 263. The elementary region in parametric form is 


v EG — F? dudv = dS, (8.151c) 





Figure 8.45 


and A is the domain of the parameters u and v corresponding to the given surface region. The evaluation 
is performed by a repeated integration with respect to v and u: 
ug v2(u) 

f 9o. v) dS = / / d(u,v)vV EG — F?dvdu, 9 — f[v(u, v), (uv), z(u, v)]. (8.151d) 

S Ul v(u) 
Here uw; and uy are coordinates of the extreme coordinate lines u = const enclosing the region S 
(Fig. 8.45), and v — vi(u) and v = v2(u) are the equations of the curves AmB and AnB of the 
boundary of S. 
Remark: The formula (8.1502) is a special case of (8.151b) for 

u=2, v=y, E=1+p*, F=pq, G=14+¢. (8.152) 
3. Elementary Regions of Curved Surfaces 
The elementary regions of curved surfaces are given in Table 8.12. 
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Table 8.12 Elementary Regions of Curved Surfaces 





| Coordinates Elementary Region 








| Cartesian coordinates x,y,z = z(x,y) | d5 = 





Cylindrical lateral surface, dS — Rdp dz 





R (const. radius), coordinates y, z 





Spherical surface R (const. radius), dS = R? sin V dò dy 





coordinates Ù , 


Arbitrary curvilinear coordinates u,v | d$ — V EG — F? du dv 
(E, F,G see differential of arc, p. 263) 

















8.5.1.3 Applications of the Surface Integral of the First Type 


1. Surface Area of a Curved Surface 
S= |as. (8.153) 
/ 





2. Massofan Inhomogeneous Curved Surface S 
With the coordinate-dependent density o = f(x,y, z) it follows: 


Ms = feas. (8.154) 
S 


8.5.2 Surface Integral ofthe Second Type 


The surface integral of the second type, also called an integral over a projection, is a generalization of 
the notion of double integral similarly to the surface integral of the first type. 


8.5.2.1 Notion of the Surface Integral of the Second Type 


1. Notion of an Oriented Surface 

A surface usually has two sides, and one of them can be chosen arbitrarily as the exterior one. If the 
exterior side is fixed, it is called an oriented surface. Surfaces for which one can not define two sides are 
not discussed here (see [8.7]). 

2. Projection of an Oriented Surface onto a Coordinate Plane 

Projecting a bounded part S of an oriented surface onto a coordinate plane, e.g., onto the x,y plane, 
one can consider this projection Pr; S as positive or negative in the following way (Fig. 8.46): 

a) If the z, y plane is looked at from the positive direction of the z-axis, and one sees the positive side 
of the surface S, where the exterior part is considered to be positive, then the projection Prey S has a 
positive sign, otherwise it has a negative sign (Fig. 8.46 a,b). b) If one part of the surface shows its 
positive side and the other part its negative side, then the projection Przy S is regarded as the algebraic 
sum of the positive and negative projections (Fig. 8.46c). 

The Fig. 8.46d shows the projections Pr;,S and Pr,,5 of asurface 5; one of them is positive the other 
one is negative. 

The projection of a closed oriented surface is equal to zero. 
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exterior side 





L] interior side 








Figure 8.46 


3. Definition of the Surface Integral of the Second Type over a Projection 
onto a Coordinate Plane 


The surface integral of the second type of a function f(x,y,z) of three variables defined in a connected 
domain is the integral 


[fü y, 2) da dy, (8.155) 
s 


over the projection of an oriented surface S onto the z, y plane, where S is in the same domain where the 
function is defined, and if there is a one-to-one correspondence between the points of the surface and 
its projection. The numerical value of the integral is obtained in the same way as the surface integral 
of the first type except that in the third step the function value f (z;, y;, z;) is not multiplied by the 
elementary region AS;, but by its projection Prz,ASj;, oriented according to 8.5.2.1, 2., p. 535 on the 
x,y plane. Then holds: 


noo t= 


/ f(a,y, 2) dx dy = Jim, Σ, Λου νυ z;) Pray AS;. (8.156a) 
5 


Defining analogously the surface integrals of the second type over the projections of the oriented surface 
S onto the y, z plane and onto the z, x plane one gets: 


[i y, z) dy dz = Jim, YO f (£i, Yi, zi) Pryz ASi, (8.156b) 
S 


noo i=1 


ngo i=l 


/ f(a,y, 2) dzdxz = aim, Σ f (zi, yi, zi) Prze ASi. (8.156c) 
S 
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4. Existence Theorem for the Surface Integral of the Second Type 

The surface integral of the second type (8.156a,b,c) exists if the function f(x,y,z) is continuous and 
the equations defining the surface are continuous and have continuous derivatives. 

8.5.2.2 Evaluation of Surface Integrals of the Second Type 

The principal method is to reduce it to the evaluation of double integrals. 


1. Surface Given in Explicit Form 
If the surface S is given by the equation 


z= (z,y) (8.157) 
in explicit form, then the integral (8.156a) is calculated by the formula 

ftem, z) dx dy = / Fle, y, p(x, y)] dS, (8.158a) 

5 PrayS 


where Spy = PreyS. The surface integral of the function f(x,y, z) over the projections of the surface 
S onto the other coordinate planes is calculated similarly: 


f te y, z) dy dz = / fiu, 2). y, z] dSy;, (8.158b) 
5 PryzS 

where one substitutes x = wv(y, z), the equation of the surface S solved for x, and Sy, = Pry,S. 
"ti y, z) dz dz = i Fle, x(z, £), z] dSzx, (8.158c) 
S Pras 


where one substitutes y = x(z,2), the equation of the surface S' solved for y, and Sz, = PrzgS. If 
the orientation of the surface is changed, i.e., if interchanging the exterior and interior sides, then the 
integral over the projection changes its sign. 

2. Surface Given in Parametric Form 

If the surface is given by the equations 











z = z(u,v), y = y(u, v), z = z(u,v) (8.159) 
in parametric form, one calculates the integrals (8.156a,b,c) with help of the following formulas: 
f fov. z) dx dy = [ feu»), (u,v), 2(u, goo du dv, (8.1602) 
] 3 D(u, v) 
S A 
ο ο f tet. u Do. 2) 
f(x,y,z)dydz — [ f[x(u,v), y(u, v), z(u, v)] du dv, (8.160b) 
] y D(u,v) 
5 Δ 
Dí(z,x) 
f fe y, z) dz dz = f fix. v), y(u, v), z(u, v)] du dv. (8.160c) 
j i D(u,v) 


D(x, y Diye) Dlz,x ; d ; s 
Here the expressions PU) (y, z) DU) are the Jacobian determinants of pairs of functions 


D(u,v)' D(u,v)' D(u,v) 


x,y,z with respect to the variables u and v; A is the domain of u and v corresponding to the surface S. 





8.5.8 SurfaceIntegralin General Form 
8.5.3.1 Notion of the Surface Integral in General Form 


If P(x,y, z), Q(x,y, z), R(x, y, z) are three functions of three variables defined in a connected domain 
and S is an oriented surface contained in this domain, the sum of the integrals of the second type taken 
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over the projections on the three coordinate planes is called the surface integral in general form: 











Je dy dz + Q dz dx + R dx dy) = n + 1 Q dz dz + [Rav dy. (8.161) 
5 5 5 5 
The formula reducing the surface integral to a double integral is: 
de sdsd Rir D(yz) , οῬία.α)  ῃῬία,ν) 
ju dy dz + Q dz dx + Rdz dy) / Poe 5) * pics) t+ σος (8.162) 


D(rvy) D(wz | D(z, x) 
D(u,v)' D(u,v)' D(u,v) 
Remark: The surface integral of vector-valued functions is discussed in the chapter about the theory 
of vector fields (see 13.3.2. p. 722). 


where the quantities 





and A have the same meaning, as above. 


8.5.3.2 Properties of the Surface Integrals 
1. Ifthe domain of integration, i.e., the surface S, is decomposed into two parts $4 and 53 (see Fig. 
8.47), then 


[Piyaz 4+ Qdzdx + Rdz dy) = [Piyaz +Qdzdz + Rdz dy) 
S δι 


+ [(Pdydz + Qdzdx + Rdedy). (8.163) 


2. Ifthe orientation of the surface is reversed, i.e., the exterior and interior sides are interchanged, the 
integral changes its sign: 
[Pw dz + Q dz dz + Rdz dy) = — [Pw dz + Q dz dz + Rdz dy), (8.164) 
s+ s- 
where St and S^ denote the same surface with different orientation. 
3. A surface integral depends, in general, on the line bounding the 
surface region S as well as on the surface itself. Thus the integrals taken 
over two different non-closed surface regions $4 and 55 spanned by the 
same closed curve C are, in general, not equal (Fig. 8.47): 
[Piyaz + Q dz dz + Rdz dy) 
δι 
£ [ (Pads & Qaz dz  Raz y). (8.165) 
Sa Figure 8.47 
4. An application of the surface integral is the calculation of the volume V of a solid bounded by a 
closed surface S. The integral can be expressed and calculated in the form 


v=o ; [© dudz +ydzde + zde dy), (8.166) 
5 
ej xdydz or ya ydzdx or vef zdxdy or (8.167a) 
5 5 5 
V= : fe dy dz + y dz dz + z dz dy) (8.167b) 
5 


where S is oriented so that its exterior side is positive. 
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Bb Calculation of the volume V of a sphere with surface S according to z? + y? + 2? = R?: Using 
spherical coordinates x = Rsin ð cosg, y = RsinÜüsing,z — Rcos0 (0€ 0 X 15,0 o x22) and 
the Jacobian determinant as in (8.160a) 


D(x,y) _ |x» xg 
Ρ(ὐ,ᾳ) | Ye Ye 


follows from the third integral in (8.167a) 





= R? cossin v (8.168a) 








2π π π 

4 

V= j / R? cos? ð sin V dò dy = ὑπ { cos’ J sin) di = ark. (8.168b) 
y=0 3=0 0 





9 Differential Equations 


1. A Differential Equation is an equation, in which one or more variables, one or more functions 
of these variables, and also the derivatives of these functions with respect to these variables occur. The 
order of a differential equation is equal to the order of the highest occurring derivative. 

2. Ordinary and Partial Differential Equations differ from each other in the number of their 
independent variables; in the first case there is only one, in the second case there are several. 

o?z Oz Oz 


XYZ ---- 


“Ox Oy’ 





2 
ΒΑ: dy ης —-0. BB: zd?ydr—dy(dr)? —e"(dy?. WI C: = 
da dx Oxdy 


9.1 Ordinary Differential Equations 
1. General Ordinary Differential Equation of Order n 


in implicit form has the equation 
F |z,y(x),y'(a),....y™(a)] =0. (9.1) 


If this equation is solved for y? (x), then it is the explicit form of an ordinary differential equation of 

order n. 

2. Solution or Integral 

of a differential equation is every function satisfying the equation in an interval a € x < b which can be 

also infinite. A solution, which contains n arbitrary constants c4, co, .. . , c5, is called the general solution 

or general integral. If the values of these constants are fixed, a particular integral or a particular solution 

is obtained. The value of these constants can be determined by n further conditions. If the values of 

y and its derivatives up to order n — 1 are prescribed at one of the endpoints of the interval, then the 

problem is called an initial value problem. If there are given values at both endpoints of the interval, 

then the problem is called a boundary value problem. 

W The differential equation —4' sin x + y cos z — 1 has the general solution y — cosz -- csin x. For the 

condition c — 0 one gets the particular solution y — cos z. 

3. Initial Value Problem 

If the n values y(xo), y (xo), . .. , y^^ (zo) are given at zo for the solution y = y(x) of an n-th order 

ordinary differential equation, then an initial value problem is given. The numbers are called the initial 

values or initial conditions. They form a system of n equations for the unknown constants C1, C2,..., Cn 

of the general solution of the n-th order ordinary differential equation. 

W The harmonic motion of a special elastic spring-mass system can be modeled by the initial value 

problem y" 4- y — 0 with y(0) — yo, y(0) = 0. The solution is y = yo cos x. 

4. Boundery Value Problem 

If the solution of an ordinary differential equation and/or its derivatives are given at several points of 

its domain, then these values are called the boundary conditions. A differential equation with boundary 

conditions is called a boundary value problem. 

E The bending line of a bar with fixed endpoints and uniform load is described by the differential 

equation y” = x — x? with the boundary conditions y(0) = 0, y(1) = 0 (0 € x < 1). The solution is 
a3 A . 


9.1.1 First-Order Differential Equations 
9.1.1.1 Existence Theorems, Direction Field 


1. Existence of a Solution 
In accordance with the Cauchy existence theorem the differential equation 

y — f(x.y) (9.2) 
© Springer-Verlag Berlin Heidelberg 2015 540 
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has at least one solution in a neighborhood of zo such that it takes the value yo at x = xo if the function 
f (x, y) is continuous in a neighborhood G of the point (xo, yo). For example, G can be selected as the 
region given by |x — zo| < a and |y — yo| < b with some a and b. 


2. Lipschitz Condition 
The Lipschitz condition with respect to y is satisfied by f(x, y) if 


μα. οκ.” Ὁ (9.3) 
holds for all (x, y1) and (zr, ya) from G, where N is independent of z, y1, and yg. If this condition is 
satisfied, then the differential equation (9.2) has a unique solution through (ro, yo). The Lipschitz 
condition is obviously satisfied if f(x,y) has a bounded partial derivative 0f /Oy in this neighborhood. 
In 9.1.1.4, p. 546 there are examples in which the assumptions of the Cauchy existence theorem are not 
satisfied. 

3. Direction Field 

If the graph of a solution y = y(x) of the differential equation y’ = f(x,y) goes through the point 
P(x,y), then the slope dy/dx of the tangent line of the graph at this point can be determined from 
the differential equation. So, at every point (x,y) the differential equation defines the slope of the 
tangent line of the solution passing through the considered point. The collection of these directions 
(Fig. 9.1) forms the direction field. An element of the direction field is a point together with the direc- 
tion associated to it. Integration of a first-order differential equation geometrically means to connect 
the elements of a direction field into an integral curve, whose tangents have the same slopes at all points 
as the corresponding elements of the direction field. 


Νίκο Yo) 


0 x 





Figure 9.1 Figure 9.2 


4. Vertical Directions 
If a vertical direction can be found in a direction field, e.a., if the function f(x, y) has a pole, then one 
can change the role of the independent and dependent variables and consider the differential equation 
μα (9.4) 
dy f(x,y) 
as an equivalent equation to (9.2). In the region where the conditions of the existence theorems are 
fulfilled for the differential equations (9.2) or (9.4), there exists a unique integral curve (Fig. 9.2) 
through every point P(xo, yo). 
5. General Solution 
The set of all integral curves of (9.2) can be characterized by one parameter and it can be given by the 
equation 
F(r,y,C) 20 (9.5a) 
of the corresponding one-parameter family of curves. The parameter C, an arbitrary constant, can be 
chosen freely and it is a necessary part of the general solution of every first-order differential equation. 





542 9. Differential Equations 





A particular solution y = (x), which satisfies the condition yo = (xo), can be obtained from the 
general solution (9.5a) if C is expressed from the equation 


F' (xo, Yo, C) =0. (9.5b) 
9.1.1.2 Important Solution Methods 


1. Separation of Variables 
If a differential equation can be transformed into the form 


M(ax)N(y)da + P(x)Q(y)dy = 0, (9.6a) 
then it can be rewritten as 
R(«)dax + S(y)dy = 0, (9.6b) 


where the variables x and y are separated into two terms. To get this form, equation (9.6a) is divided 
by P(x) N(y). The νο solution of (9.6a) is 


^ d r+ [29 πον -0 (9.7) 


If for some ne xz = Tor y = y, the functions P(r) or N(y) or both are equal to zero, then the 
constant functions x = % or/and y — y are also solutions of the differential equation. They are called 
singular solutions. 





d d: 
E dy ydx — 0; | T = =C; Injy|+In|z| = C =In|c|; yz = c. If one allows also c = 0 


in this final equation, then one has the singular solutions y = 0 and x = 0. 


2. Homogeneous Equations 
If M(z, y) and N (x, y) are homogeneous functions of the same order (see 2.18.2.6, 1., p. 122), then in 
the equation 


M(a,y)dx + N(x, y)dy = 0 (9.8) 
the variables can be separated by substitution of u — y/x. 
E r(x- e +4? = 0 with y = u(x)x, gives (1 — u)u’ + u/x = 0, then by separation of the variables 
ο... =) ty --/; — da. After integration: In |r|-In |u-u| 2 C = Incl, ux = ce", y = ce"/?. 


As can be seen in the preceding paragraph, Separation of Variables, the line x = 0 is also an integral 
curve. 


3. Exact Differential Equations 


An exact differential equation is an equation of the form 


M(a,y)dz + N(a,y)dy=0 or N(x, y)y' + M(x,y) = 0, (9.9a) 
if there exists a function &(x, y) of two variables such that 
M (v, y)dz 4- N(x,y)dy = d(x,y), (9.9b) 


i.c., if the left side of (9.9a) is the total differential of a function d$ (z, y) (see 6.2.2.1, p. 447). If functions 
M (x, y) and N (x, y) and their first-order partial derivatives are continuous on a connected domain G, 
then the equality 


OM | ON 

Oy Ox 
is a necessary and sufficient condition for equation (9.9a) to be exact. In this case the general solution 
of (9.9a) is the function 

(x,y) =C (C = const), (9.94) 


(9.96) 
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which can be calculated according to (8.3.4), 8.3.4.4, p. 522 as the integral 
z y 
(x,y) = [MEn * f NGv.m dn, (9.9e) 
πο yo 


where zo and yo can be chosen arbitrarily from G. 
W Examples will be given later. 
4. Integrating Factor 
A function u(x, y) is called an integrating factor or a multiplier if the equation 
Mdx -- Ndy —0 9.104) 
multiplied by u(x, y) becomes an exact differential equation. The integrating factor satisfies the differ- 
ential equation 
Oln Oln OM ON 
BH - ka σος (9.105) 
Ox Oy ὃν Ox 
Every particular solution p of this equation is an integrating factor. To give a general solution of this 
partial differential equation is much more complicated than to solve the original equation, so usually 
one is looking for the solution u(x, y) in a special form, e.g., u(z), p(y), p(zy) or u(x? + y?). 


N 








E To solve the differential equation (x? + y) dx — x dy = 0, the equation for the integrating factor is 








al Ol dl 
-ᾱ-----. (a? + E — 2. An integrating factor which is independent of y must satisfy x - Kz 
Ox Oy ` δα 

1 1 1 
—2, so jj — —; . Multiplication of the given differential equation by ji yields (1 + +) dx — —dy = 0. 
x x x 


The general solution according to (9.9e) with the selection of a9 = 1, yo = 0 is then: 
c y 
dle) = { (+3) ag- f dj—C or a i 
1 € 0 T 
5. First-Order Linear Differential Equations 
A first-order linear differential equation has the form 
y + P(x)y = Q(z), (9.11a) 
where the unknown function and its derivative occur only in first degree, and P(x) and Q(x) are given 
functions. If P(x) and Q(x) are continuous functions on a finite, closed interval, then the differential 


equation satisfies the conditions of the Picard-Lindelöf theorem (see 12.2.2.4,4., p. 668) in this region. 
An integrating factor is here 


L = exp ([ Par) ; (9.11b) 


the general solution is 


y exy(- f Paz) |[ Qexp([ Par) ar +c]. (9.119) 


Replacing the indefinite integrals by definite ones with lower bound xo and upper bound x in this 
formula, then for the solution y(xo) — C (see 8.2.1.2, 1., p. 495). If yi is any particular solution of the 
differential equation, then the general solution of the differential equation is given by the formula 


y = yı + C exp (- [ Pa). (9.11d) 


If yi(x) and ys(x) are two linearly independent particular solutions (see 9.1.2.3, 2., p. 553), then one 
can get the general solution without any integration as 


y= t Ca - 3) (9.119) 
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W To solve the differential equation y’ — ytanaz — cosz with the initial condition ro = 0, yo — 0. 


x 
Calculating exp (- [ tan x dz) = cos x one gets the solution according to (9.11c): 
Jo 








v. ena 
[ cos“ g dgr = 
0 


1 [^ £ Cos x + z] sinz q 
2 
6. Bernoulli Differential Equations 
The Bernoulli differential equation is an equation of the form 

y + P(x)y = Q(a)y" (n#0,n #1), (9.12) 
which can be reduced to a linear differential equation if it is divided by y" and the new variable z = 
y 7" is introduced. 


y= 


t ] 
COS T J cosg 2 2 cos £ 


4ι 
W Solution of the differential equation y' — s v4/y . Since n = 1/2, dividing by ,/y and introducing 
x 


z 22 α 
η. By using the formulas for the solution of a 
roc 


1 vil 1 
linear differential equation there is exp(/ P dz) = — and z = x? [/ in dx + σ =7° E ln |æ] + c| 5 
x J 22 


the new variable z = ,/y leads to the equation 


1 2 
So, finally, y = a* (5 In |z| 4- c) : 
7. Riccati Differential Equations 
The Riccati differential equation 
y' = P(a)y* + Q(a)y + R(x), (9.13a) 
usually cannot be solved by elementary integration, i.e., not by using a final number of successive ele- 


mentary integrations. However it is possible to transform it by suitable substitutions into differential 
equations for which solutions often can be found. 


Method 1: By the substitution 


u(x) 





= x 131 
Y= pat B(x) (9.13b) 
the Riccati differential equation can be transformed into the normal form 
da 
d =u? + Rolz)  (9.13c) with R(x) = P?6?+ QPB + PR- PP. (9.134) 
X 


Therefore (a) is determined so that terms with the factor u(x) disappear. 
If a particular solution uj; (x) of (9.13c) is known, which can be found, e.g., by a suitable approach, then 
by the help of the substitution 
1 
-——— uut x 

u πο Tux) (9.13e) 
(9.13c) is to be transformed into the linear differential equation for z(x): 

2’ }- 2υι(α)2-- 1 --0. (9.139 
From the solution of (9.13f) the solution of (9.13a) is obtained by using (9.13e) and (9.13b). 
Method 2: By the substitution 

v 
Ῥ(α)υ(α) 
(9.154) is transformed into a linear differential equation of second order (see 9.1.2.6,1., p. 560): 


Py" — (P' - PQw' 4- P? Rv — 0. (9.13h) 


— (9.13g) 
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1 4 1 4 
E To solve the differential equation y' + y? + —-y — σπα =0,ie. for P = —-1,Q = ——, R = ο. 
x x ie T 


1 1 
Method 1: One gets B(x) — = and with the help of y — —u(x) — πε e gets the normal form 
x x 


15 
u = u? — m . Particular solutions of the normal form can be got, e.g., with the approach u — ας 
Av x 
3 5 spies’ 1 3 ] : MTM 
u(x) = —, ue(x) = ——. After substituting u = —~ + — the differential equation z’+ —z+1=0 
2r 2r z(r) 2c 5 
: τ r K AK —a* ο : ; : 
follows with the solution z(r) — =i = ee (K const). The inverse transformation gives 
2x4 +2C 
=——-. (C = 4k). 
Y= 5 σα ( ) 


Method 2: According to (9.13h) the Euler differential equation z?v"--xv' —4v — 0 is obtained with the 
1 

general solution v(z) — Cia? 4- C = (see concerning the Euler differential equation, p. 557). One of 
T 


2x* 4-2 
the constants C1 and C$ can be chosen freely, e.g. C9 — —1 then from (9.13h) follows y — E ; 
T — OT 


9.1.1.3 Implicit Differential Equations 


1. Solution in Parametric Form 
Given a differential equation in implicit form 

F(x,y,y') — 0. (9.14) 
"There are n integral curves passing through a point. P(xo, yo) if the following conditions hold: 
a) The equation F(xo, yo, p) = 0 (p = dy/dx) has n real roots pi, .. ., p» at the point. P(zo, yo). 
b) The function F(x, y, p) and its first partial derivatives are continuous at £ = £o, y = Yo, Pp = Pi; 
furthermore OF /Op # 0. 
If the original equation can be solved with respect to y', then it yields n equations of the explicit forms 
discussed above. Solving these equations one gets n families of integral curves. If the equation can be 
written in the form x = (y, y’) or y = y(x, y’), then putting y’ = p and considering p as an auxiliary 
variable, after differentiation with respect to y or x one obtains an equation for dp/dy or dp/dx which 
is solved with respect to the derivative. A solution of this equation together with the original equation 
(9.14) determines a desired solution in parametric form. 
E To get the solution of the differential equation x = yy’ + y?, one substitutes y’ = p and gets x — 


dx 1 1 d 
py + p°. Differentiation with respect to y and substituting = = — results in = = p + (y + 2p) or 
y p p y 
d " 2p? . . . . C + arcsinj 
D -1 E =F 7 . Solving this equation for y one obtains y = —p + a 


Substitution into the initial equation gives the solution for x in parametric form. 








(C const). 


2. Lagrange Differential Equation 
The Lagrange differential equation is the equation 

aly')x + (yy + ely’) = 0. (9.154) 
The solution can be determined by the method given above. If for p — po holds 


a(p) + b(p)p = 0, (9.15b) then a(po)x + b(po)y + c(po) = 0 (9.15c) 


is a singular solution of (9.15a). 
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3. Clairaut Differential Equation 


The Clairaut differential equation is the special case of the Lagrange differential equation if 


a(p) + b(p)p = 0, (9.16a) 
and so it can be transformed into the form 

y — yz f(y’). (9.16b) 
The general solution is 

y = Cz 4 f(C). (9.16c) 


Besides the general solution, the Clairaut differential equation also has a singular solution, which can 
be obtained by eliminating the constant C from the equations 


y = Cz + f(C) (9.164) and 0 -z- f'(C), (9.169) 


The second equation can be obtained by differentiating the first one with respect to C. Geometrically, 
the singular solution is the envelope (see 3.6.1.7, p. 255) of the solution family of lines (Fig. 9.3). 





W Solution of the differential equation y — cy'4-y?. The general solution is y = Ca+C?. The singular 
solution one gets with the help of the equation x + 2C = 0 to eliminate C, and hence x? + 4y = 0. 
Fig. 9.3 shows this case. 


y 





Figure 9.3 Figure 9.4 


9.1.1.4 Singular Integrals and Singular Points 


1. Singular element 
An element (zo, yo, yo) is called a singular element of the differential equation, if in addition to the 
differential equation 


F(x,y,y') =0 (9.17a) 
it also satisfies the equation 

OF 

y = (9.17b) 


2. Singular Integral 
An integral curve from singular elements is called a singular integral curve; the equation 

v(r,y) —0 (9.17c) 
of a singular integral curve is called a singular integral. The envelopes of the integral curves are singular 
integral curves (Fig. 9.3); they consist of the singular elements. 
The uniqueness of the solution (see 9.1.1.1, 1., p. 540) usually fails at the points of a singular integral 
curve. 
3. Determination of Singular Integrals 
Usually one cannot obtain singular integrals for any values of the arbitrary constants of the general 
solution. To determine the singular solution of a differential equation (9.17a) with p — y' one has to 
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introduce the equation 
OF — 
Op — 
and to eliminate p. If the obtained relation is a solution of the given differential equation, then it is 
a singular solution. The equation of this solution should be transformed into a form which does not 
contain multiple-valued functions, in particular no radicals where the complex values should also be 
considered. 
Radicals are expressions obtained by nesting algebraic equations (see 2.2.1, p. 62). If the equation of the 
family of integral curves is known, i.e., the general solution of the given differential equation is known, 
then one can determine the envelope of the family of curves, the singular integral, with the methods of 
differential geometry (see 3.6.1.7, p. 255). 


0 (9.17d) 


: : ' : 4 8 — : 
H Solution of the differential equation « — y — 2y? -- —y? = 0. Substituting y’ = p, the calculation 


9 27 
- . ' . 8 8 E ; ; 
of the additional equation with (9.17d) yields ορ + 9 p? — 0. Elimination of p results in equation a) 
4 
r—y-—O0andb)r—y-— op where a) is not a solution, b) is a solution, a special case of the general 


solution (y — C)? — (x — C)?. The integral curves of a) and b) are shown in Fig. 9.4. 
4. Singular Points of a Differential Equation 
Singular points of a differential equation are the points where the right side of the differential equation 
y'= f(x,y) (9.18a 
is not defined. This is the case, e.g., in the differential equations of the following forms: 
1. Differential Equation with a Fraction of Linear Functions 
dy _ ax + by 
Fey 
has an isolated singular point at (0, 0), since the assumptions of the existence theorem are fulfilled almos 
at every point arbitrarily close to (0, 0) but not at this point itself. The conditions are not fulfilled a 
the points where cx + ey = 0. One can force the fulfillment of the conditions at these points exchanging 
the role of the variables and considering the equation 


(ae — bc £ 0) (9.18b 








dr cx 


απ cu +ey 

dy ax + by” 
The behavior of the integral curve in the neighborhood of a singular point depends on the roots of the 
characteristic equation 

A? — (b 4- c)À -- bc — ae — 0. (9.18d 
The following cases can be distinguished: 


(9.18c 





Case 1: If the roots are real and they have the same sign, then the singular point is a branch point. The 

integral curves in a neighborhood of the singular point pass through it and if the roots of the character- 

istic equation do not coincide, they have a common tangent except for one. If the roots coincide, then 

either all integral curves have the same tangent, or there is a unique integral curve passing through the 

singular point in each direction. 

: : . di 2 ος SET n 

W A: For the differential equation E = «3 the characteristic equation is A? — 3A --2 — 0, 41 — 2, 
dr £ 

As — 1. The integral curves have the equation y — C 2? (Fig. 9.5). The general solution also contains 

the line x = 0 considering the form x? = C1 y. 


d η» 
W B: Thecharacteristic equation for - LEUTE is A? —2\+1 = 0, Ay = Ag = 1. The integral curves 
da: ; 





X 
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are y = x ln |z| + Cx (Fig. 9.6). The singular point is a so-called node. 








d 
W C: The characteristic equation for = “is 2-2) + 1 = 0, Ay = Ag = 1. The integral curves 
dr rz 


x ; 
are y — C z (Fig. 9.7). The singular point is a so-called ray point. 


y y y 
x x x 


Figure 9.5 Figure 9.6 Figure 9.7 


Case 2: If the roots are real and they have different signs, the singular point is a saddle point, and 
two of the integral curves pass through it. 





d 
W D: The characteristic equation for ¥_ Ιλ. 1-- 0, Ay = 1, A; — —1. The integral curves 
m X 


dx 

are z y — C (Fig. 9.8). For C — 0 the particular solutions x = 0, y = 0 hold. 

Case 3: If the roots are conjugate complex numbers with a non-zero real part (Re(A) 4 0), then the 
singular point is a spiral point which is also called a focal point, and the integral curves wind about this 
singular point. 

dy x+y 
dy  r—y 
integral curves in polar coordinates are r — C e? (Fig. 9.9). 


INS, fs, 
WF | 


Figure 9.8 Figure 9.9 Figure 9.10 
Case 4: Ifthe roots are pure imaginary numbers, then the singular point is a central point, or center, 


which is surrounded by the closed integral curves. 


di £ 
E F: The characteristic equation for dt = is A? +1=0, \y =i, Ap = —i. The integral curves are 
z y 





W E: The characteristic equation for is à? — 2A + 2 = 0, à` = 1 + i, àz = 1 — i. The 


xY 


AN 
C7 





x? +y? =C (Fig. 9.10). 

2. Differential Equation with the Ratio of Two Arbitrary Functions 
dy _ P(x,y) 
de Q(a,y) 





(9.19a) 
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has the singular points for the values of the variables where 

P(x,y) = Q(a, y) = 0. (9.19b) 
If P and Q are continuous functions and they have continuous partial derivatives, (9.19a) can be written 
in the form 

dy _ a(x — xo) + b(y — yo) - Pix y) 

= ] (9.19c) 

dv — c(r — vo) t e(y — yo) + Qi(z, y) 
Here zo and yo are the coordinates of the singular point and Pj (x, y) and Q1(z, y) are infinitesimals of a 
higher order than the distance of the point (x, y) from the singular point (£o, yo). With these assump- 
tions the type of a singular point of the given differential equation is the same as that of the approximate 
equation obtained by omitting the terms P, and Q4, with the following exceptions: 





a) If the singular point of the approximate equation is a center, the singular point of the original equa- 
tion is either a center or a focal point. 











a b 
b) If ae — bc — 0, ie., —ora = c = l or a = b = 0, then the type of singular point should be 
c ο 
determined by examining the terms of higher order. 


9.1.1.5 Approximation Methods for Solution of First-Order Differential 
Equations 


1. Successive Approximation Method of Picard 
The integration of the differential equation 
y = f(x,y) (9.203) 


with the initial condition y = yo for x = xo results in the fixed-point problem 
T 
y — yo [ FG, y) dz. (9.20b) 
do 


Substituting another function yi (x) instead of y into the right-hand side of (9.20b), then the result will 
be a new function y»(x), which is different from yi(x), if yi (zx) is not already a solution of (9.20a). Sub- 
stituting y2(a) instead of y into the right-hand side of (9.20b) gives a function y3(a). If the conditions 
of the existence theorem are fulfilled (see 9.1.1.1, 1., p. 540), the sequence of functions y1, ys, ya. ..- 
converges to the desired solution in a certain interval containing the point xo. 

This Picard method of successive approximation is an iteration method (see 19.1.1, p. 949). 

E Solve the differential equation y’ = eë — y? with initial values xo = 0, yo = 0. Rewriting the equa- 
tion in integral form and using the successive approximation method with an initial approximation 


T T 1 
yo(x) = 0 gives: yy =f edr =e- 1, y =f [e (e* 1] de= Be" xe x : etc. 











2. Solution by Series Expansion 
The Taylor series expansion of the solution of a differential equation (see 7.3.3.3, 1., p. 471) can be 
given in the form 
dm y 

Y = yo + (x — x0) yo’ 4 u E y ped 
if the values yo’, yo’, ..., Yo™®, ... of all derivatives of the solution function are known at the initial 
value xo of the independent variable. The values of the derivatives can be determined by successively 
differentiating the original equation and substituting the initial conditions. If the differential equation 
can be differentiated infinitely many times, the obtained series will be convergent in a certain neigh- 
borhood of the initial value of the independent variable. This method can be used also for n-th order 
differential equations. 
Remark: The above result is the Taylor series of the function, which may not represent the function 


(w= 20)" α 
a yof d esse (9.21) 
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itself (sce 7.3.3.3, 1., p. 471). 
It is often useful to substitute the solution by an infinite series with unknown coefficients, and to de- 
termine them by comparing coefficients. 
HA: To solve the differential equation y! = e* — y?, 7 = 0, yg — 0 one can consider the series 
Y = x + agx? + a3x3 +--+ a,x" +--+. Substituting this into the equation considering the formula 
(7.88), p. 470 for the square of the series gives 

2 3 








T ΐ 
ο. [ai22? F- 2a1a52? 4 (a2? | 2a4a3)a* τν 5 τε α -Ε 3 T 6 qn 
1 1 

Comparing coefficients gives: a, = 1, 2a, = 1, 3a3 + a1? = -,4a4 4 2a1a5 — e etc. Solving 
these equations successively and substituting the coefficient values into the series representation yields 

A a g 5 4 2 
y—-rztbc-———— Ur fees 
z 2 6 24 


W B: The same differential equation with the same initial conditions can also be solved in the fol- 
lowing way: Substituting x = 0 into the equation, gives yo’ = 1 and successive differentiation yields 


m a" 


y" 2 e* — 2yy', y" — 1, y" — e? — 2y? — 2yy", yy" = —1, y® = e — 6y'y" — 2yy", y (9 — —5, ete. 
-2 


x Bat 
a at 

3. Graphical Solution of Differential Equations 

The graphical integration of a differential equation is a method, which is 
based on the direction field (see 9.1.1.1, 3., p. 541). The integral curve 
in Fig. 9.11 is represented by a broken line which starts at the given 
initial point and is composed of short line segments. The directions of 
the line segments are always the same as the direction of the direction 
field at the starting point of the line segment. This is also the endpoint 
Figure 9.11 of the previous line segment. 


From the Taylor theorem (see 7.3.3.3, 1., p. 471) follows the solution y — x 4 





4. Numerical Solution of Differential Equations 


The numerical solutions of differential equations will be discussed in detail in 19.4, p. 969. Numerical 
methods are used to determine a solution of a differential equation, if the equation y’ = f(x,y) does 
not belong to the special cases discussed above whose analytic solutions are known, or if the function 
f (x, y) is too complicated. This can happen if f(x,y) is non-linear in y. 


9.1.2 Differential Equations of Higher Order and Systems of 
Differential Equations 


9.1.2.1 Basic Results 


1. Existence of a Solution 


1. Reduction to a System of Differential Equations Every explicit n-th order differential equa- 
tion 


ye! f egy ο) (9.222) 
by introducing the new variables 

ο μα. (9.22b) 
can be reduced to a system of n first-order differential equations 


dy dyi dia 
== — = aae, = f(z, snag ntl 22 
ας δν uu 4 eu F(E: Y Yir Yni) (9.229) 
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2. Existence of a System of Solutions The system of n differential equations 


dy; 


πω ο... (9.234) 
which is more general than system (9.22c), has a unique system of solutions 

yi =yilz) (6—1,2,...,n), (9.23b) 
which is defined in an interval rg — h € zr € zo + h and for x = xo takes the previously given initial 
values y;(zo) ^ yj? (i — 1,2,..., n), if the functions f;(v, yi, yo; . .. , y) are continuous with respect to 


all variables and satisfy the following Lipschitz condition. 
3. Lipschitz condition For the values x, y; and y; + Ay;, which are in a certain neighborhood of 
the given initial values, the functions f; satisfy the following inequalities: 
fil, yr + Ayn, Y2 + Aya,- - -Yn + Ayn) — Filz, Y1, ass) 
< K (lAyi| + [Ayl +--+ |Agn|) (9.24a) 
with a common constant K (see also 9.1.1.1, 2., p. 541). 


This fact implies that if the function f (v, y, y^, . .. , y^?) is continuous and satisfies the Lipschitz con- 
dition (9.24a), then the equation 
y® = f (zu y^... ος ο) (9.24b) 


has a unique solution with the initial values y(zo) — yo, y'(xo) — yo, ..., y" P (vo) — yo), and it 
is (n — 1) times continuously differentiable. 


2. General Solution 


1. The general solution of the differential equation (9.24b) contains n independent arbitrary con- 
stants: 


y — y(z, C4, Co, ..., Cs). (9.25a) 


2. In the geometrical interpretation the equation (9.25a) defines a family of curves depending on n 
parameters. Every single one of these integral curves, i.e., the graph of the corresponding particular 


solution, can be obtained by a suitable choice of the constants C1, C5,..., C4. If the solution has to 
satisfy the above initial conditions, then the values C4, C5, ..., Cn are determined from the following 
equations: 


y(xo, Cy Cn) = Y, 


d 
——JwWz,Cu..., c.) — yo, (9.25b) 
lz des 
πι 
πππυ.Οι...., ο) = yo". 
lz 1 ns 


If these equations are inconsistent for any initial values in a certain domain, then the solution is not 
general in this domain, i.e., the arbitrary constants cannot be chosen independently. 

3. The general solution of system (9.23a) also contains n arbitrary constants. This general solution 
can be represented in two different ways: Either it is given in a form which is solved for the unknown 
functions 


yi = Fix C,...,C4), ys 9 ο ο ο ο. πο ο (9.264) 
or in the form which is solved for the constants 
πο ο = Ci, ο ο ο πο — Cs. (9.269) 


In the case of (9.26b) each relation 
φι(υ, γι»... Yn) = Ci (9.26c) 
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is a first integral of the system (9.23a). The first integral can be defined independently of the general 
solution as a relation (9.26c). That is, (9.26c) will be an identity replacing yi, ya; . - - , Yn by any partic- 
ular solution of the given system and replacing the constant by the arbitrary constant C; determined 
by this particular solution. 

If any first integral is known in the form (9.26c), then the function y;(x, y1,..., Yn) satisfies the partial 
different equation 


Ovi Oi 
δε) SO 26d 
E n + fr(@, yi y δν. 0 (9.264) 


Conversely, each solution y; (2, 41,..-,Yn) of the partial differential equation (9.26d) defines a first inte- 
gral of the system (9.23a) in the form (9.26c). The general solution of the system (9.23a) can be repre- 
sented as a system of n first integrals of system (9.23a), if the corresponding functions y;(2, y1,-.., Yn) 
(i = 1,2,...,n) are linearly independent (see 9.1.2.3, 2., p. 553). 


9.1.2.2 Lowering the Order 


One of the most important solution methods for n-th order differential equations 


ο 0 (9.27) 
is the substitution of variables in order to obtain a simpler differential equation, especially one of lower 
order. Different cases can be distinguished. 

1. f— f(y, y^... y?), ie., £ does not appear explicitly: 


Oi 








t iG yu s.a) 


F yf m)oo. (9.284) 
By substitution 

d d? d; 

=p Z =p... (9.28b) 


dr P da? Pay 
the order of the differential equation can be reduced from n to (n — 1). 
H Reducing the order of the differential equation yy" — y? — 0 to one, with the substitution y' — 
p.pdp/dy — y" it becomes a first-order differential equation y p dp/dy — p? — 0, and y dp/dy — p = 0 
results in p = Cy = dy/dx, y = Cye@*. Canceling p does not result in a loss of a solution, since p = 0 
gives the solution y — C4, which is included in the general solution with C — 0. 
2. f — f(m,y',..., y), i.e, y does not appear explicitly: 


f (s, yes yo) =0. (9.29a) 
The order of the differential equation can be reduced from n to (n — 1) by the substitution 

y =p. (9.29b) 
If the first k derivatives are missing in the initial equation, then a suitable substitution is 

y*D — y, (9.29c) 


Wl The order of the differential equation y" — zy" 4- (y)? — 0 will be reduced by the substitution 
dp [dpV 
y" — p, so one gets a Clairaut differential equation p — στε + (2) — 0 whose general solution 
d z 
Ciz? ο. 
6 2 








isp — Ciz 4 Cj). Therefore, y = + Cox + C3. From the singular solution of the 


3/2 


: ' . : 2/3 . s T : 
Clairaut differential equation p — MS one gets the singular solution of the original equation: 


8v3 
= au^ + Cyr + Co. 
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oy f ία, TR secs y) is a homogeneous function (see 2.18.2.4, 4., p. 122) 
in y, y y"... y 9: 


F (zuy y®) = 0. (9.30a) 
One can reduce the order by the substitution 
ai 
z=% ie, yet, (9.30b) 
y 
: ' > ; dt. um ο j . dz 
W Transforming the differential equation yy" — y" — 0 by the substitution z — y'/y, results in a 
yy" _ y? 2 
Tq. — = 0 so the order is reduced by one. One gets z = C1, therefore, ln|y| = Cix + C2, or 
y 


y — Ce with In |C] = C2. 
4. f=f(a,y,y’,---,y™) isa function of only z: 








y™ = f(x). (9.31a) 
One gets the general solution by n repeated integrations. It has the form 

y = Ci + Cox + Ca? 4 Cua" 4 w(x) (9.31b) 
with 


ψ(α) =J |t% (dz — μυ. dt. (9.31c) 


It has to be mentioned here that ro is not an additional arbitrary constant, since the change in zo results 
in the change of Ci, because of the relation 
1 


Cee a 0. (9.914) 


9.1.2.3 Linear n-th Order Differential Equations 


1. Classification 
A differential equation of the form 


y 4 a y079 4 asy 079 4... pa, way — F (9.32) 
is called an n-th order linear differential equation. Here F and the coefficients a; are functions of x, 
which are supposed to be continuous in a certain interval. If a1,a2,...,a4 are constants, it is called 
a differential equation with constant coefficients. If F = 0, then the linear differential equation is 
homogeneous, and if F Æ 0, then it is inhomogeneous. 
2. Fundamental System of Solutions 
A system of n solutions y1, ys, ... , ya of a homogeneous linear differential equation is called a funda- 
mental system if these functions are linearly independent on the considered interval, i.e., their linear 
combination Cy y; + C2 y2 +++: + Ch Yn is not identically zero for any system of values C1, C2,..., Ch, 
except for the values C] = Cp =--- = C, = 0. The solutions y1, Y2, - - - , Yn Of a linear homogeneous dif- 
ferential equation form a fundamental system on the considered interval if and only if their Wronskian 
determinant 


n ys a Yn, 
w=” ya Ut Un (9.33) 
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is non-zero. For every solution system of a homogeneous linear differential equation the formula of 
Liouville is valid: 


T 
W (x) — W (xg) exp (- fen η : (9.34) 
πο 

It follows from (9.34) that if the Wronskian determinant is zero somewhere in the solution interval, 
then it can be only identically zero. This means: The n solutions yi, y», ..., y. of the homogeneous 
linear differential equation are linearly dependent if even for a single point xo of the considered interval 


W (xo) — 0. If the solutions y1, yo, .. . , y» form a fundamental system of the differential equation, then 
the general solution of the linear homogeneous differential equation (9.32) is given as 
y = Ciyi t C» ya + Cn Yn- (9.35) 


A linear n-th order homogeneous differential equation has exactly n linearly independent solutions on 
an interval, where the coefficient functions a;(x) are continuous. 
3. Lowering the Order 
If a particular solution y; of a homogeneous differential equation is known, by assuming 

y = yr(x)u(x) (9.36) 
one can determine further solutions from a homogeneous linear differential equation of order n — 1 for 
u' (ac). 
4. Superposition Principle 
If yı and yz are two solutions of the differential equation (9.32) for different right-hand sides Fy and 
Fy, then their sum y — yı + y2 İs a solution of the same differential equation with the right-hand side 
F = Fi + Fy. From this observation it follows that to get the general solution of an inhomogeneous 
differential equation it is sufficient to add any particular solution of the inhomogeneous differential 
equation to the general solution of the corresponding homogeneous differential equation. 
5. Decomposition Theorem 
If an inhomogeneous differential equation (9.32) has real coefficients and its right-hand side is complex 
in the form F’ = F, + iF) with some real functions Fi and F5, then the solution y = y; + iye is also 
complex, where y; and ys are the two solutions of the two inhomogeneous differential equations (9.32) 
with the corresponding right-hand sides Fy and F5. 
6. Solution of Inhomogeneous Differential Equations (9.32) by Means 

of Quadratures 


If the fundamental system of the corresponding homogeneous differential equation is already known, 
there are the following two solution methods to continue the calculations: 


1. Method of Variation of Constants Looking for the solution in the form 
y = Cry + Caya +++ + Cni (9.37a) 
where Ci, C5, ..., C4, here treated as functions of x. There are infinitely many such functions, but 
requiring that they satisfy the equations 
Cry + O'y +- Cs κ — 0, 


ο ο ο (9.37b) 
ον .. Co! yo?) che Segre C, y, 79 —0 
and substituting y into (9.32) with these equalities follows 
Cy 7D 4 Oy D e Oy m = F. (9.37c) 








Because the Wronskian determinant of the coefficients in the linear system of equations (9.37b) and 
(9.37c) is different from zero, one gets a unique solution for the unknown functions Cy’, Cs, ..., Cs, 
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and their integrals give the functions C1, C5, . .. , Cs. 
a T "à 1 
WB | - y ————y-z-1. (9.37d) 
1--ᾳ 1-32 

In the interval « > 1 or x < 1 all assumptions on the coefficients are fulfilled. First the homogeneous 
x 1 

equation 7” + D uir e 0 is solved. A particular solution is e; = e”. Then one looks for 
-α -r 


a second one in the form y2 = e”u(x), and with the notation u'(x) = v(x) one gets the first-order 
1 

differential equation v' 4- (1 + =) v = 0. A solution of this equation is v(x) = (1— r)e *, and 
-r 

therefore, u(x) = Jo dx = fa —a)e"dx = xe”. With this result p = x is obtained for 


the second element of the fundamental system. The general solution of the homogeneous equation is 








g(x) = Cie” +Cax. The variation of constants with u(x) and uz(x) instead of C1 (x) and C2(x) is now: 
y(x) — uy(x)e* 4- us(x)x, 
y(x) = u(x) + u(x) + ur (£) -- us'(x)v, | uj'(x)e* -- us' (x) = 0, 
y (x) = u(x) + u (£) + ug (x), uj (x)e? -- uy (x) 2 r—1, so 


uj'(r)-— ze ο, uy (r) 2 —1, ie, "(r)-—-(1-c-z)e *-- Ci, uo(z) 2 —x 4 C2. 
With this result the general solution of the inhomogeneous differential equation is: 


ylz) = —(1 + 22) + Cie” + (C2 — 1)r = —(1 + 2°) + Ci*e* 9 Cs. (9.37e) 
2. Method of Cauchy In the general solution 

y — Cun + Caya +++- + Cryn (9.38a) 
of the homogeneous differential equation associated to (9.32) one determines the constants such that 
for an arbitrary parameter a the equations y = 0, y/ = 0,..., y®-?) 2 0, yo-9 — F (α) are satisfied. 
In this way one gets a particular solution of the homogeneous equation, denoted by y(x, aœ), and then 

T 
y= / οία, a) da (9.38b) 

do 


isa particular solution of the inhomogeneous differential equation (9.32). This solution and their deriva- 
tives up to order (n — 1) are equal to zero at the point x = xo. 

W The general solution of the homogeneous equation associated to the differential equation (9.37d) 
which has been solved by the method variation of constants is y = Cie” + Cox. From this result 
follows y(o) — Cie* 4- Coo. — 0, y'(a) = Cie? -- C3 — a — 1 and g(r,o) — ae *e* — z, so that 
the particular solution y(x) of the inhomogeneous differential equation with y(rg) — y'(xo) = 0 15: 








y(x) = / (ae *e* — x) da = (a + 1)e*7* Ε (αρ -- Ίλα — a? — 1. With this result one can get the 
“αρ 
general βο]πί1οη μ(α) -- ΟἽ” ε” 4- C3*x — (x? -- 1) of the inhomogeneous differential equation. 
9.1.2.4 Solution of Linear Differential Equations with Constant 
Coefficients 


1. Operational Notation 
The differential equation (9.32) can be written symbolically in the form 


P,(D)y 5 (D" -- aD" a4 D"? -. --- as 4D as) y =F, (9.394) 
where D is a differential operator: 
dy ρε. ἂν 


pj Bl. 
97 das da* 


(9.39b) 
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If the coefficients a; are constants, then P,(D) is a usual polynomial in the operator D of degree n. 


2. Solution of the Homogeneous Differential Equation with Constant Coefficients 
To determine the general solution of the homogeneous differential equation (9.39a) with F — 0, i.e., 


P,(D)y = 0 (9.40a) 
one has to find the roots r4, r2,..., 1% of the characteristic equation 

P(r) =r? + air”! + agr™? +--+ as ar 4 a, = 0. (9.40b) 
Every root r; determines a solution e"** of the equation P,(D)y = 0. Ifaroot 7; has a higher multiplicity 
k, then xe", x2e®,..., c*—te" are also solutions. The linear combination of all these solutions is the 
general solution of the homogeneous differential equation: 

y= Cie + Coe™2* tees fe (C; + Crit Tec Cii) de (9.40c) 





If the coefficients a; are all real, then the complex roots of the characteristic equation are pairwise 
conjugate with the same multiplicity. In this case, for r1 — o 4- if and ra — o — iB one can replace the 
corresponding complex solution functions e"'* and e"?* by the real functions e** cos Bx and e?* sin Bx. 
'The resulting expression C1 cos fx: 4- Ca sin Bx can be written in the form A cos(fr -- p) with some 
constants A and q. 
W In the case of the differential equation y + y® — y” — y = 0, the characteristic equation is rê + 
r* — 7? — 1 2 0 with roots r1 — 1, r3 — —1, Ta4 — i, 15,6 = —i. The general solution can be given in 
two forms: 
y = Cie” + Coe ” + (C3 + Cax) cosx + (Cs + Cex) sina, or 
y = Cie” + Cae * + A cos(x + yi) + Aa cos(z + v). 











3. Hurwitz Theorem 
In different applications, e.g., in vibration theory, it is important to know whether a solution of a given 
homogeneous differential equation with constant coefficients tend to zero for x — +00 or not. It tends 
to zero, obviously, if the real parts of the roots of the characteristic equation (9.40b) are negative. 
According to the Hurwitz theorem an equation 

αμα" ραπ. ια p. paqr-cag-—0 (9.41a) 
has only roots with negative real part if and only if all the determinants 

















αιαρθ ...0 
αι αι a ao 0 E a a 0 
1 do 3 0204... 
Di =a, D= , D3 = ΟΡΟ”... 
a3 02 ganas . —— ipee 
000 Gn 
(with an =0 for m>n)  (9.41b) 
are positive. The determinants Dj haveon their diagonal the coefficients a1, a5, ..., ay (k — 1,2,...,m), 


and the cocfficient-indices are decreasing from left to right. Coefficients with negative indices and also 
with indices larger than n are all put to 0. 
W For a cubic polynomial the determinants have in accordance to (9.41b) the following form: 


Ta αι αρ 0 
Di =a, D= Ps E , D3 =| ag a2 ay |. 
9m 0 0 aa 














4. Solution of Inhomogeneous Differential Equations with Constant Coefficients 
These differential equations can be solved by the method variation of constants, or by the method of 
Cauchy, or with the operator method (see 9.2.2.3, 5., p. 588). If the right-hand side of the inhomo- 
geneous differential equation (9.32) has a special form, then a particular solution can be determined 
easily. 


1.Form: F(a) = Ae**, P,(a) #0 (9.42a) 
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A particular solution is 


Ae** 





= ; 9.42b 
Pala) ( ) 
If a is a root of the characteristic equation of multiplicity m, i.e., if 
Py (a) = Ph’ (a) =... = Pa -Y(a) = 0, (9.42c) 
emer 
then y = Pa) is a particular solution. These formulas can also be used by applying the decompo- 
nM (o 
sition theorem, if the right side is 
F(x) = Ae coswa or Ae?*sinwz. (9.42d) 


The corresponding particular solutions are the real or the imaginary part of the solution of the same 
differential equation for 
F(a) = Ae®**(coswa + isinwx) = Aet) (9.42e) 
on the right-hand side. 
Wi A: For the differential equation y" — 6y' + 8y = e?*, the characteristic polynomial is P(D) = 
—6D +8 with P(2) = 0 and P'(D) = 2D — 6 with P’(2) = 2-2—6 = —2, so the particular solution 














T rer 
δα 
E B: The differential equation y” + y’ + y — e? sin z results in the equation (D? + D + 1)y = eA. 
. . et tie e* (cosa + isin) . . 
From its solution y - - - one gets a particular solution yı = 
(1-1? - (14i) 41 2+ 3i 


e? 
BÊ sinx — 3 cos x). Here y; is the imaginary part of y. 


2. Form: F(x) = Qn(x)e%, Qn(x) is a polynomial of degree n (9.43) 
A particular solution can always be found in the same form, i.e., as an expression y — R(r)e^*. R(x) is 
a polynomial of degree n multiplied by x” if a is a root of the char. acteristic equation with a multiplicity 
m. Considering the coefficients of the polynomial R(x) as unknowns and substituting the expression 
into the inhomogeneous differential equation a linear system of equations is obtained for the coeffi- 
cients, and this system of equations always has a unique solution. 
"This method is very useful especially in the cases of F(x) — Q, (x) for a — 0and F(x) — Q,(x)e'* coswx 
or F(x) — Q,(x)e'* sinwz for à — r X iw. There is a solution in the form y — z"e'*[M, (x) coswz 4 
Na (x) sin wa]. 

n a 


Wl Tho roots of the characteristic equation associated to the differential equation y® + 2y” + y" = 
6x + 2xsin x are kı = kə = 0, ka — k4 — —1. Because of the superposition principle (see 9.1.2.3, a. 

p. 554), one can calculate the particular solutions of the inhomogeneous differential equation for the 
summands of the right-hand side separately. For the first summand the substitution of the given form 
yi — a? (ax 4- b) results in a right-hand side 12a 4- 2b -- 6ax — 6x, and so: a — 1 and b — —6. For the sec- 
ond summand one substitutes yo — (cx-4-d) sin x-- (f x-4-g) cos x. One gets the coefficients by coefficient 
comparison from (2g4-2 f — 6c--2 f x) sin z — (2c--2d4- 6f --2ex) cos c — 2rsinz,soc 20, d — —3, f — 
1, g — —1. Therefore, the general solution is y — ci--cax — 6a? 4-1? 4 (c3z--c4)e 7 —3 sin z4- (z— 1) cos x. 











3. Euler Differential Equation 
'The Euler differential equation 
n 
S* ax(cx + d)'y™ = F(x) (9.44a) 
k=0 
can be transformed with the substitution 


cx+d= (9.44b) 
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into a linear differential equation with constant coefficients. 

Wl The differential equation z?y" — 5zy' 4- 8y — x? is a special case of the Euler differential equation 

for n = 2. With the substitution x = et it becomes the differential equation discussed earlier in W A, 
24 d t x 

p. 557: e E ο +8y = e”. The general solution is y = Ce! + Coe“ — z” — Cy? 4 Oga* — = ln |æ]. 


9.1.2.5 Systems of Linear Differential Equations with Constant 
Coefficients 


1. Normal Form 
The following simple case of a system of first-order linear differential equations with constant coeffi- 
cients is called a normal system or a normal form: 


yr = ayy + A242 + +++ + ainYns 
Yo! = Anyi + azz + +++ + G2nYn; (9.45a) 





Yn = Ani Yi + Gn2¥2 + +++  OnnUn- 


To find the general solution of such a system, one has to find first the roots of the characteristic equation 


11 — T 012 ++ Ain 
21 1923 — T ... Q2n -— (9 45b) 
Ani απο ann — T 


To every single root r; of this equation there is a system of particular solutions 
yı = Aye™™, yo = Age”, ... Yn = Ane”, (9.456) 


whose coefficients A, (k = 1,2,...,n) are determined from the homogeneous linear equation system 


(αι — ri) A1 9 a1242 4: a4, As — 0, 
MEI (9.45d) 
Qni Ay + GngAg +°++ + (Gnn — Ti) An = 0. 


This system gives the relations between the values of the coefficients Ap (see Trivial Solution and Fun- 
damental System in 4.5.2.1, 2., p. 309). For every r;, the particular solutions determined this way will 
contain an arbitrary constant. If all the roots of the characteristic equation are different, the sum of 
these particular solutions contains n independent arbitrary constants, so in this way one gets the gen- 
eral solution. If a root r; has a multiplicity m in the characteristic equation, the system of particular 
solutions corresponding to this root has the form 


yr — Αι(α)ε"", ya = Ao(x)e"™”,...,Yn = An(a)e™, (9.450) 


where A1 (x), ..., A; (x) are polynomials of degree at most rn — 1. After substituting these expressions 
with unknown coefficients of the polynomials A;(2) into the differential equation system one first can 
cancel the factor e**, then one compares the coefficients of the different powers of x to have linear 
equations for the unknown coefficients of the polynomials, and among them m can be chosen freely. In 
this way, one gets a part of the solution with m arbitrary constants. The degree of the polynomials can 
be less than m — 1. 

In the special case when the system (9.45a) is symmetric, i.e., when aj, = agi, then it is sufficient to 
substitute A;(x) = const. For complex roots of the characteristic equation, the general solution can be 
transformed into a real form in the same way as has been shown for the case of a differential equation 
with constant coefficients (see 9.1.2.4, p. 555). 


WB. For the system yi' ^ 2yi 4- 2ys — ya, y?! — —2y1 +4y2 +43, ys’ = —3y1 + 8y2 + 2y3 the characteristic 
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equation has the form 


2-r 2 -1 
24-r 1 
-3 8 2-r 


For the simple root r1 — 6 one gets 441-2453 — A; — 0, 241 —245-4- As — 0, 3414-845 —443 — 0. 





— -(r-6)(r—1)? -0. 














From this system one has A; = 0, Az = 14s = C1, yı = 0, y2 = C1e®”, y3 = 2C1e®®. For the multiple 
root r2 = 1 one gets yı = (Pix + Q1)e”, y2 = (Pax + Q2)e”, y3 = (Psx + Q3)e”. Substitution into the 
differential equations yields 

Pig + (Pi T Q1) = (2P, + 2002 = Γη] + (2Q1 + 2ο = Q3), 

Paz + (Pz + Q2) = (—2P, + 4P2 + P3)a + (—2Q1 + 4Q2 + Q3), 

P3x + (P3 + Q3) = (—3P, + 8P2 + 2P3)a + (—3Q1 + 8Q2 + 2Q3), 
which implies that P 5C5, P, Co, Ps πο Φι 5C3 6C», Q» C5, Q3 7C3 1161. The 


general solution is yy = (5C2x + 5C3 — 6C2)e", yo = Cre® + (Cox + C3)e”, ys = 2Cye® + (7Cox + 
TC3 — 11C3)e*. 
2. Homogeneous Systems of First-Order Linear Differential Equations 
with Constant Coefficients 
have the general form 


Σ αι + Σ; bikYe =O (¢=1,2,...,n). (9.46a) 
k=1 k=1 

















If the determinant det(a;,) does not disappear, i.e., 

ἀοί(αικ) A 0, (9.46b) 
then the system (9.46a) can be transformed into the normal form (9.45a). 
In the case of det (aip) = 0 further investigations are necessary (see [9.15]). 
The solution can be determined from the general form in the same way as shown for the normal form. 
The characteristic equation has the form 

ἀοί(αικτ -- bi) — 0. (9.46c) 
The coefficients A; in the solution (9.45c) corresponding to a single root r; are determined from the 
equation system 

n 

M Y(aar; + bir) Ar =0 (i a, as ,n). (9.464) 

k=1 
Otherwise the solution method follows the same ideas as in the case of the normal form. 
W Thecharacteristic equation of the two differential equations 5yj' 4-414 — 2ys' — y» — 0, yr’ + 8y1—-3y2 = 
0 is: 

5r+4 —2r—1 
r+8 —3 

The coefficients A, and A for rı = 1 can be got from the equations 9A, — 345 = 0, 94; — 345 — 0 
so Ag = 3A, = 3C,. For rg = —2 one gets analogously Ag = 2A; = 2C . The general solution is 
yı = Cie 4- Coe 7, y — 3C4€* 4-205677, 
3. Inhomogeneous Systems of First-Order Linear Differential Equations 
have the general form 


Σ ane + Σ bryk = Fi(z) (1 1,2,...,n). (9.47) 
k=1 k=l 











2r?-2r—4—0, n-1l, rm 2. 
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1. Superposition Principle If y; and y; (j — 1,2,...,n) are solutions of inhomogeneous sys- 
tems which differ from each other only in their right-hand sides F;? and F;?), then the sum Yj = y; + 
yj (j — 1,2,...,n) is a solution of this system with the right-hand side Fi(x) = F;® (x) + F;® (x). 
Because of this, to get the general solution of an inhomogeneous system it is enough to add a particular 
solution to the general solution of the corresponding homogeneous system. 

2. The Variation of Constants can be used to get a particular solution of the inhomogeneous 
differential equation system. To do this one uses the general solution of the homogeneous system, and 
considers the constants C4, C5,..., C4 as unknown functions Ci(x), Co(x),..., C,(x). Then it is to 
be substituted into the inhomogeneous system. In the expressions of the derivatives of y, there is the 
derivative of the new unknown functions C;(2). Because yj, ya, .. . , y, are solutions of the homogeneous 
system, the terms containing the new unknown functions will be canceled; only their derivatives remain 
in the equations. This gives for the functions C; (x) an inhomogeneous linear algebraic equation system 
which always has a unique solution. After n integrations one gets the functions C; (x), Co(z), . .., Ca (x). 
Substituting them into the solution of the homogeneous system instead of the constants results in the 
particular solution of the inhomogeneous system. 

E For the system of two inhomogeneous differential equations 5y” + 4yı — 2y? — ο ει yi + 
8y1 — 3y2 = 5e™” the general solution of the homogeneous system is (see p. 559) y, = Cye® + Coe 7", 
να -- 3016" ἠ- 2C5e7*. Considering the constants C, and C5 as functions of x and substituting into 
the original equations gives 5C4'e? 4- 5C5'e-?" — 6C4'e* — 4Cs'e-?9 — e-7, Ce? 4- Cy'e?* = be^? or 
Cs'e?* — Ore — e, C,'e 4 Cy e?" — 5e7*, Therefore, 2C,'e* — 4e7*, C, — —e^?* + const, 
2Cs'e?* — 6e~*, Cy = 3e” + const. Since a particular solution is searched for, one can replace every 
constant by zero and the result is y; = 2e~*, yg = δε”. The general solution is finally yy — 2e * + 
Cie? 4- Coe 77, y — 3e? 4- 3C4€* - 205677. 

3. The Method of Unknown Coefficients is especially useful if on the right-hand side there 
are special functions in the form Q;(x)e?*. The application is similar to the one, used for differential 
equations of n-th order (see 9.1.2.5, p. 558). 

4. Second-Order Systems 

'The methods introduced above can also be used for differential equations of higher order. For the 
system 


Σ aye” + Σ bikYyk + Σ CikVk — 0 (6— 1,2,...,n) (9.48) 
k-i k-i k=l 


one can determine particular solutions in the form y; — A;e"*. 'To do this, one gets r; from the char- 
acteristic equation det(ajyr? + bir + Cik) = 0, and Aj from the corresponding linear homogeneous 
algebraic equations. 


9.1.2.6 Linear Second-Order Differential Equations 


Many special differential equations belong to this class, which often occur in practical applications. 
Several of them are discussed in this paragraph. For more details of representation, properties and 
solution methods see [9.15]. 


1. General Methods 


1. Solving the Inhomogeneous Differential Equation by the Help of the Superposition 
Principle 
y" + p(z)y * a(z)y — F(x). (9.494) 
To get the general solution of an inhomogeneous differential equation it is enough to add a particu- 
lar solution of the inhomogeneous equation to the general solution of the corresponding homogeneous 
equation. 
a) The general solution of the corresponding homogeneous differential equation, i.e., with F(x) = 0, is 
y = Cry + Coy. (9.49b) 
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Here yı and ya are two linearly independent particular solutions of (9.49a) (see 9.1.2.3, 2., p. 553). If 
a particular solution y; is already known, then the second one y» can be determined by the equation 
(9.34) of Liouville. From (9.34) follows: 











. E wid t 
n n = yy qnc Y1y2 3 νι ο — y (2) = Aexp ( fro dz) (9.49c) 
giving 
exp(—-fpda) ,. 
yo = Ayı / — (9.494) 


where A can be chosen arbitrarily. 
b) A particular solution of the inhomogeneous equation can be determined by the formula 


y= 3 f EO (f © d£) lale) - s Gove(O de. (9.490) 


where yı and ys» are two particular solutions of the corresponding homogeneous differential equation. 
C) A particular solution of the inhomogeneous differential equation can be determined also by variation 
of constants (see 9.1.2.3, 6., p. 554). 
2. Solving the Inhomogeneous Differential Equation by the Method of Undetermined Co- 
efficients 

s(x)y" c p(z)y - a(x)y — F(x) (9.50a) 
If the functions s(x), p(x), q(x) and F(x) are polynomials or functions which can be expanded into a 
convergent power series around zy in a certain domain, where s(xo) 4 0, then the solutions of this dif- 
ferential equation can also be expanded into a similar series, and these series are convergent in the same 
domain. Here they should be determined by the method of undetermined coefficients: The solution to 
be looking for as a series has the form 


Y = ao - ai(z — zo) 4- as(z — zo? ---- , (9.50b) 
and it has to be substituted into the differential equation (9.50a). Equating corresponding coefficients 
(of the same powers of (x — xo)) results in equations to determine the coefficients ao, a1, 2, . ... 

E To solve the differential equation y” + zy — 0 one substitutes y — ag 4- a1z + aga? + aga? +---, 

















y = a, + 2agr 4- 3aga? + ---, and y" — 2as 4- 6a3x 1 -:: getting 2ag = 0, 6a3 +a) = 0,.... 
a a 
The solution of these equations is ag = 0, a3 9 a , 04 3 y , ds = 0,..., so the solution is 
4 
T 








1 a? zê à a 
y=a t sae 1ο x t πια 
VD 2.3 ^ 2.3.5.6 1 3.4 3-4-6-7 


3. The Homogeneous Differential Equation 
Em 


xy" + ap(x)y’ + q(z)y = 0 (9.51a) 
can be solved by the method of undetermined coefficients if the functions p(x) and q(x) can be expanded 
as a convergent power series of r. The solutions have the form 


y = t" (ao + az +a? +--+), (9.51b) 
whose exponent r can be determined from the defining equation 
r(r — 1) + p(0)r + q(0) — 0. (9.51c) 


Ifthe roots of this equation are different and their difference is not an integer number, then one gets two 
linearly independent solutions of (9.51a). Otherwise the method of undetermined coefficients results 
only one solution. Then with the help of (9.49b) one can get a second solution or at least one can find 
a form which gives a second solution with the method of undetermined coefficients. 


W For the Bessel differential equation (9.52a) one gets only one solution with the method of the un- 
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determined coefficients in the form yı = x ajz"7* (ag 4 0), which coincides with J,(x) up to a 


constant factor. Since exp (- j pdr) = 7 — — one finds a second solution by using formula (9.49d) 
x 
oo 


dr x cy z?* oo 
An f Ay [ = de = By, me +a" Yd 
ya yı z.a^ (Y ms yı qontl Y ΠΠ ax ^ k 


The determination of the unknown coefficients cp and dẹ is difficult from the ap’s. But this last expres- 
sion can be used to get the solution with the method of undetermined coefficients. Obviously this form 
is a series expansion of the function Y, (x) (9.53c). 














2. Bessel Differential Equation 





αγ" + ay! + (2? — n*)y = 0. (9.52a) 
1. The Defining Equation is in this case 
r(r—-1)-r—n? zr? — n? — 0, (9.52b) 
so, r = +n. Substituting 
y = απ (αρ + ax ++) (9.52c) 
into this equation and equating the coefficients of z"** to zero gives 
k(2n T kjak αρ 90. (9.524) 
For k = 1 follows (2n + 1)a, = 0. For the values k = 2,3,... one obtains 
ao 
m. = =1,2,...), —U Uf) pe» 
am =0 (m » m—-3505 3) 
a is arbit (9.590) 
= ~, isa rary. .52e 
a deine ane? Oe 
1 
2. Bessel or Cylindrical Functions The series obtained above for aj = mil where I" is 
npn 


the gamma function (see 8.2.5, 6., p. 514), is a particular solution of the Bessel differential equation 
(9.52a) for integer values of n. It defines the Bessel or cylindrical function of the first kind of index n 





x” x αἱ 
“ία) - πηγη (: 20n42)  2-4- (2n 4 Z 2n 4 4) +) 


( 1k n+2k 
fov) (9.534) 


m 


The graphs of functions Jọ and Jı are shown in Fig. 9.12. 
The general solution of the Bessel differential equation for non-integer n has the form 


y = Cy JIn(x) + CoJ_n(x), (9.53b) 
where J_,(x) is defined by the infinite series obtained from the series representation of J,(a) by replac- 
ing n with —n. For integer n, holds J µ(α) -- (—1)"J, (x). In this case, the term J_,(«) in the general 
solution should be replaced with the Bessel function of the second kind 
Ja (c) cos mz — J_m(x) 


Yn (x) = jim, (9.53c) 


sin mz 
which is also called the Weber function. For the series expansion of Y, (x) sce, e.g., [9.15]. The graphs 
of the functions Yo and Y; are shown in Fig. 9.13. 
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Figure 9.12 Figure 9.13 


3. Bessel Functions with Imaginary Variables In some applications one uses Bessel functions 
with pure imaginary variables. In this case it is to be considered the product i", (ir) which will be 


denoted by 1, (x): 
Gr a 
2 ae pasa, (9.544) 


TNS 
Γ(α) --1-".Δη(1α) -- (5) | 
D(n-1) ΠΡ 123) 21Γ{η 1-3) 
The functions Z, (x) are solutions of the differential equation 
z?y" 4 zy — (z? 4 n?)y — 0. (9.54b) 
A second solution of this differential equation is the MacDonald function 
T nla) — Talg 
Ka(x) = T Lale) = att) 
2 sin nm 
If n converges to an integer number, this expression also converges. 
The functions [,,(a) and K,() are called modified Bessel functions. The graphs of functions Jp and 1; 


are shown in Fig. 9.14; the graphs of functions Ko and A, are illustrated in Fig. 9.15. The values of 
functions Jo(x), Ji(x), Yo(x), Yi(x), Io(x), (x), Ko(x), Ki(x) are given in Table 21.11, p. 1106. 


(9.54c) 
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Figure 9.14 Figure 9.15 Figure 9.16 
4. Important Formulas for the Bessel Functions Jn (x) 
2 dA. 
Jail) + Jay (2) = Zla), πο = —=Jn(a) + Jn—1(z). (9.55a) 
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The formulas (9.55a) are also valid for the Weber functions Y, (x). 








L.a(z) μμ) τον Mal) pa (s) ~In(2), (9.55b) 
Καμία) Kya(e) = Ao), mE = —K,_1(2) T kale). (9.556) 


For integer numbers n the following formulas are valid: 


71/2 


Jon (x) = 2 / cos(x sin y) cos 2ny dy, (9.55d) 
π 
0 
9 π/2 
Jan41(£) = — / sin(z sin o) sin(2n 4- 1) do (9.55e) 
W 
0 


or, in complex form, 


y T 





(x) 


cit cos P cos ny dg. (9.55f) 
0 
The functions J;41/2(«) can be expressed by using elementary functions. In particular, 


2 2 
ρα) 4] zn (9.564) αρ) EI (9.56b) 


By applying the recursion formulas (9.55a)—(9.55f) the expression for Jn+41/2(a) for arbitrary integer n 
can be given. For large values of x i S asymptotic formulas are valid: 


«a [s (2- 2-1) *o(2)]. (9.572) 


T 


Lie .. | +0 (2)] (9.57b) 


Y (j= (2 E (s - > - 1) +O (2)] ; (9.57c) 
E es (ze | +0 (2)] (9.574) 


1 1 
The expression O ( ) means an infinitesimal quantity of the same order as — (see the Landau symbol, 
x x 


B 

LR 

I 
ll 





2.1.4.9, p. 57). 

For further properties of the Bessel functions see [21.1]. 

5. Important Formulas for the Spherical Bessel Functions Spherical Bessel functions of 
the first and second kind j;(z) and m;(z) follow from the Bessel functions of the first and second kind 
Jn(z) (9.53a)and Y,(z) (9.53c) for half odd order index n = m .asin ji(z m NT z) and 


π 
ni(z) — V 5; m (z) with | — 0,1,2,.... They occur as regular or singular solutions of the potential 
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ΤΙΣ 
free radial Schroedinger equation (see 9.2.4.6,3., (9.137b), p. 599) with V(r) 20, E — ds m kr 
m 
and s;(z) = Ri(r): 
2 
d ; 
αρα si(z)] + [27 -- [({1- 1)]9ι(α) 20,  si(z) = ji(z) or ni(z). (9.58a) 


They also occur in the quantum mechanical scattering theory, where the n;(z) are called the spherical 
von Neumann functions. By the help of the Rayleigh formulas 


i) - car (5) 985. o - cx (a) co (9.591) 








follows, so 


sinz sin z — Z cos z 











Jo(2)=——, hz) = 3 eus (9.58c) 
COS Z cos z+ zsin z 
no(z) —, m™(z)= z v (9.58d) 


Complex spherical functions are used with $,,(o) — e"? in the form Yz(8) — Ol"(0)6,, (o), e.g. in 
9.2.4.6, (9.1360), p. 599. In the combined index L — (I, m) 1 — 0,1,2,... denotesthe quantum number 
of the orbital angular momentum. The magnetic quantum number m is restricted to the 2/ + 1 values 
m = —l,—l + 1,---, 4H. With the abbreviations 








ji(kr) — j(kr)Yr(&.), ni(kr)-— m(kr)YL(é))  €,— r (9.59a 
T 
one gets the Kasterinian formulas 
t5 - Y, (T) 3, fuum -Yv (T) c0, (9.590 
ik/ kr ik kr 


where V denotes the nabla operator (s. 13.2.6.1, S. 715). The expansion of a plane wave in terms o 
spherical or Bessel functions gives 





" Meu ME ο dH 
eK Ag M ijp(kr)YT(à), & = E Soe = Vee. (9.59c 
L L I-0 m—-1 
There are the following addition theorems 
i jp (Fs + Το)) = 4r 5 Crr, r1 P jr, (kr3)j7., (kr2) j Tia -— arbitrarily, (9.59d 
LiLa 
ing(k( + Το)) -- 4π 3 Cor ni Pn, (kr). τι» τὰ (9.59e 
Li,La 
with the Clebsch-Gordan coefficients (see 5.3.4.7, p. 345) 
Cirit = [ae Yr(e)Y7, ()Y7, (6) . (9.59f) 


For further details see [21.1], [9.28] until [9.31]. 


3. Legendre Differential Equation 
Restricting the investigations in this book to the case of real variables and integer parameters n = 
0,1,2,... the Legendre differential equation has the form 


(1—a*)y” —2ay'+n(n+1)y=0 or. ((1— a2)y - n(n 4 1)y ^ 0. (9.60a) 
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1. Legendre Polynomials or Spherical Harmonics of the First Kind are the particular solu- 
tions of the Legendre differential equation for integer n, which can be expanded into the power series 


oo 
y= ` a,x”. The method of undetermined coefficients yields the polynomials 
v=0 


Gi (2n)! | s n(n—1l) 435, n(n—1)(n—2)(n— 3) n4 Er 


πω, 7 


([z| € co;n 2 0,1,2,.. 





1-c 1 d^(a?—1) 
P,(x)— Finca, nl; = ; 
(x) (s T τ 2 ) 2an! da” 


2(2n — 1) 2-4(2n — 1)(2n — 3) | 


.). (9.60b) 


(9.60c) 


where F’ denotes the hypergeometric series (see 4., p. 567). The first eight polynomials have the fol- 


lowing simple form (see 21.12, p. 1108): 


Po(a) =1, (9.60d) Ῥι(α) τα, 
Py(x) — Ler - 1), (9.60f) P3(x) = je — 32), 


1 1 
Py(x) = go —302?--3),  (9.60h) P3(x) = g (63a° — 7023 + 152) , 


(9.60e) 


(9.60g) 


(9.601) 


1 1 
Po(x) = 45 (2312—31524-10527—5) (9.60) P;(x) — — (42927 —693a?--3152? — 352) .(9.60k) 


16 


The graphs of P,,(a) for the values from n = 1 to n — 7 are represented in Fig. 9.16. The numerical 


values can be calculated easily by pocket calculators or from function tables. 


2. Properties of the Legendre Polynomials of the First Kind 
a) Integral Representation: 


1 1f dip 
ο - f 3 n -- 
P(x) /α + cos pvV'x? — 1)" dy π / (x X cos oy/z2 — 1) 


The signs can be chosen arbitrarily in both equations. 

















b) Recursion Formulas: 
(η 1- 1) P4a(z) — (2n - 1)y P,(z) 2 nP,i(z) (n2 1;Py(z) — 1, Pi(x) = 2), 








(2? — 1) Pal) — n[|vP,(x) — P41(z)]) (n2 1). 
dx 
c) Orthogonality Relation: 
1 0 form £n, 
ΝΠ = 2 form =n. 
E 2n --1 


(9.61a) 


(9.61b) 


(9.61c) 


(9.61d) 


d) Root Theorem: All the n roots of P,(a) are real and single and are in the interval (—1, 1). 


e) Generating Function: The Legendre polynomial of the first kind can be represented as the power 


series expansion of the function 


1 oo 
--------------- = PW. 
VÀ — 2rz 4 r? 2 d 


For further properties of the Legendre polynomials of the first kind see [21.1]. 


(9.61e) 
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3. Legendre Functions or Spherical Harmonics of the Second Kind A second particular 

solution Q; (x) can be got, which is valid for || » 1 and linearly independent of P, (x), see (9.61a), by 
—(n+1) 

the power series expansion 5 bya”: 


ν---οο 








TE orp (Ht nee ees 1) 
(2n + 1)! 2 2 2 7r 
2"(n!)? MUN (n + 1)(n + 2) α-(ο13) 
(2n + 1)! 2(2n + 3) 
(at Dnt 2) 4 +S) ety a: | 
2-4- (2n + 3)(2n +5) 


The representation of Q„ (x) valid for |x| < 1 is: 





(9.62a) 


1 112 
zP(r)lnn TE 
2 l-r 





Qn(x) = = 3 t PeaG) Ps). (9.62b) 


The spherical harmonics of the first and second kind are also called the associated Legendre functions 
(see also 9.2.4.6, 4., (9.138c), p. 600). 
4. Hypergeometric Differential Equation 


The hypergeometric differential equation is the equation 


d? d 
RE F Iv - (o4 B+ 1a)" aBy = 0, (9.63a) 


where a, f, ^; are parameters. It contains several important special cases. 




















1-2 
a) Fora=n+1, 8 π, ΥΞ 1, απά 1 5 it is the Legendre differential equation. 


b) If y £ 0 or y is not a negative integer, it has the hypergeometric series or hypergeometric function 
as a particular solution : 
a-b a(a 4- 1)8(B 4-1) 4 
F(a,8,y;z) 1 --᾽--ᾱ----------τ 1... 
(α,β, Ύ;α) 1-3 1-2-9407 +1) 





4, 9(a - 1)... (a - n)B(B * 1)... (B 1- n) ni bts, 9.63b) 

1-2...(n- 1) -y(y 1)... (y n) 
which is absolutely convergent for |v| « 1. The convergence for r — +1 depends on the value of 
ô = y — a — B. Forz — litis convergent if ó ^ 0, it is divergent if 6 < 0. For x — —1 it is absolutely 
convergent if 6 < 0, it is conditionally convergent for —1 « 6 «€ 0, and it is divergent for ó € —1. 
c) For 2 — y Z 0 or not equal to a negative integer it has a particular solution 


yai F(a1—y,841—4,2— 3,2). 9.63c) 
d) In some special cases the hypergeometric series can be reduced to elementary functions, e.g. 








, 





F(1,8,6;2) = F(a,la;z) = i l 


-T 


, (9.0649)  F(—-n,8,8;—x) — (14- x)", 9.64b) 


arcsin x 





— In(1- m) 
= L5 


113 
F(1,1,2; -x) (9.64c) F (5.5. 5:29) = 9.64d) 


£ 





jim F (ο, 1; z) =e, 9.049) 
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5. Laguerre Differential Equation 
Restricting the investigation to integer parameters (n = 0,1,2,...) and real variables, the Laguerre 
differential equation has the form 








vy” +(a+1—2)y' +ny=0. (9.65a) 
Particular solutions are the Laguerre polynomials 
Beg dh = 2 (n c o (-«)* 
LO (z) = p R= ; 651 
n (a) n! ane E 2. n—k] k (9.65Þb) 
The recursion formula for n > 1 is: 
(n+1)L (9 (a £) = (~x + 2n +a + 1)L® (£) — (n + a)L® (a), (9.65c) 
L(yj-21 LP =1+a- r. (9.65d) 
An orthogonality relation for a > —1 holds: 
oo 0 form z n, 
TT ya e) Pos ros — 
[5 LO) (x) D(x) de = (" g ) ος (9.650) 
n 


I' denotes the gamma function (see 8.2.5, 6., p. 514). 
6. Hermite Differential Equation 

Two defining equations are often used in the literature: 
a) Defining Equation of Type 1: 


y” — sxy +ny=0 (n=0,1,2,...). (9.66a) 
b) Defining Equation of Type 2: 
y' —2xy -- ny 20 (n=0,1,2,...). (9.66b) 


Particular solutions are the Hermite polynomials, He, (x) for the defining equation of type 1, and H,,(x) 
for the defining equation of type 2. 


a) Hermite Polynomials for Defining Equation of Type 1: 


αλ. αἲ x 
He,(z) — (-1)^ Je ) Zx w(- z) 
— a^ — [7a c 1.3( "7 ]a74 1.3.5 : gr 64... (neN) (9.66c 
2 4 6 
For n > 1 the following recursion formulas are valid: 
Henii(x) = «He,(x) — nHen_1(x), (9.664) Περ(α) --1, He(x) = 2. (9.66e 
The orthogonality relation is: 
+0 9 
T for m £n, 
—— | Hegs(x)Hegs(x ΙΝ 9.66Ε 
[ool 3 rit) Heg (n) ae nlv2xm form nm. ( 


b) Hermite Polynomials for Defining Equation of Type 2: 





Hn (x) = (—1)" exp (a?) B exp (74?) (n € N). (9.66g 


The relation with the Hermite polynomials for defining equation of type 1 is the following: 


enle) — 27" τ n j .66h 
He,(x) — 2 *& (35) (n € IN) (9.661 
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9.1.3 Boundary Value Problems 
9.1.3.1 Problem Formulation 


1. Notion of the Boundary Value Problem 

In different applications, e.g., in mathematical physics, differential equations must be solved as so- 
called boundary value problems (see 9.2.3, p. 589), where the required solution must satisfy previously 
given relations at the endpoints of an interval of the independent variable. A special case is the linear 
boundary value problem, where a solution of a linear differential equation should satisfy linear boundary 
value conditions. In the following section the discussion is restricted to second-order linear differential 
equations with linear boundary values. 


2. Self-Adjoint Differential Equation 


Self-adjoint differential equations are important special second-order differential equations of the form 


[py'! — ay + Aoy = f. (9.67a 
The linear boundary values are the homogeneous conditions 
Aoy(a) + Boy!(a) =0,  Ary(b) + Bry'(b) = 0. (9.67b 


The functions p(z), p'(x), q(x), o(x), and f(a) are supposed to be continuous in the finite interva 
a < x < b. In the case of an infinite interval the results change considerably (see [9.5]). Furthermore, 
it is supposed that p(x) > po > 0, o(x) » 9o » 0. The quantity A, a parameter of the differentia 
equation, is a constant. For f — 0, it is called the homogeneous boundary value problem associated to 
the inhomogeneous boundary value problem. 

Every second-order differential equation of the form 


Ay" + By' + Cy + ARy = F (9.67c 
can be reduced to the self-adjoint equation (9.67a) by multiplying it by p/A if in [a,b], A z 0, and 
performing the following substitutions 


R pC pR 
= — dx], =-—, =—. 67d 
p = exp g A z) q A ΟΞ (9.67¢ 
To find a solution satisfying the inhomogeneous conditions 
Aoy(a) + Boy'(a) = Co, Ary(b) + Biy'(b) = C1 (9.67e 





one returns to the problem with homogeneous boundary conditions, but the right-hand side f(x 
changes and y = z + u is substituted where u is an arbitrary twice differentiable function satisfying 
the inhomogeneous boundary conditions and z is a new unknown function satisfying the corresponding 
homogeneous conditions. 

3. Sturm-Liouville Problem 

For a given value of the parameter A there are two cases: 

1. Either the inhomogeneous boundary value problem has a unique solution for arbitrary f(x), while 
the corresponding homogeneous problem has only the trivial, identically zero solution, or, 

2. The corresponding homogeneous problem also has non-trivial, i.e., not identically zero solutions, 
but in this case the inhomogeneous problem does not have a solution for arbitrary right-hand side; and 
if a solution exists, it is not unique. 

The values of the parameter A, for which the second case occurs, i.e., the homogeneous problem has a 
non-trivial solution, are called the eigenvalues of the boundary value problem, the corresponding non- 
trivial solutions are called the eigenfunctions. The problem of determining the eigenvalues and eigen- 
functions of a differential equation (9.67a) is called the Sturm-Liouville problem. 


9.1.3.2 Fundamental Properties of Eigenfunctions and Eigenvalues 


1. The eigenvalues of a boundary value problem form a monotone increasing sequence of real numbers 
No < AL < Ag <0 Ano, (9.68a) 
tending to infinity. 
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2. The eigenfunction associated to the eigenvalue A, has exactly n roots in the interval a «€ x < b. 
3. If y(x) and z(x) are two eigenfunctions belonging to the same eigenvalue A, they differ only in a 
constant multiplier c, i.e., 

2(x) = cy(x). (9.68b) 
4. Two eigenfunctions y; (2) and yo(x), associated to different eigenvalues \; and Ag, are orthogonal to 
each other with the weight function o(x) 


[οι yo(x) o(x) dx — 0. (9.68c) 


5. If in (9.67a) the coefficients p(x) and q(x) are replaced by p(x) > p(x) and q(x) > q(x), then the 
eigenvalues will not decrease, i.e., Ai > An, where \n and A, are the n-th eigenvalues of the modified 
and the original equations respectively. But if the coefficient o(r) is replaced by o(r) > o(x), then 
the eigenvalues will not increase, i.e., Ap < Àn. The n-th eigenvalue depends continuously on the 
coefficients of the equation, i.e., small changes in the coefficients will result in small variations of the 
n-th eigenvalue. 

6. Reduction of the interval [a, b] into a smaller one does not result in smaller eigenvalues. 


9.1.3.3 Expansion in Eigenfunctions 


1. Normalization of the Eigenfunction 
For every A, an eigenfunction o, (x) is chosen such that 


J len(x)Pola) dx =1. (9.69a) 


It is called a normalized eigenfunction. 
2. Fourier Expansion 
To every function g(x) defined in the interval [a, b], one can assign its Fourier series 


x b 
g(x) e È capal), Gr = E ts (x) o(x) dx (9.69b) 


with the eigenfunctions of the corresponding boundary value problem, if the integrals in (9.69b) exist. 
3. Expansion Theorem 

Ifthe function g(x) has a continuous derivative and satisfies the boundary conditions of the given prob- 
lem, then the Fourier series of g(x) (in the eigenfunctions of this boundary value problem) is absolutely 
and uniformly convergent to g(x). 

4. Parseval Equation 

If the integral on the left-hand side exists, then 


/ [ο(α))Λο(α) απ -- Do (9.699) 


is always valid. The Fourier series of the function g(x) converges in this case to g(a) in mean, that is 
b N 2 
lim ο) — So enyn(x)| o(x)dx — 0. (9.69d) 
n=0 


N-00 J 
a 


9.1.3.4 Singular Cases 


Boundary value problems of the above type occur very often in solving problems of theoretical physics 
by the Fourier method, however at the endpoints of the interval [a, b] some singularities of the differen- 
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tial equation may occur, e.g., p(r) vanishes. At such singular points some restrictions are imposed on 
the solutions, e.g., continuity or being finite or unlimited growth with a bounded order. These condi- 
tions play the role of homogeneous boundary conditions (see 9.2.3.3, p. 591). In addition, often occurs 
the case where in certain boundary value problems homogeneous boundary conditions should be con- 
sidered, such that they connect the values of the function or its derivative at different endpoints of the 
interval. Often occur the relations 


y(a) — vb),  p(a)y'(a) — p(b)y' (b), (9.70) 
which represent periodicity in the case of p(a) — p(b). For such boundary value problems everything 
being introduced above remains valid, except statement (9.68b). For further discussion of this topic 
see [9.5]. 


9.2 Partial Differential Equations 


9.2.1 First-Order Partial Differential Equations 
9.2.1.1 Linear First-Order Partial Differential Equations 


1. Linear and Quasilinear Partial Differential Equations 
'The equation 
Oz Oz Oz 
Kis λος +°°- + Xn=— 
tar Ore "Or, 
is called a linear first-order partial differential equation. Here z is an unknown function of the inde- 
pendent variables z4,...,:,, and X4,..., X4, Y are given functions of these variables. If functions 
Xj,..., X4, Y depend also on z, the equation is called a quasilinear partial differential equation. In the 
case of 
Yao, (9.71b) 
the equation is called homogeneous. 
2. Solution of a Homogeneous Partial Linear Differential Equation 
The solution of a homogeneous partial linear differential equation and the solution of the so-called 
characteristic system 


=Y (9.71a) 


da: απ dx 
Muni LLL M E Ἡ (9.72a) 
Xi X3 Xn 
are equivalent. This system can be solved in two different ways: 
1. Any z,,for which X, 7 0, can be chosen as an independent variable, so the system is transformed 
into the form 


dx j X. j 

— =- (41Ξ 1... τι). 9.72b 

dx, Xz 6 ο ( ) 
2. A more convenient way is to keep symmetry and to introduce a new variable t getting 

dx; 

e = Xp GH 2325.4). (9.72c) 


Every first integral of the system (9.72a) is a solution of the homogeneous linear partial differential 
equation (9.72a,b), and conversely, every solution of (9.72a,b) is a first integral of (9.72a) (see 9.1.2.1, 
2., p. 551). If the n — 1 first integrals 


Qi(1,...,24,) 20 (£2 12,...,n— 1) (9.724) 
are independent (see 9.1.2.3, 2., p. 553), then the general solution is 
z = (y1, - --, Pn-1)- (9.72e) 


Here @ is an arbitrary function of the n — 1 arguments y; and a general solution of the homogeneous 
linear differential equation. 
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3. Solution of Inhomogeneous Linear and Quasilinear 

Partial Differential Equations 
To solve an inhomogeneous linear and quasilinear partial differential equation (9.71a) one can try to 
find the solution z in the implicit form V(21,...,2n,z) — C. The function V is a solution of the 
homogeneous linear differential equation with n 4- 1 independent variables 


oV oV oV OV 
X X. fees +X, ΕΥ =0, 9.73a 
t ὅσι Tp Ox ” ὅση Oz ( a) 


whose characteristic system 





drj dx dz, dz 
= Εις Ss e 159: 
τπτ. X. ¥ (9.73b) 
is called the characteristic system of the original equation (9.71a). 


4. Geometrical Representation and Characteristics of the System 
In the case of the equation 
Oz Oz 
Pla,y.2)5~ + Q(a,y,2)5~ = Rix, y, 2) (9.74a) 
Ox Oy 
with two independent variables x; = x and v2 = y, a solution z = f(x,y) is a surface in z, y, z space, 
and it is called the integral surface of the differential equation. Equation (9.74a) means that at every 


. : Oz Oz . 
point of the integral surface z — f(x,y) the normal vector (= δι” 4) is orthogonal to the vector 
x’ Oy 
(P, Q, R) given at that point. Here the system (9.73b) has the form 


dx ο dy o dz 

P(z,yz) Q(nwz) αν) 
It follows (see 13.1.3.5, p. 708) that the integral curves of this system, the so-called characteristics, are 
tangent to the vector (P, Q, R). Therefore, a characteristic having a common point with the integral 
surface z = f(x,y) lies completely on this surface. Since the conditions for the existence theorem 
13.1.3.5, 1., p. 551 hold, there is an integral curve of the characteristic system passing through every 
point of space, so the integral surface consists of characteristics. 
5. Cauchy Problem 


There are given n functions of n — 1 independent variables t4, 19, ..., £4 3: 


(9.74b) 


Tı = U1(t1,te,...,tn-1), T2 =pl aa ae Zn ος ο ης (9.75a) 
The Cauchy problem for the differential equation (9.71a) is to find a solution 
z — (xi, 22,..., t4) (9.75b) 


such that if one substitutes (9.75a), the result is a previously given function v(t4, t9, ..., t4 4): 


eri (t, to, RUNE ἐπ. 1), σα({ι, τοι... -- νο Ust, τοι ντα ἔπ α)] = w(t, bosses Ina). (9.75c) 
In the case of two independent variables, the problem reduces to find an integral surface passing through 
the given curve. Ifthis curve has a tangent depending continuously on a point and it is not tangent to the 
characteristics at any point, then the Cauchy problem has a unique solution in a certain neighborhood 
of this curve. Here the integral surface consists of the set of all characteristics intersecting the given 
curve. For more mathematical discussion on theorems about the existence of the solution of the Cauchy 
problem see [9.15]. 


Oz 
@ A: For the linear first-order inhomogeneous partial differential equation (mz — ny) (nz — 
£ 
Oz . ΜΗ da: dy 
lz) — = ly — mz (l; m, n are constants), the equations of the characteristics are ———— = ———— = 
Oy mz—my nr—iz 
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dz 
ly — max 
as characteristics, whose centers are on a line passing through the origin, and this line has direction 
cosines proportional to l,m, n. The integral surfaces are rotation surfaces with this line as an axis. 


. The integrals of this system are la + my + nz = Ci, £? +y? + 2? = C2. One gets circles 


: ; : : . ς . OZ 
W B: Determine the integral surface of the first-order linear inhomogeneous differential equation E + 





Oz a 

ay 7» which passes through the curve x = 0, z = (y). The equations of characteristics are = 
y 

dy dz je . : - 

uec The characteristics passing through the point (ro, yo, zo) are y = £ — £o + yo, Z = zoe* το. 


A parametric representation of the required integral surface is y = x 4- yo, z — e*q(yo), if we substitute 
£o = 0, 20 = (yo). The elimination of yo results in z — e*o(y — x). 


9.2.1.2 Non-Linear First-Order Partial Differential Equations 
1. General Form of First-Order Partial Differential Equation 


is the implicit equation 


Oz Oz 
Fl a,...,%n,2, ——)---) z— |] = 0. 9.76a 
(s wt δρ x) (9.762) 
1. Complete Integral is the solution 
z — q(zi,... $5; 01... , 0n); 9.76b) 
depending on n parameters a4, ..., à; if at the considered values of z1,... 24, z the functional deter- 


minant (or Jacobian determinant, see 2.18.2.6, 3., p. 123) is non-zero: 


O (zii - Pan) 
O(ai,..., a4) 


2. Characteristic Strip The solution of (9.76a) is reduced to the solution of the characteristic 
system 


#0. 9.76c) 


























dx, u dEn dz —dpı - —dpn 9.76d) 
hn Pro nmPctetbbà XctnZ Xn t+ PnZ 
with 
Z a X; £, pi z, P, E (i=1,...,n). 9.76e) 
The solutions of the characteristic system satisfying the additional condition 
F(21,...,;2n;2,P1,+-+,Pn) =0 (9.76f) 


are called the characteristic strips. 
2. Canonical Systems of Differential Equations 


Sometimes it is more convenient to consider an equation not involving explicitly the unknown function 
z. Such an equation can be obtained by introducing an additional independent variable x,4; = 5 











and an unknown function V(z, ..., 24, 2541), which defines the function z(z1,39,...,14) with the 
equation 
V (23,552.88, 2) =C (9.77a) 
. . : V oV . Oz : 
At the same time, one substitutes the functions — 2 ($ ο ο ο τον in (9.76a). 
it, | OXn41 Ox; 


Then one solves the differential equation (9.76a) for an arbitrary partial derivative of the function V. 
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The corresponding independent variable will be denoted by zx after a suitable renumbering of the other 
variables. Finally, one gets the equation (9.76a) in the form 


oV oV 
mec——dq—L-— (50d) 9.77b 
an’ Ox; ( ᾿ } ( ) 
The system of characteristic differential equations is transformed into the system 


dr; ƏH dp; OH 


p H(zy..., Za 2, pi... pn) 2 0, p 




















de Op dt Or (i=1,...,n) and (9.7Το) 
dV OH OH dp OH 

ο H, . 77d 
dep, Pn Dn ae (9.774) 


Equations (9.77c) represent a system of 2n ordinary differential equations, which corresponds to an 
arbitrary function H(z,... s, 2, pi, ... , pa) with 2n + 1 variables. It is called a canonical system or 
a normal system of differential equations. 

Many problems of mechanics and theoretical physics lead to equations of this form. Knowing a complete 
integral 








V = (zi... 35, 2,01,...,as) +a (9.77e) 
of the equation (9.77b) one can find the general solution of the canonical system (9.77c), since the 
ὃ ὃ 
equations a = bi, an = pi (i = 1,2,...,n) with 2n arbitrary parameters a; and b; determine a 
Gi Ti 


2n-parameter solution of the canonical system (9.77c). 
3. Clairaut Differential Equation 
If the given differential equation can be transformed into the form 


Oz ; 
Z = pp T2P2 + `` + LnPn + f (P1, P2,- --, Pn), Pi = Jz (@=1,...,n), (9.78a) 
ME 

it is called a Clairaut differential equation. The determination of the complete integral is particularly 
simple, because a complete integral with the arbitrary parameters a4, d2,..., Gn is 

Z — d474 + Agta +++» + ἀπῃ + f (01,2, . .. , à). (9.78b) 
WB Two-Body Problem with Hamilton Function: Consider two particles moving in a plane under 
their mutual gravitational attraction according to the Newton field (see also 13.4.3.2, p. 728). Choosing 
the origin as the initial position of one of the particles, the equations of motion have the form 


xr OV dy _ OV. e 














QV i 9.79a 
d? ðr’ d? ὃν va? a? ( a) 
Introducing the Hamiltonian function 
1 k? 
Η -- ορ) +) -— (9.79b) 


2 VEU 
the system (9.79a) is transformed into the normal system (into the system of canonical differential 
equations) 


dx OH dy OH dp OH dq OH 

















di Op’ dt Og’ dt Ox’ dt dy (3:190) 
with variables 
dz d 
σσ πι απο. (9.794) 
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Now, the partial differential equation has the form 


ὃσ 1|(82V , (02V? k? 
πο | =. (9.79e) 
Ot 2 |VOxr ὃν Vx? + y? 
Introducing the polar coordinates p, p in (9.79e) one obtains a new differential equation having the 
solution 








2k? b 
z = —at — bp +c f 2a +4 g dr (9.79f) 
po r rn 
with the parameters a, b, c. The general solution of the system (9.79c) follows from the equations 
Oz Oz 
eat — = wp. i 
θα ΝΣ; Po (9.798) 


4. First-Order Differential Equation in Two Independent Variables 

For vy = 2, £2 = Y, pı = p, p2 = q the characteristic strip (see 9.2.1.2, 1., p. 573) can be geometrically 
interpreted as a curve at every point (x, y, z) of which a plane p(€— x) -- q(rj — y) — € — z being tangent 
to the curve is prescribed. So, the problem of finding an integral surface of the equation 


Oz Oz 
r (xy, T3243 Ξ0 (9.80) 


passing through a given curve, i.e., to solve the Cauchy problem (see 9.2.1.1, 5., p. 572), is transformed 
into another problem: To find the characteristic strips passing through the points of the initial curve 
such that the corresponding tangent plane to each strip is tangent to that curve. One gets the values 
p and q at the points of the initial curve from the equations F (x,y,z, p,q) = 0 and pdx + qdy = dz. 
There can be several solutions in the case of non-linear differential equations. 
Therefore, under the formulation of the Cauchy problem, in order to obtain a unique solution one can 
assume two continuous functions p and q satisfying the above relations along the initial curve. 
For the existence of solutions of the Cauchy problem see [9.15]. 
E For the partial differential equation pq = 1 and the initial curve y = z?, z — 22?, one can choose 
p = v and q = 1/x along the curve. The characteristic system has the form 
dr dy dz _ 2 dp 0 dq 
A gg 5 gw 
'The characteristic strip with initial values zo, yo, zo, po and qo for t = 0 satisfies the equations x = 
xo + got, y = yot pot, z = 2poqot + 20, P = po, q = qo. For the case of po = Xo, Yo = 1/20 the equation 
of the curve belonging to the characteristic strip that passes through the point (xo, yo, zo) of the initial 
curve is 





0. 




















£ = To + > Y To) --ixo, z = 2t 4 2x9. 


Eliminating the parameters xo and t gives 2? — 4vy. For other chosen values of p and q along the initial 
curve one can get different solutions. 

Remark: The envelope of a one-parameter family of integral surfaces is also an integral surface. Con- 
sidering this fact one can solve the Cauchy problem with a complete integral. One finds a one-parameter 
family of solutions tangent to the planes given at the points of the initial curve. Then one determines 
the envelope of this family. 

W Determine the integral surface for the Clairaut differential equation z — pr — gy 4- pq — 0 passing 
through the curve y — z, z — z?. The complete integral of the differential equation is z = ax + by — ab. 
Since along the initial curve p — q — x, one determines the one-parameter family of integral surfaces 


1 
by the condition a — b. When the envelope of this family is found then one gets z — 1€ 4 y). 
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5. Linear First-Order Partial Differential Equations in Total Differentials 
Equations of this kind have the form 

dz = fidzi + fodtg +--+ frdtn, (9.81a) 
where fi, f, ..., f, are given functions of the variables z4, 12, ..., 24, z. The equation is called a com- 
pletely integrable or exact differential equation when there exists a unique relation between 21, 12, .. . , tq, 
z with one arbitrary constant, which leads to equation (9.81a). Then there exists a unique solution 
z = 2(%1,22,...,%n) of (9.81a), which has a given value zo for the initial values 219,..., m4? of the 
independent variables. Therefore, for n — 2, 71 = 2, x9 — y a unique integral surface passes through 
every point of space. 

(n — 1) 
2 


n 
The differential equation (9.81a) is completely integrable if and only if the equalities 


Of ofi ὃν, , Ofk 

Tk EL az Ox; + hi Oz 
in all variables 21, 3, ..., x4, z are identically satisfied. 
If the differential equation is given in symmetric form 

fida o fada, — 0, (9.81c) 
then the condition for complete integrability is 

Of, — OF; Of;  Ofx Of; Of, 
fi E 8s) E (54 η + te (2 25) =0 ἘΠῚ 


for all possible combinations of the indices i, j, k. If the equation is completely integrable, then the 
solution of the differential equation (9.81a) can be reduced to the solution of an ordinary differential 
equation with n — 1 parameters. 


(i,k =1,...,n) (9.81b) 





9.2.2 Linear Second-Order Partial Differential Equations 


9.2.2.1 Classification and Properties of Second-Order Differential 
Equations with Two Independent Variables 


1. General Form 
of a linear second-order partial differential equation with two independent variables x,y and an un- 
known function u is an equation in the form 
Oru Ou Ou ðu du 
Α- -2Β +C +a—+b F cu ; 9.82a 
Ox? | OxOy Oy? Ox Oy | f ( ) 
where the coefficients A, B, C, a, b, c and f on the right-hand side are known functions of x and y. 
The form of the solution of this differential equation depends on the sign of the discriminant 
ô= AC — B? (9.82b) 
in a considered domain. The following cases should be distinguished. 
1.6 < 0: Hyperbolic type. 
2.6 = 0: Parabolic type. 
3.6 > 0: Elliptic type. 
4. 6 changes its sign: Mixed type. 
An important property of the discriminant ô is that its sign is invariant with respect to arbitrary trans- 
formation of the independent variables, e.g., to introduction new coordinates in the x, y plane. There- 
fore, the type of the differential equation is invariant with respect to the choice of the independent 
variables. 
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2. Characteristics 
of linear second-order partial differential equations are the integral curves of the differential equation 
dy B+tv—é 

dx A : 


For the characteristics of the above three types of differential equations the following statements are 
valid: 





Ady? — 2Bdxdy + Cdx? =0 or 





(9.83) 


1. Hyperbolic type: There exist two families of real characteristics. 

2. Parabolic type: There exists only one family of real characteristics. 

3. Elliptic type: There exists no real characteristic. 

4. A differential equation obtained by coordinate transformation from (9.82a) has the same character- 
istics as (9.82a). 

5. If a family of characteristics coincides with a family of coordinate lines, then the term with the 
second derivative of the unknown function with respect to the corresponding independent variable is 
missing in (9.82a). In the case of a parabolic differential equation, the mixed derivative term is also 
missing. 

3. Normal Form or Canonical Form 

One has the following possibilities to transform (9.82a) into the normal form of linear second-order 
partial differential equations. 


1. Transformation into Normal Form: The differential equation (9.82a) can be transformed into 
normal form by introducing the new independent variables 





€=(z,y) and n= Ņ(z,y), (9.84a) 
which according to the sign of the discriminant (9.82b) belongs to one of the three considered types: 

9? 9? 

"a - T +---=0, <0, hyperbolic type; (9.84b) 
P^ =0, ó-0 arabolic type; (9.84c) 
op =0, 6=0, parabolic type; 

u Pu ein 

ag + δηρ +---=0, 020, elliptic type. (9.84d) 


The terms not containing second-order partial derivatives of the unknown function are denoted by dots. 
2. Reduction of a Hyperbolic Type Equation to Canonical Form (9.84b): If, in the hyper- 
bolic case, one chooses two families of characteristics as the coordinate lines of the new coordinate 
system (9.84a), i.e., if substituting & = y(#,y),m = w(x,y), where y(x,y) = constant, y(x, y) = 
constant are the equations of the characteristics, then (9.82a) becomes the form 
o?u " 

060m 
This form is also called the canonical form of a hyperbolic type differential equation. From here one gets 
the canonical form (9.84b) by the substitution 

€=&+m, nu-&-m. (9.84f) 
3. Reduction of a Parabolic Type Equation to Canonical Form (9.84c): The only family of 
characteristics given in this case is selected for the family € = const, where an arbitrary function of x 
and y can be chosen for 7, which must not be dependent on £. 
4. Reduction of an Elliptic Type Equation to Canonical Form (9.84d): If the coefficients 
A(x,y), B(x, y), C(x, y) are analytic functions (see 14.1.2.1, p. 732) in the elliptic case, then the char- 
acteristics define two complex conjugate families of curves y(x, y) = constant, y(x, y) = constant. By 
substituting € = p+ w and n = i(y — w), the equation becomes of the form (9.84d). 





-=0. (9.84e) 
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4. Generalized Form 
Every statement for the classification and reduction to canonical forms remains valid for equations 
given in a more general form 


Ou Ou Ou Ou ð 
A(x, yas t 2BG. Wana, By set F(a Yth S> Ὃν) =o, (9.85) 


where F is a non-linear function the unknown function uw and its first-order partial derivatives 0u/Ox 
and Qu/Oy, in contrast to (9.82a). 


9.2.2.2 Classification and Properties of Linear Second-Order Differential 
Equations with More than Two Independent Variables 


1. General Form 
A differential M of this kind for u — u(x1,x2,...,c,) has the form 


Σ «ρος ma “=, (9.86) 


where aj, are given functions of the independent variables and the dots in (9.86) mean terms not con- 
taining second-order derivatives of the unknown function. 

In general, the differential equation (9.86) cannot be reduced to a simple canonical form by transforming 
the independent variables. However, there is an important classification, similar to the one introduced 
above in 9.2.2.1, p. 576 (see [9.5]). 

2. Linear Second-Order Partial Differential Equations with Constant Coefficients 
If all coefficients a;; in (9.86) are constants, then the equation can be reduced by a linear homogeneous 
transformation of the independent variables into a simpler canonical form 


Ou 
ο - (9.87) 


where the coefficients «; are +1 or 0. Several characteristic cases have to be distinguished. 

1. Elliptic Differential Equation If all coefficients x; are different from zero, and they have the same 
sign, then it is the case of an elliptic differential equation. 

2. Hyperbolic and Ultra-Hyperbolic Differential Equation If all coefficients «; are different 
from zero, but one has a sign different from the other’s, then it is the case of a hyperbolic differential 
equation. If both types of signs occur at least twice, then it is an ultra-hyperbolic differential equation. 
3. Parabolic Differential Equation If one of the coefficients &; is equal to zero, the others are dif- 
ferent from zero and they have the same sign, then it is the case of a parabolic differential equation. 

4. Simple Case for Elliptic and Hyperbolic Differential Equations If not only the coefficients 
of the second order derivatives of the unknown function are constants, but also those of the first order 
derivatives, then it is possible to eliminate the terms of the first order derivatives, for which &; Z 0, by 
substitution. For this purpose is 


u = v exp ( a za) À (9.88) 





substituted where b; is the coefficient of B in (9.87) and the summation is performed for all «; 4 0. 
Tk 
In this way, every elliptic and hyperbolic differential equation with constant coefficients can be reduced 
to a simple form: 
2 
a) Elliptic Case: Av -- kv — g. (9.89) b) Hyperbolic Case: um — Av -- kv — g. (9.90) 
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Here A denotes the Laplace operator (see 13.2.6.5, p. 716). 


9.2.2.3 Integration Methods for Linear Second-Order Partial Differential 
Equations 


1. Method of Separation of Variables 

Certain solutions of several differential equations of physics can be determined by special substitutions, 
and although these are not general solutions, one gets a family of solutions depending on arbitrary 
parameters. Linear differential equations, especially those of second order, can often be solved if looking 
for a solution in the form of a product 


u(zi, ..., 25) — qa(z1)ga(z2) . . . n (2). (9.91) 
Next, one tries to separate the functions «y (c), i.e., for each of them one wants to determine an or- 
dinary differential equation containing only one variable rj. This separation of variables is successful 
in many cases when the trial solution in the form of a product (9.91) is substituted into the given 
differential equation. In order to guarantee that the solution of the original equation satisfies the re- 
quired homogeneous boundary conditions, it may appear to be sufficient that some of functions q (11), 
ya(X2),---, Pn(Ln) satisfy certain boundary conditions. 

By means of summation, differentiation and integration, new solutions can be acquired from the ob- 
tained ones; the parameters should be chosen so that the remaining boundary and initial conditions 
are satisfied (see examples). 

Finally, don’t forget that the solutions obtained in this way, often infinite series and improper inte- 
grals, are only formal solutions. That is, one has to check whether the solution makes a physical sense, 
e.g., whether it is convergent, satisfies the original differential equation and the boundary conditions, 
whether it is differentiable termwise and whether the limit at the boundary exists. 

The infinite series and improper integrals in the examples of this paragraph are convergent if the func- 
tions defining the boundary conditions satisfy the required conditions, e.g., the continuity assumption 
for the second derivatives in the first and the second examples. 


ΒΑ: Equation of the Vibrating String is a linear second-order partial differential equation o: 
hyperbolic type 


Pu Pu 

JE Bag (9.92a 
It describes the vibration of a spanned string. The boundary and the initial conditions are: 

ὃ 
u =f), “| = (x), wle-o =0, ule =0. (9.92b 
t=0 Ot t=0 

Seeking a solution in the form 

u= X(x)T(t), (9.92c 
and after substituting it into the given equation (9.92a) follows 

qu WM 

ον ια 9.924 

aT X ( 


The variables are separated, the right side depends on only x and the left side depends on only t, so 
each of them is a constant quantity. The constant must be negative, otherwise the boundary conditions 
cannot be satisfied, i.e., non-negative values give the trivial solution u(x,t) = 0. This negative constan 
is denoted by —\?. The result is an ordinary linear second-order differential equation with constam 
coefficients for both variables. For the general solution see 9.1.2.4, p. 555. The results are the linear 
differential equations 


X" +X =0, (9.929) and T” +a@?T =0. (9.92f 





From the boundary conditions follows X(0) — X(/) — 0. Hence X(xz) is an eigenfunction of the 
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Sturm-Liouville boundary value problem and X is the corresponding eigenvalue (see 9.1.3.1, 3., p. 569). 
Solving the differential equation (9.92e) for X with the corresponding boundary conditions one gets 


X(r)- Csinàr with sinM=0, ie.,with A= > ο (9.92g) 
Solving equation (9.92f) for T yields a particular solution of the original differential equation (9.92a) 
for every eigenvalue Apn: 

















us (zm, t) (ω cos TM H bn sin T) sin ται (9.921) 
Requiring that for t = 0, 
ul = >> u,(#,0) is equal to f(x), and (9.921) 
t=0 n=1 
δι o^ Otis 
or B B 2. 2 (x,0) is equal to y(x), (9.925) 
one gets with a Fourier series expansion in sines (see 7.4.1.1, 1., p. 474) 
2p um 2 p y 
ün = > [ f (x) sin a dx, bn = — | «(x)sin D ds. (9.92k) 
l Jo l nam Jo l 


W B: Equation of Longitudinal Vibration of a Bar is a linear second-order partial differential 
equation of hyperbolic type, which describes the longitudinal vibration of a bar with one end free and a 
constant force p affecting the fixed end. Here is to solve the same differential equation as inl A (p. 579), 
i.e., 


Pu Pu 





























JE ^ Bg (9.93a 
with the same initial but different boundary conditions: 
Ou Ou 
u = f(x) -- = (zx), 9.93b — =0 (free end), 9.93c 
Lí PUR. up. cp. (093b) δε] ο ( ) ( 
δι 
Du = kp. (9.934 
Ox |, 
The conditions (9.93c,d) can be replaced by the homogeneous conditions 
Oz Oz 
E = = 9.93 
δα α--θ Ox α--ὶ ( e 
where instead of u is introduced a new unknown function 
Lg ERE (9.93f 
z—u op Ἷ 
The differential equation becomes inhomogeneous: 
Oz Oz ak 
=a? =, (9.93g 





=a 
Ot? Ox? l 
Looking for the solution in the form z = v + w, where v satisfies the homogeneous differential equation 
with the initial and boundary conditions for z, i.e., 
kpa? Oz 
Ζ = f(x) - 


— = T 931 
E ον δις φ(α), (9.951) 
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and w satisfies the inhomogeneous differential equation with zero initial and boundary conditions. This 


2,42 
ca pt 
gives w — m Substituting the product form of the unknown function v(x, t) into the differentia 





equation (9.93a) 





v — X(x)T(t) (9.93i 
gives the separated ordinary differential equations as in B A. (p. 579) 

X" que 

== ==), 9.93] 

X dT MM 


Integrating the differential equation for X with the boundary conditions X'(0) — X'(I) — 0 one finds 
the eigenfunctions 


Xn = cos ΙΙ (9.93k 
and the corresponding eigenvalues 
nen 
An? = πα (n = 0,1,2,...). (9.931 


Proceeding as in Ml A (p. 579) one finally obtains 














u = 





ka?pt? kpr? annt b, . annt nur 
οἱ 9l +a qe i ot + x Qn COS i + "LL cos τ (9.93m 


where an and bn (n — 0,1,2,...) are the ο αμήν of the Fourier series expansion in cosines of the 


functions f(a) — 





l 
and —ọ(x) in the interval (0,1) (see 7.4.1.1, 1., p. 474). 
απ 


W C: Equation of a Vibrating Round Membrane fixed along the boundary: 
The differential equation is linear, partial and it is of hyperbolic type. It has the form in Cartesian and 
in polar coordinates (see 3.5.3.2,3., p. 211) 

Ou u 18u Ou lou 18u 18u 

aa ta στ 94a —t+-—4+555=s555: 94: 
Ox? T Oy αἱ δι πο Op? T p Op T pog! a ot (9:946) 


The initial and boundary conditions are 
Ou 
ulizo — f(p. €). (9.94c) s. F(p,q), (9.94d) u|p-g — 0. (9.946) 
t=0 


The substitution of the product form 
u=U(p)®(p)T(t) (9.948) 
with three variables into the differential equation in polar coordinates yields 
U" U' ^s 1 η 
EU. Mu ed comer E CMM 9.94g 
7 pU ut aT (9.948) 
Three ordinary πι aun equations are obtained for the separated variables analogously to examples 





A (p. 579) and B (p. 580): 
2ΤΤΗ 1 a 
T" 4 a?X?T — 0, (9.94h) P +2 = T =P, (9.94i) 
Q" + °h=0. (9.945) 





From the conditions &(0) = &(27), 4 (0) — 9" (27) it follows that: 
$(o) — a, cosmp t b, sinnp, v? —n? (n=0,1,2,...). (9.94k) 
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2 
U and A will be determined from the equations [pU"]|' — ce —\*pU and U(R) = 0. Considering 
p 


the obvious condition of boundedness of U (p) at p — 0 and substituting Ap — z gives 


2U' E zU' (22 -n.)U —0, ie, U(p)— Ja(z) — Ja (ως) (9.941) 
where J, are the Bessel functions (see 9.1.2.6, 2., p. 562) with \ = s and J,(u) — 0. The system of 
functions 

Une(p) = Jn (n) (k—1,2,...) (9.94m) 


with jin, as the k-th positive root of the function J,(z) is a complete system of eigenfunctions of the 
self-adjoint Sturm-Liouville problem which are orthogonal with the weight function p. 
The solution of the problem can have the form of a double series: 











co co - t 
U-MM [eus cos ny + bng sin ny) cos ΞΡ 
n=0 k=1 R 
nkt 
+(Cnz cos NY 4 da sin ng) sin ΠΠ di (ns) s (9.94n 
From the initial conditions at t — 0 one obtains 
F(p. v) — Y (an cosny + bpp sin ny) Jn (na) ; (9.940 
n=0 k=1 
F(p,g) 3; SPUR Y (cay cos nq 4- dy, sin ny) In (int 5): (9.94p 
noki È R 
where 
——— κι Jn ( tine) pd 9.94 
Onk = ΤΈΣΣ {πὴ / ef F(p, 9) cosngpdn (ως) p dp, (9.94q 
b τ TET inngJs (uo) od 4 
nk — us us ef f(o, e) sinneJ, (κυρ) pap. (9.941 


In the case of n — 0, the numerator 2 should be changed to 1. To determine the coefficients c;;; and d; 

the function f(p, ϱ) is replaced by F(p, p) in the formulas for anx and bng and finely it is multiplied by 
R 

Oink 

E D: Dirichlet Problem (see 13.5.1, p. 729) for the rectangle 0 € x € a, 0 € y € b (Fig. 9.17): 





y Find a function u(x, y) satisfying the elliptic type Laplace differen- 
tial equation 
b Au =0 (9.95a) 


and the boundary conditions 
u(0,y) = pı(y), ula, y) = vx); 


0 a x u(x,0) = v1(x), u(z, b) — v»(x). (9.95b) 
First there is to determine a particular solution for the boundary 
Figure 9.17 conditions qi1(y) — q»(y) — 0. Substituting the product form 


u = X(x)Y (y) (9.95c) 
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into (9.95a) gives the separated differential equations 





xe. ym 
Eee c =)? (9.95d) 
with the eigenvalue A analogously to examples A (p. 579) through C (p. 581). Since X(0) = X(a) = 0, 
X=Csinid\r, A= E =Aq (m-ti2..) (9.95e) 
In the second step the Αλλα] solution of the differential equation is obtained: 
γη ΓΤ 0 (9.95f) in the form Y — a, sinh To — y) + bn sinh Ty, (9.95g) 
a 


From these equations one gets a particular solution of (9.95a) satisfying the boundary conditions u(0, y) = 
u(a, y) — 0, which has the form 


Un = [en sinh er t — y) +6, sinh e sin Zr. (9.95h) 
a a a 
In the third step one considers the general solution as a series 
οο 
u= Y un, (9.951) 
n=1 


so from the boundary conditions for y = 0 and y = b 


u=). (an sinh PT — y) + bn sinh y) sin Zr (9.951) | 
aaa a a a 
follows with the coefficients 
= can q(x) sin Tr dz, b,— — I V» (a z)sin* € da. (9.95k) 
a sinh — a sinh — 


The problem with the boundary conditions q(x) — vo(x) — S can be solved in a similar manner, and 
taking the series (9.95j) one gets the general solution of (9.95a) and (9.95b). 


W E: Heat Conduction Equation Heat conduction in a homogeneous bar with one end at infin- 
ity and the other end kept at a constant temperature is described by the linear second-order partia 
differential equation of parabolic type 


Ou Pu 

ae ai (9.96a 
which satisfies the initial and boundary conditions 

ultio — f(x), ulļz=0 = 0 (9.96b 


in the domain 0 € x < +00, t > 0. It is also to be supposed that the temperature tends to zero a 
infinity. Substituting 





u = X(x)T(t) (9.96c 
into (9.96a) one obtains the ordinary differential equations 
gu Au 
— =^ =) .96d 
η X ; (9.96 


whose parameter À is introduced analogously to the previous examples A (p. 579) through D (p. 582). 
One gets 


T(t) = Ce” (9.966) 


584 9. Differential Equations 





as a solution for T(t). Using the boundary condition X(0) — 0, gives 


X(x) = C sin àx (9.96f) andso uy =Cye*"* sin Az, (9.96g) 
where A is an arbitrary real number. The solution can be obtained in the form 
u(x,t) = L C(A)e "t sin Ax dA. (9.96h) 
0 
From the initial condition u|;zo = f(x) follows (9.961) with (9.967) for the constant (see 7.4.1.1, 1., 
f(a) = 1. C(A)sin Ar dd, (996) | CQ)- “f° f(s)sinAsds. (9.96j) 
p. 474). 9 
Combining (9.96) and (9.96h) gives 
2 oo oo 
u(a,t) = — [ f(s) fi e 77€ sin As sin Ar aa) ds (9.96k) 
T Jo Jo 


or after replacing the product of the two sines with one half of the difference of two cosines ((2.122), 
p. 83) and using formula (21.27), in Table 21.8.2, p. 1100, it follows that 


u(x,t) = i: fo le ( “Ὁ ) σφ m J ds. (9.961) 


2. Riemann Method for Solving Cauchy's Problem for the Hyperbolic 
Differential Equation 
Oru ðu ðu 


andy ur + z +cu=F (9.97a) 








1. Riemann Function isa function v(z, y;€,7), where € and rj are considered as parameters, sat- 
isfying the homogeneous equation 
@v alw)  O(bv 
Ὃν BU. OU Ἔσυξθ (9.97b) 
OxdOy Ox Oy 


which is the adjoint of (9.97a) and the conditions 


v(x, n; £, n) = exp j vena) , v(6,yi£,m) = exp (/ a(£, s) a) : (9.97c) 
£ 


η 
In general, linear second-order differential equations and their adjoint differential equations have the 
form 


aN 
. (aiv) O(bv) | .- 
> ag on + ; ho te f (9.97d) and Y nd 2 On, Fcv —0. (9.97e) 








2. Riemann Formula is the integral formula which is used to 
determine function u(€, 7) satisfying the given differential equation 
(9.97a) and taking the previously given values along the previously 
given curve I’ (Fig. 9.18) together with its derivative in the direc- 
tion of the curve normal (see 3.6.1.2, 2., p. 244): 


1 1 Ou Ov 
u(n) = σ(αυ)ρ + σ(αυ)ο z [w +3 seg- v.) ‘ai 
Figure 9.18 B 
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E few Je > (s = "5 dy um Fu dx dy. (9.97£) 


The smooth curve I" (Fig. 9.18) must not have tangents parallel to the coordinate axes, i.e., the curve 
must not be tangent to the characteristics. The line integral in this formula can be calculated, since the 
values of both partial derivatives can be determined from the function values and from its derivatives 
in a non-tangential direction along the curve arc. 


: xs ' Qu 
In the Cauchy problem, the values of the partial derivatives of the unknown function, e.g., δη are often 
y 


given instead of the normal derivative along the curve. Then another form of the Riemann formula is 
used: 


Qv Ou 
u(£,n) = (uv)p Í (w a) dx (ων | "a dy + "i Fu da dy. (9.97g) 
2 PMQ 








BW Telegraph Equation (Telegrapher's Equation) isa linear second-order partial differential equation 
of hyperbolic type 

Pu Ou Pu 

a + 2b— + cu = — 9.98a 

ot? ot T Ox? ( ) 
where a > 0, b, and c are constants. The equation describes the current flow in wires. It is a general- 
ization of the differential equation of a vibrating string. 
Replacing the unknown function u(x,t) by u = z exp(—(b/a)t), so (9.98a) is reduced to the form 


9 2. Oa 
oe ee tn (ve = τ n= E e) ] (9.98b) 





Oe Ox? a a2 


Replacing the independent variables by 


E= 2 (mt +2), n= 
m 


n 
—(mt — x) (9.98c) 
m 

finally one gets the canonical form 


O?z z 
---ο 9.984 
ὄέδη 4 (9.98) 
of a hyperbolic type linear partial differential equation (see 9.2.2.1, 1., p. 577). The Riemann function 
v(£, rj; £o, no) should satisfy this equation with unit value at £ — £o and rj — rg. Choosing the form 





w — (£ — &)(n — no) (9.98e) 
for w in v = f(w), then f(w) is a solution of the differential equation 
df df 1 


uoc lE e im 0 (9.98f) 


with initial condition f(0) = 1. The substitution w = a? reduces this differential equation to Bessel’s 
differential equation of order zero (see 9.1.2.6, 2., p. 562) 


Φ} id , (9.980) 


do? ada 
hence the solution is 


v- [E - $0 - m). (9.981) 
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A solution of the original differential equation (9.98a) satisfying the boundary conditions 


Oz 
2 = f(x), Ot 


t=0 


= g(x) (9.981) 


1-0 
can be obtained substituting the found value of v into the Riemann formula and then returning to the 
original variables: 





z(a t) = Hre — mt) + f(x + mt)] 


qe Io (= mt? — (s — 2) ntl, (= mt? — (s — =?) 
+5 g(s)—™ f(s) 
2 m mt? — (s — «)? | 





ds. (9.981) 


a—mt 


3. Green's Method of Solving the Boundary Value Problem for Elliptic 
Differential Equations with Two Independent Variables 
This method is very similar to the Riemann method of solving the Cauchy problem for hyperbolic dif- 
ferential equations. 
If one wants to find a function u(x, y) satisfying the elliptic type of linear second-order partial differen- 
tial equation 
Ou Ou Ou ðu 
uoa bd a. b cus 9.99a 
Or? Oy? Ox Oy f ( ) 
in a given domain and taking the prescribed values on its boundary, first the Green function G(x, y, €, n) 
has to be determined for this domain, where £ and r are regarded as parameters. The Green function 
must satisfy the following conditions: 


1. The function G(r, y; £, 7) satisfies the homogeneous adjoint differential equation 
2G , PG AaG) AbG) 
Ox? * Ay? Ox dy — 
everywhere in the given domain except at the point x = £, y — 7. 
2. The function G(x, y; £, n) has the form 





cG=0 (9.99b) 


1 
Um=+V (9.996) with r= /(2—€2+4(y—n)?, (9.994) 


where U has unit value at the point x = £, y — r and U and V are continuous functions in the entire 
domain together with their second derivatives. 

3. The function G(z, y; €, n) is equal to zero on the boundary of the given domain. 

The second step is to give the solution of the boundary value problem with the Green function by the 
formula 


z mfe (x, y; £n) ds — m f(x, y) Gr, yi 6,0) dz dy, (9.990) 


where D is the mu deed domain, S is its boundary on which the function is assumed to be known and 


denotes the normal derivative directed toward the interior of D. 


ðn 
Condition 3 depends on the formulation of the problem. For instance, if instead of the function values 
the values of the derivative of the unknown function are given in the direction normal to the boundary 
of the domain, then in 3 the condition 


m (acosa + bcos 8)G = 0 (9.998 
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holds on the boundary. «o and f denote here the angles between the interior normal to the boundary of 
the domain and the coordinate axes. In this case, the solution is given by the formula 


1 [Ou 1 
u(n) = πὶ anc ee ii fG dz dy. (9.99g) 
5 D 


4. Green's Method for the Solution of Boundary Value Problems with Three 
Independent Variables 
The solution of the differential equation 
ðu δι Ou 


BUR ET e re u=f (9.100a 


should take the given values on the boundary of the considered domain. As the first step, one constructs 
again the Green function, but now it depends on three parameters €, 7, and ¢. The adjoint differentia 
equation satisfied by the Green function has the form 
O(aG) bG) (cG) 
Ox Oy Oz 


As in condition 2, the function G(x, y, 2; £, 7, C) has the form 





AG +eG= (9.100b 


U- +V (9.100c) wih r-—4(r—£-c-(y—m?-(z—60y. (9.100d 


The solution of the P is: 


u(&,n,¢) = i [| m n a-z fG dz dy dz. (9.100e 


Both methods, Riemann’s and Green’s, have the common idea first to determine a special solution of the 
differential equation, which can then be used to obtain a solution with arbitrary boundary conditions. 
An essential difference between the Riemann and the Green function is that the first one depends only 
on the form of the left-hand side of the differential equation, while the second one depends also on the 
considered domain. Finding the Green function is, in practice, an extremely difficult problem, even if 
it is known to exist; therefore, Green’s method is used mostly in theoretical research. 





W A: Construction of the Green function for the Dirichlet problem 
of the Laplace differential equation (see 13.5.1, p. 729) 

Au — 0 (9.1012) 
for the case, when the considered domain is a circle (Fig. 9.19). 
The Green function is 


G(x, y3€,7) = ii + In 2 (9.101b) 
F 


RO 
where r = MP, p = OM, 1, = M;P and R is the radius of the 
considered circle (Fig. 9.19). The points M and M; are symmet- 
ric with respect to the circle, i.e., both points are on the same ray 
starting from the center and 

OM -OM, = R?. (9.101c) 
The formula (9.99e) for a solution of Dirichlet’s problem, after sub- 
stituting the normal derivative of the Green function and after cer- 
Figure 9.19 tain calculations, yields the so-called Poisson integral 








1 2n R2 = p? 
2m Jo R* + p? — 2Rpcos(o — v) 





ul, n) = u(y) dy. (9.101d) 
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The notation is the same as above. The known values of u are given on the boundary of the circle by 
u(q). For the coordinates of the point M(£, rj) follows £ — pcos v», r — psin v. 


W B: Construction of the Green function for the Dirichlet problem of the Laplace differential equation 
(see 13.5.1, p. 729) 


Au — 0, (9.102a 
for the case when the considered domain is a sphere with radius R. The Green function now has the 
form 


1 R 
G(x,y,z;£, m, Q) 2  — — (9.102b 

T np! 
with p = VE + n? + Ẹ as the distance of the point (£, n, Ç) from the center, r as the distance between 
the points (x,y,z) and (£, n, Ç), and rı as the distance of the point (x, y, z) from the symmetric poin 


RS E RG 


of (€, 7, ¢) according to (9.101c), i.e., from the point (= =) . In this case, the Poisson integra 
Dp'p p 


has the form (with the same notation as in Bl A (p. 587)) 


u(£,n,C) = wha E ds. (9.102c 


5. Operational ο) 

Operational methods can be used not only to solve ordinary differential equations but also for partia 
differential equations (see 15.1.6, p. 769). They are based on transition from the unknown function 
to its transform (see 15.1, p. 767). In this process the unknown function is regarded as a function o 
only one variable and the transformation is performed with respect to this variable. The remaining 
variables are considered as parameters. The differential equation to determine the transform of the 
unknown function contains one less independent variable than the original equation. In particular, i 
the original equation is a partial differential equation of two independent variables, then one obtains 
an ordinary differential equation for the transform. If the transform of the unknown function can be 
found from the obtained equation, then the original function is obtained either from the formula for 
the inverse function or from the table of transforms. 


6. Approximation Methods 

Inorder to solve practical problems with partial differential equations, different approximation methods 
are used. They can be divided into analytical and numerical methods. 

1. Analytical Methods make possible the determination of approximate analytical expressions for 
the unknown function. 

2. Numerical Methods result in approximate values of the unknown function for certain values of 
the independent variables. Here the following methods (see 19.5, p. 976) are used: 

a) Finite Difference Method, or Lattice-Point Method: The derivatives are replaced by divided dif- 
ferences, so the differential equation including the initial and boundary conditions becomes an algebraic 
equation system. A linear differential equation with linear initial and boundary conditions becomes a 
linear equation system. 

b) Finite Element Method, or briefly FEM (sce 19.5.3, p. 978), for boundary value problems: Here 
a variational problem is assigned to the boundary value problem. The approximation of the unknown 
function is performed by a spline approach, whose coefficients should be chosen to get the best possible 
solution. The domain of the boundary value problem is decomposed into regular sub-domains. The 
coefficients are determined by solving an extreme value problem. 

c) Integral Equation Method (along a Closed Curve) for special boundary problems: The bound- 
ary value problem is formulated as an equivalent integral equation problem along the boundary of the 
domain of the boundary value problem. To do this, one applies the theorems of vector analysis (see 
13.3.3, p. 724, and followings), e.g., Green formulas. The remaining integrals along the closed curve 
are to be determined numerically by a suitable quadrature formula. 
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3. Physical Solutions of differential equations can be given by experimental methods. This is based 
on the fact that various physical phenomena can be described by the same differential equation. To 
solve a given equation, first a model is constructed by which the given problem is simulated and the 
values of the unknown function are obtained directly from this model. Since such models are often 
known and can be constructed by varying the parameters in a wide range, the differential equation can 
also be applied in a wide domain of the variables. 


9.2.3 Some further Partial Differential Equations from Natural 
Sciences and Engineering 


9.2.3.1 Formulation of the Problem and the Boundary Conditions 


1. Problem Formulation 


The modeling and the mathematical treatment of different physical phenomena in classical theoretical 
physics, especially in modeling media considered structureless or continuously changing, such as gases, 
fluids, solids, the fields of classical physics, leads to the introduction of partial differential equations. 
Examples are the wave (see 9.2.3.2, p. 590) and the heat equations (see 9.2.3.3, p. 591). Many problems 
in non-classical theoretical physics are also governed by partial differential equations. An important 
area is quantum mechanics, which is based on the recognition that media and fields are discontinu- 
ous. The most famous relation is the Schroedinger equation. Linear second-order partial differential 
equations occur most frequently and they have special importance in today's natural sciences. 


2. Initial and Boundary Conditions 

The solution of the problems of physics, engineering, and the natural sciences must usually fulfill two 
basic requirements: 

1. The solution must satisfy not only the differential equation, but also certain initial and/or boundary 
conditions. There are problems with only initial condition or only with boundary conditions or with 
both. All the conditions together must determine the unique solution of the differential equation. 


2. The solution must be stable with respect to small changes in the initial and boundary conditions, 
ie., its change should be arbitrarily small if the perturbations of these conditions are small enough. 
Then a correct problem formulation is given. 

One can assume that the mathematical model of the given problem to describe the real situation is 
adequate only in cases when these conditions are fulfilled. 

For instance, the Cauchy problem (see 9.2.1.1, 5., p. 572) is correctly defined with a differential equation 
of hyperbolic type for investigating vibration processes in continuous media. This means that the values 
of the required function, and the values of its derivatives in a non-tangential (mostly in a normal) 
direction are given on an initial manifold, i.e., on a curve or on a surface. 

In the case of differential equations of elliptic type, which occur in investigations of steady state and 
equilibrium problems in continuous media, the formulation of the boundary value problem is correct. If 
the considered domain is unbounded, then the unknown function must satisfy certain given properties 
with unlimited increase of the independent variables. 


3. Inhomogeneous Conditions and Inhomogeneous Differential Equations 

The solution of homogeneous or inhomogeneous linear partial differential equations with inhomoge- 
neous initial or boundary conditions can be reduced to the solution of an equation which differs from 
the original one only by a free term not containing the unknown function, and which has homogeneous 
conditions. It is sufficient to replace the original function by its difference from an arbitrary twice dif- 
ferentiable function satisfying the given inhomogeneous conditions. 

In general, one uses the fact that the solution of a linear inhomogeneous partial differential equation 
with given inhomogeneous initial or boundary conditions is the sum of the solutions of the same dif- 
ferential equation with zero conditions and the solution of the corresponding homogeneous differential 
equation with the given conditions. 
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To reduce the solution of the linear inhomogeneous partial differential equation 


Ou 


oB L[u] — g(x. t) (9.103a) 
with homogeneous initial conditions 
Ou 
EU. = 9.105: 
4 t=0 ' δέκα ( ) 








to the solution of the Cauchy problem for the corresponding homogeneous differential equation, one 
substitutes 
t 








u= ic t; T) dr. (9.103c) 
0 

Here (x,t; T) is the solution of the differential equation 

Ou 

oB Llu] = 0, (9.103d) 
which satisfies the boundary conditions 

δι 
ul =0, =| = g(x, T). (9.103e) 
t=T Ot i-T 

In this equation, x represents symbolically all the n variables x1, x2, ..., x4 Of the n-dimensional prob- 


lem. L[u] denotes a linear differential expression, which may contain the derivative dr but not higher- 
order derivatives with respect to t. 


9.2.3.2. Wave Equation 


The extension of oscillations in a homogeneous media is described by the wave equation 








a 
om — a@Au = Q(z, t), (9.104a) 
whose right-hand side Q(x, t) vanishes when there is no perturbation. The symbol x represents the n 
variables x1, ...,@, of the n-dimensional problem. The Laplace operator A (see also 13.2.6.5, 716,) is 
defined in the following way: 
Pu Ou Ou 
Au = —; ee i 9.104: 
s ðr? + Ox? Tol Ox? ( y 


The solution of the wave equation is the wave function u. The differential equation (9.104a) is of hy- 
perbolic type. 
1. Homogeneous Problem 
The solution of the homogeneous problem with Q(«,t) = 0 and with the initial conditions 
Ou 
u = T = 
φία), 5 


t=0 


= (a) (9.105) 


t=0 





is given for the cases n = 1, 2,3 by the following integrals. 
Case n = 3 (Kirchhoff Formula): 


1 to PN ;02, 
u(21, 22, 22,1) — 15 / ΝΤ do 4 i pu T 88) a. (9.1062) 
(Sat) (Sat) 


where the integration is performed over the spherical surface Sq: given by the equation (ay — £1)? + 
(ag — πο)” + (a3 — 23)? = a?£?. 
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Case n — 2 (Poisson Formula): 


die 1 ff w(a1, a2) dadag 
PEN 2ra B acm 








(o4 — 21) (a3 — x3)? 


,9 / (03, 03) dadas (9.106b) 





242 — (ay — 21)? — (a2 — 22)? 





where the integration is performed along the circle C4, given by the equation (o4 —21)? -- (a9 — z3)? € 
2,2 

at’. 

Case n = 1 (d’Alembert formula): 


(x1 + at) + y(a1 — at) d 3 


u(x,t) = 7 2a pla) da. (9.106c) 


2. Inhomogeneous Problem 


In the case, when Q(z,t) # 0, one has to add to the right-hand sides of (9.106a,b,c) the correcting 
terms: 


Case n = 3 (Retarded Potential): For a domain K given by r < at with 





r = y (&1 — 11)? + (£2 — 12)? + (& — %3)?, the correction term is 


πα H—-— - - a “ξιάξράξα. (9.107a) 





1 d&,d&gd 
Case n = 2: ms JI Θίξι; ἔα, τ) ἀξιάξρώτ ; (9.107b) 





a?(t — 7? — (& — zi)? — (& — x2)? 


where K is a domain of &, 2,7 space defined by the inequalities 0 < 7 < t, (£j — z1)? 4- (£g — 23)? € 


a?(t — r^. 
Case n — 1: on a Q(£&, 7) d&dr, (9.107c) 


where T' is the E 0 <T X< t, |E— z| < alt — T|. a denotes the wave velocity of the perturbation. 


9.2.3.3 Heat Conduction and Diffusion Equation for Homogeneous Media 


1. Three-Dimensional Heat Conduction Equation 
The propagation of heat in a homogeneous medium is described by a linear second-order partial differ- 
ential equation of parabolic type 

ðu 
m a? Au — Q(x, t), (9.1082) 
where A is the three-dimensional Laplace operator defined in three directions of propagation x1, £2, 
13, determined by the position vector r. If the heat flow has neither source nor sink, the right-hand side 
vanishes since Q(x, t) = 0. 
The Cauchy problem can be posed in the following way: It is to determine a bounded solution u(x, t) for 
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t 0, where u[izo — f(x). The requirement of boundedness guarantees the uniqueness of the solution. 
For the homogeneous differential equation with Q(x, t) — 0, one gets the wave function 
+00 +00 +00 


u(x1, x2, 13,0) = mE Lf f fevenes) 


. ( (21 = ay)? + (xo — a2)? + (x3 ES 
exp 





aN dodazd 9.108b) 
dou άαιάαοάαα. (9. D 
In the case of an inhomogeneous differential equation with Q(zr, t) Z 0, one has to add to the right-hand 
side of (9.108b) the following expression: 


(iii. 


oo oo oo 
(x1 — a1)? + (z2 — a)? + (43 — 03)? 
- exp 
Aa?(t — 7) 
The problem of determining u(x,t) for t < 0, if the values u(a,0) are given, cannot be solved in this 
way, since the Cauchy problem is not correctly formulated in this case. 
Since the temperature difference is proportional to the heat, one often introduces u — T'(r, t) (temper- 
ature field) and a? = Dw (heat diffusion constant or thermal conductivity) to get 
OT 
ot 
2. Three-Dimensional Diffusion Equation 
In analogy to the heat equation, the propagation of a concentration C in a homogeneous medium is 
described by the same linear partial differential equation (9.108a) and (9.108d), where Dy is replaced 
by the three-dimensional diffusion coefficient Dc. 'The diffusion equation is: 
oC 
δι DoAC - Qc(r,t). (9.109) 
One gets the solutions by changing the symbols in the wave equations (9.108b) and (9.108c). 


9.2.3.4 Potential Equation 
The linear second-order partial differential equation 

Au = —4np (9.110a) 
is called the potential equation or Poisson differential equation (see 13.5.2, p. 729), which makes the 
determination of the potential u(x) of a scalar field determined by a scalar point function o(x) possible, 
where x has the coordinates ri, ro, r3 and A is the Laplace operator. The solution, the potential 
UM (11, 12, 13) at the point M, is discussed in 13.5.2, p. 729. 
One gets the Laplace differential equation (see 13.5.1, p. 729) for the homogeneous differential equation 
with o z 0: 

Au = 0. (9.110b) 
The differential equations (9.110a) and (9.110b) are of elliptic type. 





) dedos dr. (9.108c) 


— DwAT = Qw (F, t). (9.108d) 


9.2.4 Schroedinger's Equation 
9.2.4.1 Notion of the Schroedinger Equation 


1. Determination and Dependencies 
The solutions of the Schroedinger equation, the wave functions v, describe the properties of a quantum 
mechanical system, i.e., the properties of the states of a particle. The Schroedinger equation is a second- 
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order partial differential equation with the second-order derivatives of the wave function with respect 
to the space coordinates and first-order with respect to the time coordinate: 





Ow n? mM - 

ihar = ama” + U (21, %2,43,t)b = Hw (9.11 1a) 
22 

É p - .. ῃ h 

H=- tU, = Tn = 7V. 111» 
2m MI P ior Y (9 ) 


h 
Here, A is the Laplace operator, 4 = — is the reduced Planck's constant, i is the imaginary unit and 
π 
V is the nabla operator. The relation between the impulse p of a free particle with mass m and wave 
length \ is \ = h/p. 
2. Remarks: 
a) In quantum mechanics, an operator is assigned to every measurable quantity. The operator occur- 
ring in (9.111a) and (9.111b) is called the Hamilton operator H (“Hamiltonian”) . It has the same role 
as the Hamilton function of classical mechanical systems (see, e.g., the example on Two-Body Problem 
on p. 574). It represents the total energy of the system which is divided into kinetic and potential en- 
ergy. The first term in H isthe operator for the kinetic energy, the second one for the potential energy. 
b) The imaginary unit appears explicitly in the Schroedinger equation. Consequently, the wave func- 
tions are complex functions. Both real functions occurring in 0) + iq?) are needed to calculate the 
observable quantities. The square |W|? of the wave function, describing the probability dw of the par- 
ticle being in an arbitrary volume element dV of the observed domain, must satisfy special further 
conditions. 





c) Besides the potential of the interaction, every special solution depends also on the initial and bound- 
ary conditions of the given problem. In general, there is a linear second-order boundary value problem, 
whose solutions have physical meaning only for the eigenvalues. The squares of the absolute value of 
meaningful solutions are everywhere unique and regular, and tend to zero at infinity. 

d) The micro-particles also have wave and particle properties based on the wave-particle duality, so 
the Schroedinger equation is a wave equation (see 9.2.3.2, p. 590) for the De Broglie matter waves. 

e) The restriction to the non-relativistic case means that the velocity v of the particle is very small 
with respect to the velocity of light c (v «& c). 


The application of the Schroedinger equations is discussed in detail in the literature of theoretical 
physics (see, e.g., [9.15], [9.7], [9.10], [22.15]). In this chapter only some most important examples are 
demonstrated. 

9.2.4.2 Time-Dependent Schroedinger Equation 

The time-dependent Schroedinger equation (9.111a) describes the general non-relativistic case of a 
spin-less particle with mass rn in a position-dependent and time-dependent potential field U (2 , x2, v3, t). 
The special conditions, which must be satisfied by the wave function, are: 

a) The function 7) must be bounded and continuous. 

b) The partial derivatives Ov/Ox4, Ov//Ox», and Ov/Oxs must be continuous. 

c) The function |v|? must be integrable, i.e., 


/ lb (x1, 2, 23, t) dV. « oo. (9.112a) 
V 


According to the normalization condition, the probability that the particle is in the considered domain 
must be equal to one. (9.112a) is sufficient to guarantee the condition, since multiplying by an ap- 
propriate constant the value of the integral becomes one. 

A solution of the time-dependent Schroedinger equation has the form 


ψί(αι, £2, £3, t) = Y (x1, £2, £3) exp (-i£t) ] (9.112b) 
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The state of the particle is described by a periodic function of time with angular frequency w — E/h. I 
the energy of the particle has the fixed value E — const, then the probability dw of finding the particle 
in a space element dV is independent of time: 


dw = ||? dV = vw* dV. (9.112c 
Then one speaks about a stationary state of the particle. 
9.2.4.3 Time-Independent Schroedinger Equation 


If the potential U does not depend on time, i.e., U = U (z1, £2, £3), then it is the time-independen 
Schroedinger equation and the wave function V (44, 19, x3) is sufficient to describe the state. Reducing 
it from the time-dependent Schroedinger equation (9.111a) with the solution (9.112b) gives 


2m 





AY + πρ — U)v — 0. (9.113a. 
l 
In this non-relativistic case, the energy of the particle is 
É h 
E-P. (pol,hz-2mh). (9.113b 
2m λ 


The wave functions V satisfying this differential equation are the eigenfunctions; they exist only for 
certain energy values E, which are given for the considered problem of the special boundary conditions. 
The union of the eigenvalues forms the energy spectrum of the particle. If U is a potential of finite depth 
and it tends to zero at infinity, then the negative eigenvalues form a discrete spectrum. 

If the considered domain is the entire space, then it can be required as a boundary condition that V is 
quadratically integrable in the entire space in the Lebesgue sense (see 12.9.3.2, p. 696 and [8.5]). If the 
domain is finite, e.g., a sphere or a cylinder, then one can require, e.g., V — 0 for the boundary as the 
first boundary condition problem. 

This gives the Helmholtz differential equation in the special case of U (x) — 0: 


2mE 

n 
W = 0 is often required here as a boundary condition. (9.114a) represents the initial mathematical 
equation for acoustic oscillation in a finite domain. 


9.2.4.4 Statistical Interpretation of the Wave Function 


'The quantum mechanics postulates that the complete description of a regarded single-particle system 
in the time t is to be performed by the complex wave function (T, t) as a state function and normalized 
solution of the Schroedinger equation. So, the wave function contains all possible experimental infor- 
mation, which can be got by measurements on this system. There exist no hidden sub-structures of the 
theory and no hidden parameters which could eliminate the principal statistical character of quantum 
mechanics, as it contains the connection of state function and y and measurement results. 


1. Observable and Probability Amplitude 


A physical expression (position, momentum, angular momentum, energy), which can be determined 
by a suitable measuring instrument, is called an observable. In quantum mechanics every observable A 





AV + \W = 0 (9.114a) with the eigenvalue AÀ = (9.114b) 


is represented by a linear, hermitian operator A with At = A, which interacts on the wave function. 
At the same time the operator of the quantum mechanics takes over the structure of the classical ex- 
pression. 


W For the operator 1 of the angular momentum, where r is the position operator p the momentum 
operator: 


"S ANON. xo Uus xu d ὃ ὃ 
l= (ls, ll) =r xp ie. b=- a= t ae) (9.115a) 
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s h ὃ ὃ s h ὃ ὃ 
ly = £p, — dp, — z T , b= &py — Ups = x : 9.115b 
5 pe =T ( δα z) ? Pty ( Oy Voz ( ) 
In general, it is not possible to assign a certain numerical value to an observable by determining the 
wave function, but first only as the result of a measurement. The only possible measurement values A 
are the real eigenvalues a; of A; the associated eigenfunctions οι form a complete orthogonal system: 


Ags = ary (k=1,2,...), fff ρῖριὰν -- δι. (9.116) 
ν 


If the system is in an arbitrarily general state 7), the result of a single experiment, i.e. the occurrence 
of a certain measure value a; in a single measurement can not be predicted. If imaging to perform the 
measurement on N — oo identical systems being in the same state w, then among the measurement 
results every possible result a; can be found with a frequency N;. The probability W; to find the value 
à; in a single measurement can be determined: 


: _ Νι 
W: = jim +> Ee (9.117) 


To determine this probability from the wave function v, one performs an expansion of y as a series of 
eigenfunctions gj: 


b= Vag, a= [ff orwav. (9.118) 
i τ γ᾽ 


The coefficient of expansion c; is the probability to find the system v in its characteristic state q;, i.e., 
to obtain the measuring value a;. From the absolute square of c; one gets the probability W; for the 
measuring value a;: 


2 Wis [ff orvav =1. (9.119) 
J} 


Because in every measurement it is sure to find one of the possible measuring values a;, the sum of the 
probabilities W; fulfills the condition of normalization for the wave function v . 
If two states 4, w2 of a physical system are known, then from the linearity of the Schroedinger equation 
follows, that the superposition 

Y = V4 t V» (9.120) 
also represents a possible physical state. This fundamental superposition principle of quantum mechan- 
ics is the reason why at the determination of probabilities with the state function 7, e.g., 

|? = [br + dol? = [di]? ο 4- 2Βο(ψι 43) (9.121) 
besides the single probabilities |44|? , |v»|? occurs an additional term with sign. This explains the sur- 
prising interference effects of quantum mechanics, e.g. (wave-particle duality). 
2. Expectation Value and Uncertainty 


The quantum mechanical expectation value A is defined as the mean value of the measurement results 
obtained from measurements with N — oo identical systems: 


A= lim ΣΑΝ Σαν” [ff vAvar. (9.122) 


W = 








Ci 





Ν-λοο 


The expectation value usually is not identical to a possible measurement result. 


E The calculation of the expectation value T = (T, Y, Z) of a position measurement for a particle in the 
state v(r, t) , e.g. 


z= ff ald(#,t)/2 dV 
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shows, that the wave function v(r,t) is to be interpreted as a probability amplitude. The absolute 
square |v|? then is a probability density. The expression 


dW -— iw(z, t)? dV , faw = ff oz, tav 21 
V 


is to be understood as a probability, to find the particle at the time t in the volume element dV in the 
position T (probability of the position). 
As a measure of the distribution of the measured results for an observable A, given for a general state in 


some measurements, can be defined by the help of the so-called uncertainty A A near the expectation 
value A, which is to be introduced via the standard error: 


1 = B 
(AA)? = Aimy YN (a; — A)? = Σ Wi (a; — A? . (9.123) 


By the help of the wave function v one can determine the uncertainty A A of a observable as expectation 
value of the quadratic deviation from the mean value A: 


(AAy 2(A— A2 2 Z2 42 = | ff (A Ava. (9.124) 
iif 


If the system is in an eigenstate y; of A, then all measurements give the same measuring value a;: 
A= ai, AA=0. (9.125) 
A distribution near the expectation value A does not appear. 


3. Uncertainty Relation 
Considering two observable A , B , whose operators commutate (see Lie brackets in 5.3.6.4, 2., p. 356), 


6 5 [À, B] - ÀB — BÀ — 0, (9.126) 
then (and only then) exists a simultaneous system of eigenfunctions y;, (i, v — 1,2,...) 
Agiy = Qi Pip Byip = by Yin - (9.127) 


In this case exist physical states, in which the expected values of both operators are eigenvalues, so that 
the uncertainties A A, A B simultaneously disappear: 

A =a, B=b,, AA=AB=0. (9.128) 
Performing in this system a measurement of the observable A, which leads with the measured value a; 
to the state p;x, then a following measurement of B gives the measured value b, , without interfering 
the state generated in the first measurement (compatible observable, tolerance measurement). 

For two observable A and B , which are represented by non-commutative operators there does not exist 
a simultaneous system of eigenfunctions. In this case it is impossible to find a physical state, for which 
the uncertainties A A, A B can be simultaneously arbitrarily small. For the product of the uncertainties 


exists a lower bound, defined by the expectation value of the commutator C: 
di 
AAAB » [; [À. B] s (9.129) 
i 


This relation is called the uncertainty relation. The commutation relation (9.130) (see also 5.3.6.4, 
2., p. 356) between the components, e.g., of the position and the momentum operator into the same 
direction 

ħ 


h 
lb. .a] - - (9.130) ΔριΔαΣ». (9.131) 


2 


leads to the Heisenberg uncertainty relation (9.131). In other words: there is a fundamental limitation 
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on how precisely both the position and the momentum of a particle can be simultaneously known. 
9.2.4.5 Force-Free Motion ofa Particle in a Block 


1. Formulation of the Problem 

A particle with a mass m is moving freely in a block with impenetrable walls of edge lengths a, b, c, 
therefore, it is in a potential box which is infinitely high in all three directions because of the impene- 
trability of the walls. That is, the probability of the presence of the particle, and also the wave function 
W, vanishes outside the box. The Schroedinger equation and the boundary conditions for this problem 
are 


Ov PY Qw 2m 2= 0, 2a, 
E F p Ln EE + =o =0, (9.132a) V —0 for | ur d : E b, (9.132b) 


2. Solution Approach 
Separating the variables 

V(r,y, z) — V(x) V,(y) V.(z) (9.133a) 
and substituting into (9.132a) gives 

1 d^, wl dy, IA 2m 

D, dr? ` D, dy W, dz? h? 
Every term on the left-hand side depends only on one independent variable. Their sum can be a constant 
—B for arbitrary x, y, z only if every single term is a constant. In this case the partial differential 
equation is reduced to three ordinary differential equations: 


Py, dy du, 
Uu p --- δε. (9.133c) 


B. (9.133b) 








, 2 
ky Vo, dz 


The relation for the separation constants —k,”, =k —k,? is 
n2 
Καὶ ει +k? =B, (9.133d) consequently E = Sate Tk? ck). (9.133e) 
m 


3. Solutions 
of the three equations (9.133c) are the functions 

W, = Arsin kg £, Wy = Aysink,y, V, — A;sink,z (9.134a) 
with the constants A;, Ay, Az. With these functions W satisfies the boundary conditions V — 0 for 
x = 0, y = 0 and z = 0. 

sin kz a = sin ky b = sin k; c = 0 (9.134b) 
must be valid to satisfy also the relation V — 0 for x = a, y = b and z = ç, i.e., the relations 

















kf qoe. E (9.134c) 
α b c 
must be satisfied, where ng, ny, and n, are integers. 
One gets for the total energy 
hn? 2 EV n, A2 
Pe (=) $ (0) + (=) ο dl, a). (9.1344) 
vi 2m |N à b c 





It follows from this formula that the changes of energy of a particle by interchange with the neigh- 
borhood is not continuous, which is possible only in quantum systems. The numbers nz, ny, and nz, 
belonging to the eigenvalues of the energy, are called the quantum numbers. 
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After calculating the product of constants AyA,A, from the normalization condition 





abe 
(A, AyAz)” nii sin? .. sin? au sin? -- drdydz —1 (9.134e) 
000 


one gets the complete eigenfunctions of the states characterized by the three quantum numbers 


8 πια TN. TN. 
Danin =A sin sU in E sin Z, (9.134f) 
ee abc a b 6 


The eigenfunctions vanish at the walls since one of the three sine functions is equal to zero. This is 
always the case outside the walls if the following relations are valid 

a 2a (n; — 1)a b 2b (n, — 1)b c 26 (n4 — 1)c 

y , n > ; 


S ahora . inj E 


2 . “a L 3 7 
Ne Ne Ne ny Ny Ny Nz Τὶς Nz 























(9.134g) 


So, there are nz — 1 and n, — 1 and n; — 1 planes perpendicular to the x- or y- or z-axis, in which V 
vanishes. These planes are called the nodal planes. 
4. Special Case of a Cube, Degeneracy 
In the special case of a cube with a = b — c, a particle can be in different states which are described 
by different linearly independent eigenfunctions and they have the same energy. This is the case when 
the sum n}? + ny +n,? has the same value in different states. They are called degenerate states, and 
if there are i states with the same energy, they are called i-fold degeneracy. 
The quantum numbers nz, ny and n; can run through all real numbers, except zero. This last case 
would mean that the wave function is identically zero, i.e., the particle does not exist at any place in 
the box. The particle energy must remain finite, even if the temperature reaches absolute zero. This 
zero-point translational energy for a block is 
P /1 1 1 

y= 2m (2 Tg a : pue) 
9.2.4.6 Particle Movement in a Symmetric Central Field (see 13.1.2.2, p. 702) 
1. Formulation of the Problem 
The considered particle moves in a central symmetric potential V (r). This model reproduces the move- 


ment of an electron in the electrostatic field of a positively charged nucleus. Since this is a spherically 
symmetric problem, it is reasonable to use spherical coordinates (Fig. 9.20). The following relations 





hold: 
r= x£? +y? +22, x=rsinvcosy, 
V = arccos Ξ ; y-—rsinÜsin p, (9.135a) 
p= arctan Ž i z=rcosy, 


where r is the absolute value of the radius vector, 2 is the angle be- 
tween the radius vector and the z-axis (polar angle) and q is the 

! angle between the projection of the radius vector onto the x, y plane 
Figure 9.20 and the x-axis (azimuthal angle). For the Laplace operator 











Ow 20V 1 ow cos) OW 1 ow 
d Or? F rör ` r? ðY τοπ ῦ δὺ T r? sin? 0 Oy?’ (2:133b) 


holds, so the time-independent Schroedinger equation is: 


10 / , Əv 1 ὃ. OV 1 Qv 2m 
i H sin ) E jjv — 0. Hi 
r? Or ( or ) 1? sin 0 00 (sn ) + rsn?0 Og? R? l νο] a (9.1350) 
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2. Solution 
Looking for a solution in the form 

V(r,0,q) — Ri(r)Y7" (0, o), (9.1362) 
where fi; is the radial wave function depending only on r, and Y?" (0, p) is the wave function depending 
on both angles. Substituting (9.136a) in (9.135c) gives 


1 9 (295), m 2m x 
Hr ( n tul V(r) RY; 


1 ð oy” to ayr 
sind — } δι} ZŁ Rip. .1361 
{5 sin Ü OU (sn C ) fü r2sin? 0 Oy? πὶ - 
Dividing by R,Y/” and multiplying by r? gives 
1 d f dR\ 2m? i 1 1 8[. 0Y"|, 1 OY" 
R, dr ( dr ) τ n? E Vi Yi" | sin9 O0 EUR οὐ ] su? Og» (3:1360) 


Equation (9.136c) can be satisfied if the expression on the left-hand side depending only on r and 
expression on the right-hand side depending only on J and q are equal to a constant, i.e., both sides 
being independent of each other are equal to the same constant. From the partial differential equation 
two ordinary differential equations follow. If the constant is chosen equal to I(l + 1), then the so-called 
radial equation results depending only on r and the potential V (r): 





























1 d f ,dRı\ 2m τις 
: + E-V( 0. 9.136d 
Rır? dr ( dr ) h? | (r) 2mr? ( 
To find a solution for the angle-dependent part also in the separated form 
γώ, ϱ) -- Θ(9)Φ() (9.136e 
one substitutes (9.1366) into (9.136c) giving 
1 d dO 1 Ë 
in? i | I - 3 
sin | saa (ων αρ) + I( 4 D} σπα (9.136f 
If the separation constant is chosen as m? in a reasonable way, then the so-called polar equation is 
1 d dO m? 
5111 13 -ἰ(ΐ1-1 = 1966 
Osind dd (sa 2) μετ (9:1368 
and the azimuthal equation is 
PP 
— 4 m?$ — Q0. (9.136h 


dy? 
Both equations are potential-independent, so they are valid for every central symmetric potential. 
There are three requirements for (9.136a): It should tend to zero for r + 00, it should be one-valued 
and quadratically integrable on the surface of the sphere. 
3. Solution of the Radial Equation 
Beside the potential V (r) the radial equation (9.136d) also contains the separation constant /(I 4- 1). 
Substituting 

u(r) =r- Ri(r), (9.137a) 
since the square of the function u(r) gives the last required probability |u;(r)|?dr = |R)(r)|?r2dr of 
the presence of the particle in a spherical shell between r and r + dr. The substitution leads to the 
one-dimensional Schroedinger equation 

d'u(r) 2m I(L4- 1)? 


d Tu E — V(r) 





n u(r) = 0. (9.137b) 
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This one contains the effective potential 


Ver = V(r) + Vi), (9.137c) 
which has two parts. The rotation energy 
(L4 1)? 


Vi(1) 2 Vx(I) = (9.137d) 


2mr? 
is called the centrifugal potential. 
The physical meaning of | as the orbital angular momentum follows from analogy with the classical 
rotation energy 

"ἃ 22 
(OG? l l 
20 20  2mr? 


a rotating particle with moment of inertia @ = pr? and orbital angular momentum [— Od: 
T’ =+, [l= An). (9.13τῃ 


4. Solution of the polar equation 

The polar equation (9.136g), containing both separation constants l(1 -- 1) and m?, is a Legendre differ- 
ential equation (9.60a), p. 565. Its solution is denoted by OP (V), and it can be determined by a power 
series expansion. Finite, single-valued and continuous solutions exist only for I(l + 1) = 0,2,6,12,.... 
One gets for | and m: 





1 
Ero = 39 = (9.137e) 


£=10,1,2,. 25, |m| €t. (9.138a) 
So, m can take the (2/ + 1) values 
-l, (2E 1), (-20-2),..., (0-2), (1-1), L (9.138b) 


For m ¥ 0 one gets the corresponding Legendre polynomials, which are defined in the following way: 
(-1)™ m/2d't™ (cos? J — 1)! 
211! (d cos 9) 


As a special case (l = 0, m = n, cos — z) follow the Legendre polynomials of the first kind (9.60c), 
p. 566. Their normalization results in the equation 


mpgs.) (Ze (-—m)b lan sa sepe 
Or (0) = —3 (xm) Pr" (cos 0) — Nj" Fr" (cos 0). (9.138d) 


5. Solution of the Azimuthal Equation 

Since the motion of the particle in the potential field V (r) is independent of the azimuthal angle even 
in the case of the physical assignment of a space direction, e.g., by a magnetic field, the general solution 
P = aci"? - Be 3" * can be specified by fixing 





Pr" (cos) — (1— cos? 9) (9.138c) 
































mly) = AeH, (9.139a) 
because in this case |, |? is independent of y. The requirement for uniqueness is 

Py (y + 2) = On (vy), (9.139b) 
so m can take on only the values 0,+1,+2,.... 
It follows from the normalization 

2π 

2π 

[ oP dp = i= AP f dy = 2n|AP? (9.139c) 

i Jo 
that 

f.i 
mlp) = em (i= Ope E e): (9.139d) 
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'The quantum number m is called the magnetic quantum number. 


6. Complete Solution for the Dependency of the Angles 
In accordance with (9.1366), the solutions for the polar and the azimuthal equations should be multi- 
plied by each other: 
1 : 
Y" (0,9) = OW) Bly) = Jg; NP FP (cos vem. (9.140a) 


The functions Y” (V, p) are the so-called surface spherical harmonics. 


When the radius vector F is reflected with respect to the origin (F — —r), the angle 9 becomes 7 — 3 
and y becomes y + 7, so the sign of Y;” may change: 


Y/"(x — 9,9 +T) — (1) Y" (9, 9). (9.140b) 
Then for the parity of the considered wave function holds 

P — (-1).. (9.141a) 
7. Parity 


The parity property serves the characterization of the behavior of the wave function under space in- 
version Y — —¥ (see 4.3.5.1, 1., p. 287). It is performed by the inversion or parity operator P: 
Pv(r,t) — v(—r,t). Denoting the eigenvalue of the operator by P, then applying P twice it must 
yield P P v(r,t) — P Pv(r,t) = Y(T, t), the original wave function. So: 

P?=1, = +1. (9.141b) 
It is called an even wave function if its sign does not change under space inversion, and it is called an 
odd wave function if its sign changes. 


9.2.4.7 Linear Harmonic Oscillator 





1. Formulation of the Problem 

Harmonic oscillation occurs when the drag forces in the oscillator satisfy Hooke’s law F = —kax. For 
the frequency of the oscillation, for the frequency of the oscillation circuit and for the potential energy 
the following formulas are valid: 





1 /k k 1 w? 
Ξ ολ] -- 42a =4\/— 142 Epa = ska? = a". 142 
Mese ek (9.142a) w mi (9.142b) pot = pha 7” (9.142c 
Substituting into (9.114a), the Schroedinger equation becomes 
PY 2m ub 2 z 
di + P E- ο πια V — Q0. (9.143a 
With the substitutions 
2E 
y = n] 22, (9.143b) ncc (9.143c 
h hw 
where A is a parameter and not the wavelength, (9.143a) can be transformed into the simpler form o 
the Weber differential equation 
Py 
2? (A - yi - 0. (9.143d 
dy? 


2. Solution 
A solution of the Weber differential equation can be got in the form 


V(y) — e"? H(y). (9.1442) 
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Differentiation shows that 
2 2 
ay _ wp dH 2, dH à 








2 t 1)H|. 1441 
dj απο (y  — 1) (9.144b 
Substitution into (9.143d) yields 
PH E 
—- A-1)H =0. 144 
dd Vy τί ) 0 (9.144c 
The determination of a ο is convenient in the form of a series: 
H= Ma with ----- -Xi iaiy "uu M — Lay’? (9.145α 
i=0 dy dy i-2 
Substitution of (9.145a) into ας results in 
i(i — 1)a ο Qiayy’ + Σια =0. (9.145b 
i-2 
Comparing the coefficients of y? leads to the recursion formula 
(j +2) + 1)aj+2 = [2j — (A — 1)ļa; (j =0,1,2,...). (9.145c 


The coefficients a; for even powers of y begin from ag, the coefficients for odd powers begin from ay. 
So, ao and a; can be chosen arbitrarily. 

3. Physical Solutions 

The determination of the probability of the presence of a particle in the different states can be performed 
by a quadratically integrable wave function W(a) and by an eigenfunction which has physical meaning, 
i.e., normalizable and for ay values of y it tends to zero. 

The exponential function exp(—y?/2) in (9.144a) guarantees that the solution W(y) tends to zero for 
y — oo if the function H (y) is A polynomial. To get a polynomial, the coefficients a; in (9.145a), 


starting from a certain n, must vanish for every 7 > n: dn # 0, Qn41 = àn42 — An43 =... = 0. The 
recursion formula (9.145c) with j = n is 
2n — (4—1 
( ) (9.1462) 


Tee Gaines iy A 
Gn+42 = 0 can be satisfied for a, £ 0 only if 
2E 
2n—(A—1)20, A= =2 1. 9.140] 
n-(-1-20 A-g η (9.1460) 
The coefficients @n42, Gn44,... Vanish for this choice of A. Also a4, 1 — 0 must hold to make the coeffi- 
cients à541, 0543, ... equal to zero. 
One gets the Hermite polynomials from the second defining equation (see 9.1.2.6, 6., p. 568) for the 
special choice of a, — 2", a,  — 0. The first six of them are: 


Ho(y) — 1, Hs(y) = —12y + 8y", 
Hy(y) = 2y, Ha(y) = 12 — 48y? -- 16y*, (9.146c) 
Ha(y) — —2 4 Ay?, Hs(y) = 120y — 160y? + 32y°. 
The solution W(y) for the vibration quantum number n is 
V, = Npe”? Haly), (9.147a) 


where Npn is the normalizing factor. One gets it from the normalization condition / V,?dy = 1 as 


Nios (= with Va=2= J (see (9.143), p. 601). (9.147b) 


2n! 
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From the terminating condition of the series (9.143c) 
1 
E, Πω (n+5) (n 0,1,2,...) (9.147c) 


follows for the eigenvalues of the vibration energy. The spectrum of the energy levels is equidistant. 
'The sum- 

mand +1/2 in the parentheses means that in contrast to the 

classical case the quantum mechanical oscillator has energy V(x)A E(x) 

even in the deepest energetic level with n = 0, which is known W(x) 
as the zero-point vibration energy. 

Fig. 9.21 shows a graphical representation of the equidistant 

spectra of the energy states, the corresponding wave functions 

from V, to V; and also the function of the potential energy 

(9.142c). The points of the parabola of the potential energy 

represent the reversal points of the classical oscillator, which 


= N QOQ HH σι 


1 ; 
are calculated from the energy E — —mu?a? as the amplitude 


fiv 

1 [2E : m ; 

à — —4| —. The quantum mechanical probability of finding a 
m 


ω 
particle in the interval (, x-4-dz) is given by dug, = |Y (x)|? dx. 
It is different from zero also outside of these points. So for, e.g., Figure 9.21 


e 





[A 
n = 1, hence for E — (3/2)hw, according to dw,, = 2 Dg dx, the maximum of the probability of m 
T 


presence is at 


+1 h 
Tmazqu = Te = a (9.147d) 


For a corresponding classical oscillator, this is 


2E 3h. 
Tmaz,kl = a = 27+ I . (9.146) 
πιω πιω 




















For large quantum numbers n the quantum mechanical probability density function approaches the 
classical one in its mean value. 





9.2.5 Non-Linear Partial Differential Equations: Solitons, Periodic 


Patterns and Chaos 
9.2.5.1 Formulation of the Physical-Mathematical Problem 


1. Notion of Solitons 

Solitons, also called solitary waves, from the viewpoint of physics, are pulses, or also localized distur- 
bances of a non-linear medium or field; the energy related to such propagating pulses or disturbances 
is concentrated in a narrow spatial region. They occur: 

e in solids, e.g., in inharmonic lattices, in Josephson contacts, in glass fibres and in quasi-one-dimensional 
conductors, 

e in fluids as surface waves or spin waves, 

e in plasmas as Langmuir solitons, 

e in linear molecules, 

e in classical and quantum field theory. 

Solitons have both particle and wave properties; they are localized during their evolution, and the do- 
main of the localization, or the point around which the wave is localized, travels as a free particle; in 
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particular it can also be at rest. A soliton has a permanent wave structure: based on a balance between 
nonlinearity and dispersion, the form of this structure does not change. 

Mathematically, solitons are special solutions of certain non-linear partial differential equations occur- 
ring in physics, engineering and applied mathematics. Their special features are the absence of any 
dissipation and also that the non-linear terms cannot be handled by perturbation theory. Dissipative 
solitons are in non-conservative systems. 

2. Important Examples of Equations with Soliton Solutions 





a) Korteweg de Vries (KdV) Equation Ut + Uug + Ugre = 0. (9.148) 
b) Non-Linear Schroedinger (NLS) Equation iu + Uzr + 2|u|?u = 0, (9.149) 
c) Sine-Gordon (SG) Equation Ut — Ure + Sinu = 0. (9.150) 


The subscripts x and t denote partial derivatives, ¢.g., Urx = 0?u/Ox?. 

In these equations the one-dimensional case is considered, i.e., u has the form u = u(x,t), where x is the 
spatial coordinate and t is the time. The equations are given in a scaled form, i.e., the two independent 
variables x and t are here dimensionless quantities. In practical applications, they must be multiplied 
by quantities having the corresponding dimensions and being characteristic of the given problem. The 
same holds for the velocity. 

3. Interaction between Solitons 

If two solitons, moving with different velocities, collide, they appear again after the interaction as if 
they had not collided. Every soliton asymptotically keeps its form and velocity; there is only a phase 
shift. Two solitons can interact without disturbing each other asymptotically. This is called an elastic 
interaction which is equivalent to the existence of an N-soliton solution, where N (N = 1,2,3,...) is 
the number of solitons. Solving an initial value problem with a given initial pulse u(x, 0) that disag- 
gregate into solitons, the number of solitons does not depend on the shape of the pulse but on its total 
amount [T° u(a, 0) dx. 

4. Periodic Patterns and Non-Linear Waves 

Such non-linear phenomena occur in several classic dissipative systems (i.e. friction or damping sys- 
tems), when an external impact or force is sufficiently large. E.g., if there is a layer of fluid in the 
gravitational field, and it is heated from below, the difference of temperature between the upper and 
lower surface corresponds to an external force. The higher temperature of the lower layer reduces its 
density and makes it lighter than the upper part, so the layering becomes unstable. At a sufficiently 
large temperature difference this unstable layering turns spontaneously into periodically arranged con- 
vection cells. It is called the bifurcation from the state of thermal conductivity (without convection) 
into the well ordered Rayleigh-Bénard convection. Taking away the external force results because of 
dissipation into damping of the waves (here the cellular convection). Strengthening of the external 
force drives the ordered convection into a turbulent convection and into chaos (see 17.3, p. 892). Also 
in chemical reactions similar phenomena can occur. Important examples for equations describing such 
phenomena are: 


a) Ginsburg-Landau (GL) Equation u — u — (1 + ib)uss + (1 + ic)|ul?u = 0, (9.151) 
b) Kuramoto-Sivashinsky (KS) Equation t + Urs + Urrez + U2 = 0. (9.152) 


In contrast to the dissipation-less KdV, NLS, SG, equations, the equations (9.151) and (9.152) are non- 
linear dissipative equations, which have, besides spatiotemporal periodic solutions, also spatiotemporal 
disordered (chaotic) solutions. Appearance of spatiotemporal patterns or structures is characteristic 
which turn into chaos. 

5. Dissipative Solitons 

Solitary (isolated) wave phenomena in non-conservative systems often are called dissipative solitons. 
Contrary to the conservative systems, in which the solitons usually form families of solutions with at 
least one continuously changing parameter, dissipative solitons can be found at single points of the 
parameter space, at which a balance is formed between dispersion and nonlinearity from one side and 
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energy or particle flow and dissipation from the other side. This property leads to a special kind of sta- 
bility of dissipative solitons, although they are not solutions of integrable wave equations. Dissipative 
solitons are described among others by the complex Ginsburg-Landau equation. They occur ,e.g., in 
non-linear optic cavitations, in optic semiconductor amplifiers and in reaction-diffusion systems (see 
also [9.16]). 

6. Non-Linear Evolution Equations 

Evolution equations describe the evolution of a physical quantity in time. Examples are the wave equa- 
tion (see 9.2.3.2, p. 590), the heat equation (see 9.2.3.3, p. 591) and the Schroedinger equation (see 
9.2.4.1, 1., p. 592). The solutions of the evolution equations are called evolution functions. 

In contrast to linear evolution equations, the non-linear evolution equations (9.148), (9.149), and (9.150) 
contain non-linear terms like ιϑιι/ϑα, |u|?u, sinu and u?. These equations are (with the exception of 
(9.151)) parameter-free. From the viewpoint of physics non-linear evolution equations describe struc- 
ture formations like solitons (dispersive structures) as well as periodic patterns and non-linear waves 
(dissipative structures). 


9.2.5.2 Korteweg de Vries Equation 
(KdV) 


1. Occurrence 

The KdV equation is used in the discussion of 

e surface waves in shallow water, 

e inharmonic vibrations in non-linear lattices, 

e problems of plasma physics and 

e non-linear electric networks. 

2. Equation and Solutions 

The KdV equation for the evolution function u is 





Ut + 6utlz + User = 0. (9.153) 
It has the soliton solution 
u(x,t) = - (9.154) 


2 cosh? [Eva — vt — e) | 


Figure 9.22 

This KdV soliton is uniquely defined by the two dimensionless parameters v (v > 0) and y. In Fig. 
9.22 v = 1 is chosen. A typical non-linear effect is that the velocity of the soliton v determines the 
amplitude and the width of the soliton: KdV solitons with larger amplitude and smaller width move 
faster than those with smaller amplitude and larger width (taller waves travel faster than shorter ones). 
The soliton phase y describes the position of the maximum of the soliton at time t = 0. 


Equation (9.153) also has N-solitons solutions. Such an N-soliton solution can be represented asymp- 
totically for t — -Eoo by the linear superposition of one-soliton solutions: 
N 


u(x,t) ~ X unl, t). (9.155) 


n=1 





Here every evolution function un(x,t) is characterized by a velocity vn and a phase qz. The initial 
phases y} before the interaction or collision differ from the final phases after the collision v7 , while the 
velocities v1, v3, ..., vw have no changes, i.e., it is an elastic interaction. 

For N — 2, (9.153) has a two-soliton solution. It cannot be represented for a finite time by a linear 


1 1 
superposition, and with kn = 2 Vn and an = 5 Un(T — Vnt — Pn) (n = 1,2) it has the form: 
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1 
kera + k2e% -- (Κι -- Κο) ε(αι 1») |2 + hab (kie + 29] 
u(x,t) =8 ; Ξ 5 : (9.156) 
1 ρα 50 p(a1+a2) 
+e% +e + (P22) gun 7 





Equation (9.156) describes two non-interacting solitons for t + —oo asymptotically with velocities 
vy, = 4k? and vp = 4k3, which transform after their mutual interaction again into two non-interacting 
solitons with the same velocities for t + -+-oo asymptotically. 

The non-linear evolution equation 





w;, + 6(wz)? + Were = 0 (9.157a) 
Fr . . 
where w — T has the following properties: 
1 
a) For F(x#,t)=1+e*%, a= 3 V via — vt — q) (9.157b) 
it has a soliton solution and 
kı — k2\” (otan) 
b)for F(z,t) 2 1--e?* - e?? 4- guttam (9.157c) 
kı + k2 


it has a two-soliton solution. With 2w, = u the KdV equation (9.153) follows from (9.157a). Equation 
(9.156) and the expression w following from (9.157c) are examples of a non-linear superposition. 

If the term +6uu, is replaced by —6uu, in (9.153), then the right-hand side of (9.154) has to be multi- 
plied by (—1). In this case the notation antisoliton is used. 


9.2.5.3 Non-Linear Schroedinger Equation (NLS) 

1. Occurrence 

The NLS equation occurs 

e in non-linear optics, where the refractive index n depends on the electric field strength E, as, e.g., for 
the Kerr effect, where n(E) = no + ng|E|? with no, n2 = constant holds, and 

e in the hydrodynamics of self-gravitating discs which allow us to describe galactic spiral arms. 

2. Equation and Solution 

The NLS equation for the evolution function wu and its solution are: 


exp (—if2ge + 4(€ = 9? )t = x1) 


iu, 4- ugs E 2|u[^u = 15 x,t) =2 
itu [uu — 0, (9.158) u(x,t) — 2n ρα αι ø) 








(9.159) 


Here u(x, t) is complex. The NLS soliton is characterized by the four dimensionless parameters n, £, p, 
and x. The envelope of the wave packet moves 

with the velocity v = —4€; the phase velocity Re u(x,t) 

of the wave packet is 2(1? — €?)/é. 1 

In contrast to the KdV soliton (9.154), the 
amplitude and the velocity can be chosen in- 
dependently of each other. Fig. 9.23 displays 
the real part of (9.159) with y = 1/2 and 
£ —4. 

The solutions of the form (9.159) often are 
called light solitons; they solve the focusing 
NLS equation (9.158) for the case ”+”. The 
defocusing NLS equation (case ”—”) gives 
solitons, for which |u|? at the position of the Figure 9.23 








9.2 Partial Differential Equations 607 








solitons is reduced in comparison to a constant background |u(x — +00)| = n. Such dark solitons have 
the form 


.υ. | v? | v? af 
u(x,t) = (5 +4? — T tanh | η -- τα - w|) “exp [-i (2n t+ x) ; (9.160) 


They depend on the three parameters 7, v and x and propagate with the velocity v < 27 on a back- 
ground with trivial (flat) phase (see [9.26], [9.23]). 

A general solution has in addition a phase gradient, which can be interpreted as velocity c of the back- 
ground, relative to which the soliton is moving. Then the solution has the following form: 


) 2 2 5 2 
u(x,t) = ( +4)? — T | tanh | m A (a — ut «| exp | (sri Ex σα τη) : 


(9.161) 


Beside of these exponential positioned soliton waves also periodic solutions of the NLS equation exist, 
which can be interpreted as wave packets of solitons. Such solutions can be find demanding stationarity 
and by integration of the remaining ordinary differential equation. Generally such solutions are elliptic 
Jacobian-functions (see 14.6.2, p. 763). Some relevant solutions see [9.17]. 

In the case of N interacting solitons, one can characterize them by 4N arbitrary chosen parameters: 
Ώπ, ἔπ, ῥπ,Χπ (πι 1,2,..., N). 

If the solitons have different velocities, the N-soliton solution splits asymptotically for t —^ +00 into a 
sum of N individual solitons of the form (9.159). 


9.2.5.4 Sine-Gordon Equation (SG) 

1. Occurrence 

The SG equation is obtained from the Bloch equation for spatially inhomogeneous quantum mechani- 
cal two-level systems. It describes the propagation of 

e ultra-short pulses in resonant laser media (self-induced transparency), 

e the magnetic flux in large surface Josephson contacts, i.e., in tunnel contacts between two supercon- 
ductors and 

e spin waves in superfluid helium 3 (*He). 

The soliton solution of the SG equation can be illustrated by a mechanical model of pendula and springs. 
The evolution function goes continuously from 0 to a constant value c. The SG solitons are often called 
kink solitons. If the evolution function changes from the constant value c to 0, it describes a so-called 
antikink soliton. Walls of domain structures can be described with this type of solutions. 











2. Equation and Solution 


The SG equation for the evolution function u is 6 ae 2π 
Ug — ας + sinu = 0. (9.162) 

It has the following soliton solutions: 4 

1. Kink Soliton 
u(x,t) = 4arctan e7070), (9.163) " 2 

where y — AT and —1 « v < +1. 

The kink soliton (9.163) for v = 1/2 is givenin Fig.9.24. — -10 -5 0 5 10 

The kink soliton is determined by two dimensionless pa- X-X,-vt 


rameters v and zo. The velocity is independent of the am- 
plitude. The time and the position derivatives are ordi- Figure 9.24 
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nary localized solitons: 


i 2 
ο τι μαμα μμ (9.164) 
υ cosh y(x — zo — vt) 
2. Antikink Soliton 
u(x,t) = 4arctan e7707), (9.165) 


3. Kink-Antikink Soliton One gets a static kink-antikink soliton from (9.163, 9.165) with v = 0: 


u(r,t) — Aarctan et 79, (9.166) 
Further solutions of (9.162) are: 
4. Kink-Kink Collision 

















ο] I T 
u(x,t) = Aarctan | EUN | (9.167) 
cosh yut 
5. Kink-Antikink Collision 
1 sinh vt 
u(x,t) = 4arctan E ult | (9.168) 
v cosh yg 
6. Double or Breather Soliton, also called Kink-Antikink Doublet 
1-4? sin wt 
x,t) — 4arctan | ———— — ————7 |. 9.169 
SE — | w  coshy1-— z ( ) 


Equation (9.169) represents a stationary wave, whose envelope is modulated by the frequency w. 
7. Local Periodic Kink Lattice 





u(x,t) — 2 arcsin [em ( a x)| Tm. (9.1702) 
k/1— v2 
The relation between the wavelength A and the lattice constant k is 
A — AK(k)kv/1 — v?. (9.170b) 
For k — 1, i.e., for A — oo, one gets 
u(x,t) — Aarctan e*? 790, (9.170c) 


which is the kink soliton (9.163) and the antikink soliton (9.165) again, with ao = 0. 


Remark: snz is a Jacobian elliptic function with parameter k and quarter-period A (see 14.6.2, 
p. 763): 


snz = sin p(x, k), (9.171a) 
sin g(z,k) 7/2 
d 
r= ————À (971b) K(k) = I a faire) 
i (1 — q?)(1 — kq?) 1 ΝῚ-- Κ25η7Θ 


Equation (9.171b) comes from (14.104a), p. 763, by the substitution of sin  — q. The series expansion 
of the complete elliptic integral is given as equation (8.104), in 8.2.5, 7., p. 515. 


9.2.5.5 Further Non-linear Evolution Equations with Soliton Solutions 
1. Modified KdV Equation 


2 
Ut E 6U Ug + Ugre = 0. 





(9.172) 


The even more general equation (9.173) has the soliton (9.174) as solution. 
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jdeos00«2 


Uu dr uPus + User =0, (9.173) u(x,t) = ye (9.174) 
:osh2 S 
cosh (sevivi@ — vt — al] 
2. Sinh-Gordon Equation 
Ug — ugs + sinh u = 0. (9.175) 


3. Boussinesq Equation 

Uze = Ug }- (υᾶ)εᾳ + Urene = 0. (9.176) 
This equation occurs in the description of non-linear electric networks as a continuous approximation 
of the charge-voltage relation. 
4. Hirota Equation 

αι 4 i3a|u[?u, -- Bus, +ioUzer + 5]ul?u = 0, aß = có. (9.177) 








5. Burgers Equation 

Ut — Ugg + Uug = 0. (9.178) 
This equation occurs when modeling turbulence. With the Hopf-Cole transformation it is transformed 
into the diffusion equation, i.e., into a linear differential equation. 
6. Kadomzev-Pedviashwili Equation 
The equation 

(us -- Gut + Uzsr)e = Uyy (9.179a) 
has the soliton 


o? 1 i 2,2 
u(x,y,t) — 258 In lz + lx + iky — 3k | | (9.179b) 
as its solution. The equation (9.179a) is an example of a soliton equation with a higher number of in- 
dependent variables, e.g., with two spatial variables. 


Remark: The CD-ROM to the 7th, 8th and 9th German editions of this handbook (see [22.8]) con- 
tains more non-linear evolution equations. Furthermore there is shown the application of the Fourier 
transformation and of the inverse scattering theory to solve linear partial differential equations. 





10 Calculusof Variations 
10.1 Defining the Problem 


1. Extremum of an Integral Expression 

A very important problem of the differential calculus is to determine for which x values the given func- 
tion y(r) has extreme values. The calculus of variations discusses the following problem: For which 
functions has a certain integral, whose integrand depends also on the unknown function and its deriva- 
tives, an extremum value? The calculus of variations concerns itself with determining all the functions 
y(x) for which the integral expression 


Ify] = / Εἴα, γ(α), γ'(α),..., γ("(α))άν (10.1) 


has an extremum, if the functions y(x) are from a previously given class of functions. Here, it ist possible 
to define some boundary and side conditions for y(x) and for its derivatives. 

2. Integral Expressions of Variational Calculus 

There can also be several variables instead of x in (10.1). In this case, the occurring derivatives are 
partial derivatives and the integral in (10.1) is a multiple integral. In the calculus of variations, mainly 
the following types of integral expressions are discussed: 


Ily] = ΠΠ; (0.2 


b 
Ily, Y2,- -, Yn] = [Fon sync). i (x), un (x)) da, (10.3 
b 
Ify] = [Ee (a), y (x), . .. y (a) dz, (10.4 


~= 


ul = f| Fle, y, ustes ty) dx dy. (10.5 
Q 


Here the unknown function is u — u(x, y), and € represents a plane domain of integration. 


Πα] -- If F(2,Y, 2, U, Ug, uy, uz) dz dy dz. (10.6 
Je 











The unknown function is u = u(x, y, z), and R represents a space region of integration. Additionally, 
boundary values can be given for the solution of a variational problem, at the endpoints of the interva. 
a and b in the one-dimensional case, and at the boundary of the domain of integration Q in the two- 
dimensional case. Besides, various further side conditions can be defined, e.g., in integral form or as a 
differential equation. 

A variational problem is called first-order or higher-order depending whether the integrand F contains 
only the first derivative y' or higher derivatives y™ (n > 1) of the function y. 

3. Parametric Representation of the Variational Problem 

A variational problem can also be posed in parametric form. Considering a curve in parametric form 
x = x(t), y = y(t) (a <t < 8), then, e.g., the integral expression (10.2) has the form 





B 
Ile y] — [| FG. (0,80. 30) dt. (10.7) 
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10.2 Historical Problems 


10.2.1 Isoperimetric Problem 
The general isoperimetric problem is to determine the plane region with the largest area among the 
plane regions with a given perimeter. The solution of this problem, a circle with a given perimeter, 
originates from queen Dido, who was allowed, as legend has it, to take such an area for the foundation 
of Carthago which she could be surround by one bull's leather. She cut the leather into fine stripes, 
and formed a circle with them. 

A special case of the isoperimetric problem is to find the equation of 

the curve y = y(x) in a Cartesian coordinate system connecting the y y(x) 
points A(a, 0) and B(b, 0) and having the given length l, for which 
the area determined by the line segment AB and the curve is the 
largest possible (Fig. 10.1). For the mathematical formalization a 
once continuously differentiable function y(x) is to be determined, 
such that 


0A(a0) B(bO) x 


Figure 10.1 


b 
Ify] = Jro dx = max (10.8a) 
holds, where the side condition (10.8b) and the boundary conditions (10.8c) are satisfied: 


Gly] = / 14 y?(x)dr —l (10.8b) y(a) = y(b) =0 (10.8c) 


10.2.2 Brachistochrone Problem 


The brachistochrone problem was formulated in 1696 by J. Bernoulli, and it is the following: A point 
mass descends from the point P (0, yo) to the origin in the vertical plane x, y only under the influence 
of gravity. The curve y = y(x) along which the point reaches the origin in the shortest possible time 
from P (Fig. 10.2) is to be determined. With the formula for the time of fall T', (see 11.5.1, p. 648), the 
following mathematical description is possible to determine a once continuously differentiable function 
y = y(x) for which 


πρ Y 

/l 4,72 

Tly| J N dr — min (10.9) Yo 
0 


v 29(yo — v) 
holds, (g is the acceleration due to gravity). The boundary value 
conditions are 
y(0)=0, ^ y(ro) = yo- (10.10) 0 
For x = xo in (10.9) there is a singularity. Figure 10.2 





10.8 Variational Problems of One Variable 


10.3.1 Simple Variational Problems and Extremal Curves 


A simple variational problem is to determine the extreme value of the integral expression given in the 
form 


Ty] = [Fe y(x), y (x)) dv, (10.11) 





612 10. Calculus of Variations 





where y(x) is a twice continuously differentiable function satisfying the boundary conditions y(a) = A 
and y(b) = B. The values a, b and A, B, and the function F are given. 

The integral expression (10.11) is an example of a so-called functional. A functional assigns a real 
number to every function y(x) from a certain class of functions. 

If the functional I [y] in (10.11) takes, e.g., its relative maximum for a function yo(x), then 


I[yo] 7 I[u] (10.12) 


for every twice continuously differentiable function y satisfying the boundary conditions. The curve 
y — yo(x) is called an extremal curve. Sometimes all the solutions of the Euler differential equation of 
the variational calculus are called extremal curves. 


10.3.2 Euler Differential Equation of the Variational Calculus 


A necessary condition for the solution of the variational problem can be constructed by the help of an 
auxiliary curve or comparable curve for the extremal yo(a) characterized by (10.12) 


y(@) = yo(x) + n(x) (10.13) 
with a twice continuously differentiable function 7(2) satisfying the special boundary conditions 7(a) = 
n(b) = 0; € is a real parameter. Substituting (10.13) into (10.11) there is a function depending on € 
instead of the functional I [y] 


b 
1{ε) -- [ Fe, Yo €n, yo t em) da. (10.14) 


The functional 7 [y] has an extreme value for yo(zx) if the function I (c), as a function of e, has an extreme 

value for c — 0. Deducing the variational problem to an extreme value problem with the necessary 

condition 

2 =0 for «=0 (10.15) 
de 

and supposing that the function F, as a function of three independent variables, is differentiable as 

many times as needed, by the help of the Taylor expansion (see 7.3.3.3, p. 471) follows 


b 
OF OF 
Ie) — / [βία + gy Ewon t gg o en O(®)| dx. (10.16) 
The necessary condition (10.15) results in the equation 
b b 
OF OF 
[1ο dx + he, dx = 0. (10.17) 


Partial integration of this equation and considering the boundary conditions for n(x), gives 


b 
oF d (dF 
p (= E (=) dx =0. (10.18) 


From the assumption of continuity and because the integral in (10.18) must vanish for any considerable 


n(x), 
OF d (OF 
Ee 10.1 

Oy dx (5) η (10.19) 


must hold. The equation (10.19) gives a necessary condition for the simple variational problem and it 
is called the Euler differential equation of the calculus of variations. The differential equation (10.19) 
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can be written in the form 


OF OF OF , OF, 


Oy  OzxOy 8y8y'" Τ ay?” 





=0. (10.20) 


It is an ordinary second-order differential equation if Fyy 4 0 holds. 
The Euler differential equation has a simpler form in the following special cases: 
Case 1: F(x, y,y') = F(y'), i.e., x and y do not appear explicitly. Then instead of (10.19) holds 
OF d (OF 
a 0 (10.21a) and a (5) -0. (10.219) 
Case 2: F(r,y, y) — F(y, y), i.e., x does not appear explicitly. From 


d Ῥ F F F 1 (8F F d (OF 
(r je j- y+? y" y? y (5 ) y (5 (5 )) (10.22a) 








απ Oy! Oy Oy! Oy  " dr NOy ὃν dr VOy 
and because of (10.19) follows 


2 (r — vÆ) =0, (10.22b) ie, Ε- y; —c (c const) (10.22c) 


as a necessary condition for the solution of the simple variational problem in the case F = F (y, y’). 
E A: The functional to determine the shortest curve connecting the points P, (a, A) and P»(b, B) in 
the x, y plane is: 


b 
Ify] = / V 1-4 y? dx — min. (10.23a) 


It follows from (10.21b) for F — F(y) 2 V1-- y? that 


= (55) - εδ (10.280) 
dx \ Oy (vix y?) 


so y" — 0, i.e., the shortest curve is the straight line. 
E B: Connecting the points P; (a, A) and P(b, B) by a curve y(x), and rotating it around the z-axis 
the surface area is 


b 
Jy] = 2r f yy1+y?dz. (10.24a) 
Which curve y(x) creates the smallest surface area? From (10.22c) with F = F(y,y’) = 2ry/1T + y? 
l ND ; 
follows y = = 1 +y? or y? = (5) — 1, where c = z . This differential equation is separable 
T e π 


(see 9.2.2.3, 1., p. 579), and its solution is 


y = & cosh (= + ca) (c1, c9 const), (10.24b) 
e 


the equation of the so-called catenary curve (see 2.15.1, p. 107). The constants c4 and c» can be de- 
termined from the boundary values y(a) — A and y(b) — B. So, it is to solve a system of non-linear 
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equations (see 19.2.2, p. 961), which cannot be solved for every boundary value. 


10.3.3 Variational Problems with Side Conditions 


These problems are usually isoperimetric problems (see 10.2.1, p. 611): The simple variational problem 
(see 10.2.1, p. 611), given by the functional (10.11), is completed by a further side condition in the form 


[Glo.yla),y(@)) de =I! (I const) (10.25) 


where the constant / and the function G are given. A method to solve this problem is given by Lagrange 
(extreme values with side conditions in equation form, see 6.2.5.6, p. 456). Considering the expression 


A(x, y(x),y'(2),A) = F(x, y(2), y'(a)) + AG(a, y(a), y'(a)) , (10.26) 


where is a parameter, and solving the problem 





b 
[Βία y(x), y'(x), A)dx = extreme!, (10.27) 
as an extreme value problem without side condition. The corresponding Euler differential equation is: 
OH d (OH 
----- zs 10.2 
ὃν dx E ) 5 ΠΡ 


The solution y = y(x, A) depends on the parameter À, which must be determined by substituting y(x, A) 
into the side condition (10.25). 
W For the isoperimetric problem 10.2.1, p. 611, one gets 


H (v, y(x), y (x), ) 2 y - M1 y?. (10.294) 


Because the variable x does not appear in H, instead of the Euler differential equation (10.28), analo- 
gously to (10.22c), one gets the differential equation 


IZ SEI e c a2 
ytAVvl+y?— LS = cq or y= {ΕΣ (ει const), (10.295) 
/ 1# y? €1— 
whose solution is a family of circles 
(x — cea)? +(y- ay? =)? (c1, c2, À const). (10.29c) 
The values ci, c? and A are determined from the conditions y(a) — 0, y(b) — 0 and from the requirement 


that the arclength between A and B should be l. The result is a non-linear equation for A, which should 
be solved by an appropriate iterative method. 


10.3.4 Variational Problems with Higher-Order Derivatives 
There are two types of problems to be considered here. 
1l. F= F(z,y,y',y”) 


The variational problem is: 
b 
Ify] = [Ee y, y. uy") dx = extreme! (0.304) 


with the boundary values 


y(a) —- A, w(b)-B, w(a)- A, 0) -- Β' (10.30b) 
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where the numbers a, b, A, B, A', and B', and the function F are given. Similarly as in 10.3.2, p. 612, 
introducing comparable curves y(x) = yo(x) 4- en(z) with n(a) — n(b) — n'(a) — (b) = 0, yields the 
Euler differential equation 


OF d (OF @ (OF 
-- pea p = 10.31 
ὃν dx (5) T da? (25) n (10:31) 


as a necessary condition for the solution of the variational problem (10.30a). The differential equation 
(10.31) represents a fourth-order differential equation. Its general solution contains four arbitrary con- 
stants which can be determined by the boundary values (10.30b). 


W Consider the problem 





1 
Ify] = [ (y? — ay? — fy?) dx — extreme! (10.32a) 
Jo 


with the given constants a and 8 for F = F(y,y',y”) = y'? — ay? — By?. Then: Fy = —28y, 


1 d? 
Εν ---2αγ', Εμ» — 2y", ; (Fy) = -2ay", dii (Fy) = 2y, and the Euler differential equation is 
dx x 








yO + ay” — By = 0. (10.32b) 
This is a fourth-order linear differential equation with constant coefficients (see 9.1.2.3, p. 553). 
2. F = F(x,y, Y's- y) 
In this general case, when the functional 7 [y] of the variational problem depends on the derivatives of 
the unknown function y up to order n. (n 7 1), the corresponding Euler differential equation is 


OF d (OF d (OF ΝΠ 
ὃν dx [or e ο) ECU Ge (zs) 0, (10.33) 


whose solution should satisfy the boundary conditions analogously to (10.30b) up to order n — 1. 





10.3.5 Variational Problem with Several Unknown Functions 
Suppose the functional of the variational problem has the form 











b 
Ily, Yas., Yn] = | FG mms ete af ds (10.34) 
where the unknown functions yi(x), ya(x), ..., ys (x) should take given values at x = a and x = b. 
Considering n twice continuously differentiable comparable functions 
yi(x) — vio(x) *- eim(x) | (6 — 1,2,..., m), (10.35) 
where the functions rj; (x) should vanish at the endpoints, (10.34) becomes /(ei, €2, . .., €4) with (10.35). 
From the necessary conditions 
ol 
—=0 (@=1,2,...,n) (10.36) 
OG; 
for the extreme values of a function of several variables, follow the n Euler differential equations 
OF d (OF OF d (ƏF OF 1 (OF 
- a ADU MEE ms - 0, (10.37) 
Oy — dx \ Oyj Oy. dx \ Oy) Oy, dx \ Oy}, 


whose solutions yi(x), yo(x), ..., yn (c) must satisfy the given boundary conditions. 


10.3.6 Variational Problems using Parametric Representation 


For some variational problems it is useful to determine the extremal, not in the explicit form y — y(x), 
but in the parametric form 


c=x(t), y=y(t) (1 <t< te), (10.38) 
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where tı and tz are the parameter values corresponding to the points (a, A) and (b, B). Then the simple 
variational problem (see 10.3.1, p. 611) is 


Ile, y] = / F(z(t),y(), &(£), $(£))dt — extreme! (10.392) 


with the boundary conditions 

αι) τα, z(t3)— b, γι) - Α, ο) -- Β. (10.39b) 
Here i and j denote the derivatives of z and y with respect to the parameter f£, as usual in the parametric 
representation. 
The variational problem (10.39a) makes sense only if the value of the integral is independent of the 
parametric representation of the extremal curve. To ensure the integral in (10.39a) is independent of 
the parametric representation of the curve connecting the points (a, A) and (b, B), F must be a positive 
homogeneous function of first order, i.e., 

F(z, y, ui, uj) — nF(x.y. 3,9). (p » 0) (10.40) 
must hold. 
Because the variational problem (10.39a) can be considered as a special case of (10.34), the correspond- 
ing Euler differential equations are 

OF d (OF OF d (OF 

ew .“- av | = 0. (10.41) 
Ox dt \ 0% Oy dt \ dy 


They are not independent of each other, but they are equivalent to the so-called Weierstrass form of 
the Euler differential equation: 











OF OF 
E EO es 42a 
διδὺ  OiOy T M(ig-—3dy)-0 (10.422) 
with 
1 OF 1 OF 1 F 
id m — - αν 


Analogous to the calculation of the radius of curvature R of a curve given in parametric representation 
(see 3.6.1.1, 1., p. 243), the calculation of the radius of curvature of the extremal curve is to be made, 
considering (10.42a), with 


D +9\3/2 
(à? 4 2" 


ij — ij 


M (2 e oy" 


y Fiy = Foy 


(10.42c) 














Hf The isoperimetric problem (10.8a to 10.8c) (see 10.2.1, p. 611) has the form in parametric represen- 
tation: 


t2 t2 
I[x,y] = [ y(t)i(t)dt = max! (10.43a) with / VEO - j*(t) dt — 1. (10.43b) 


This variational problem with the side condition becomes an unconstrained variational problem ac- 
cording to (10.26) with 


H = H(x,y, 4,9) ^ yà - AJ 8 4 P. (10.43c) 
H satisfies the condition (10.40), so it is a positive homogeneous function of first degree. Furthermore, 
1 λ 
M = — Hi; = za; AHiy=1, Hey =0, (10.43d) 
παρκ 9 : 





y 


holds, so (10.42c) yields the radius of curvature R = |A|. Since A is a constant, the extremals are circles. 
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10.4 Variational Problems with Functions of Several Variables 
10.4.1 Simple Variational Problem 


One of the simplest problems with a function of several variables is the following variational problem 
for a double integral: 


Πα] -- / Εαν, (η, ν) ας, τιν) ἆα ἄν -- οχίτοιησ!. (10.44) 
(G) 
The unknown function u = u(a,y) should take given values on the boundary I of the domain G. 
Analogously to 10.3.2, p. 612, a comparable function is to be introduced in the form 
u(x, y) = uo(x, y) + en(z, y), (10.45 
where uo(z, y) is a solution of the variational problem (10.44), which takes the given boundary values, 
while n(x, y) satisfies the condition 


n(z,y)=0 onthe boundary P. (10.46 
n(x,y) together with uo(x,y) both are differentiable as many times as needed. The quantity € is a 
parameter. 


Next, a surface is to be determined by u = u(x, y), which is close to the solution surface uo(z, y). I(u 
becomes J(e) with (10.45), i.e., the variational problem (10.44) becomes an extreme value problem 
which must satisfy the necessary conditions 








= = for e€=0. (10.47 
de 

From this follows the Euler differential equation 
OF ὃ 9Ρ ὃ (OF 
CC ME d ees if pu 4 
Ou öx (=) ὃν (=) (10.48 


as a necessary condition for the solution of the variational problem (10.44). 
@ A free membrane, fixed at the perimeter I of a domain G in the x, y plane, covers a surface with 
area 


e / dz dy. (10.492) 


If the membrane is deformed by a load so that every point has an elongation u — u(x,y) in the z- 
direction, then its area is calculated by the formula 


R= / VL ud d dd dz dy. (10.49b) 
(c) 
Linearizing the integrand in (10.49b) and using Taylor series (see 6.2.2.3, p. 449), one gets the relation 
1 
1954 ΓΠ1- 7 / (uz + uz) dg dy. (10.49c) 
G) 


For the potential energy U of the deformed membrane holds 


U=0(h-h)= ;//( u +u?) dedy, (10.494) 


where the constant c in the tension of the membrane. In this way arises the so-called Dirichlet 
variational problem: The function u = u(x, y) is to be determined so that the functional 


ul = n (ὦ + uz) dx dy (10.49e) 
(G) 
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should have an extremum, and u vanishes on the boundary T of the plane domain Œ. The corresponding 
Euler differential equation is 


Qu Ou 
— = 0. 10.49f 
Ox? * Oy? 0 (10:491) 


It is the Laplace differential equation for functions of two variables (see 13.5.1, p. 729). 


10.4.2 More General Variational Problems 

Two generalizations of the simple variational problem are to be considered here. 

1. F= F(a, Y, u(x, y). Ug, Uy, Ure, Unys Uyy) 

The functional depends on higher-order partial derivatives of the unknown function u(x, y). If the 
partial derivatives occur up to second order, then the Euler differential equation is: 


OF ὃ (9Ε} ὃ {8Ε ad δΕΧ ,  (àF πο OF \ _, (10.50) 
ðu Ox \ Juz Oy \ uy Ox? \ Ouse)  OxOy Olzy Oy? \ Puy i i 


2. P —JIX(n, 931305448245); Ua 526g) 
In the case of a variational problem with n independent variables x4, x3, ..., 5, the Euler differential 
equation is: 


Dome [το (10.51) 


Qu |. at ὅτι Our, 





10.5 Numerical Solution of Variational Problems 

Most often two ways are used to solve variational problems in practice. 

1. Solution of the Euler Differential Equation and Fitting the Found Solution to 
the Boundary Conditions 

Usually, an exact solution of the Euler differential equation is possible only in the simplest cases, so 

numerical methods are to be used solving the boundary value problem for ordinary or for partial dif- 

ferential equations (see 19.5, p. 976 or 20.3.4, p. 1042ff). 

2. Direct Methods 

The direct methods start directly from the variational problem and do not use the Euler differential 

equation. The most popular and probably the oldest procedure is the Ritz method. It belongs to the so- 

called approximation methods which are also used to get approximate solutions of differential equations 

(see 19.4.2.2, p. 974 and 19.5.2, p. 977). The following example demonstrates this method. 


W Solve numerically the isoperimetric problem 

[ y" (a) dx = extreme! (10.52a) for L y'(r)dr —1 and y(0)— y(1)— 0. (10.52b) 
The corresponding variational problem without integral side condition according to 10.3.3, p. 614, is: 

Ty] = [ [ν”(α) — Ay! ()] dx = extreme! (10.52c) 
As starting νο to find an approximation solution can be used 

ία) — ayz (a — 1) - az? (a — 1). (10.52d) 


Both approximation functions z (x — 1) and z?(x — 1) are linearly independent, and satisfy the boundary 
conditions. Reducing (10.52c) with (10.52d) gives 
1 1 


2 1 1 
τρ | ge λ (55% + τος + ge) ; (10.52e) 





1 
I (ay, a2) = 34 + 
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and the necessary conditions Or _ Ol _ 0 result in he homogeneous linear equation system 
Oa Oa 
1 i2 


22 1 A Ιλ 4 2 
(5 a )a+(5 ae 0, ς π)α-(5 τ)” 0. (10.52f 


This system has a non-trivial solution only if the determinant of the coefficient matrix is equal to zero: 
A2—52A--420—0, ie, λι -- 10, λο--49. (0.520 
For À — A, = 10 from (10.52f) follows ag = 0, a, arbitrary, so the normalized solution belonging to 
Ài — 10 is: 
y = 548x(xv — 1). (10.52h 
To make a comparison, consider the Euler differential equation belonging to (10.52f). Here the bound- 
ary value problem 











y" +Ay=0 with y(0)—y(1)-0 (10.52i 
holds with the eigenvalues Ap = k?n? (k — 1,2,...) and the solution yz, = cz sin ka. The normalized 
solution, e.g., for the case k = 1, i.e., 4 — s? z 9.87 is 

y = ν2οίππα, (10.52j 


which is really very close to the approximate solution (10.52h). 





Remark: With today’s level of computers and science first of all, the finite element method (FEM 
is to be applied for numerical solutions of variational problems. 

The basic idea of this method is given in 19.5.3, p. 978, for numerical solutions of differential equations. 
The correspondence between differential and variational equations will be used there, e.g., by Euler 
differential equations or bilinear forms according to (19.146a,b). 

Also the gradient method can be used for the numerical solution of variational problems as an efficient, 
numerical method for non-linear optimization problems (see 18.2.7, p. 936). 


10.6 Supplementary Problems 
10.6.1 First and Second Variation 


The derivation of the Euler differential equation with a comparable function by the help of the Taylor 
expansion of the integrand (see 10.3.2, p. 612) was stopped after the linear terms with respect to e: 
b 
Ie) = f Fle, yo+ en, y + en’) de. (10.53) 
a 
Considering also quadratic terms yields 
b 


1{9) -- I(0) — ef [Ee Yo. Yo)n + 55 wat dx (10.54) 


a 


b 
€ OF OF 
“S/o (x, yo, yon? + 2 Dydy Say (ts Yos Yo) + aya” yo. Yon” + Ο(ε)] ἆ 


Denoting as 
1. Variation 6/ of the functional J[y] the expression 


b 
F OF 
= / ΕΠ η + πο. ών dx and as (10.55) 
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2. Variation 0?I of the functional /[y] the expression 


b 
OF OF OF 
Pl = / ΕΠΙ + 25,9 (^ tos) ml + By (Yor Yo)” dx, (10.56) 


then one can write: 
2 
1(ε) -- 1(0) αν εδ] -- < δ31. (10.57) 


By the help of these variations different optimality conditions for the functional / [y] can be formulated 
(see [10.7]). 


10.6.2 Application in Physics 


Variational calculus holds its solid place in physics. So, e.g., the fundamental equations of the Newto- 
nian mechanics can be derived from a variational principle and in this way one can come the Hamilton- 
Jacobi theory. Variational calculus is also important in both atomic theory and quantum physics. It is 
obvious that the extension and generalization of classical mathematical notions is undoubtedly neces- 
sary. Therefore the calculus of variations must be discussed today by modern mathematical disciplines, 
e.g., functional analysis and optimization. Unfortunately, in this book it is not possible to give more 
than a brief account of the classical part of the calculus of variations (see [10.4], [10.5], [10.7]). 


11 LinearIntegral Equations 


11.1 Introduction and Classification 
1. Definitions 


An integral equation is an equation in which the unknown function appears under the integral sign. 
There is no universal method for solving integral equations. Solution methods and even the existence 
of a solution depend on the particular form of the integral equation. 
An integral equation is called linear if linear operations are performed on the unknown function. The 
general form of a linear integral equation is: 

δία) 


σ(ο)ρ(α) -- f(x) - ^ f KG yet) dy, cx ax d. (111) 
α(α) 

The unknown function is y(x), the function K(x, y) is called the kernel of the integral equation, and 
f(x) is the so-called perturbation function. These functions can take complex values as well. The 
integral equation is homogeneous if the function f(a) is identically zero over the considered domain, 
i.e., f(x) = 0, otherwise it is inhomogeneous. A is usually a complex parameter. 
Two types of equation (11.1) are of special importance. If the limits of the integral are independent of 
x, ie., a(x) = a and b(x) 2 b, it is called a Fredholm integral equation (11.2a,11.2b). 
If a(z) = a and b(x) = 2, it is called a Volterra integral equation (11.2c, 11.2d). 
If the unknown function y(x) appears only under the integral sign, i.e., g(x) = 0 holds, one speaks of 
an integral equation of the first kind as (11.2a), (11.2c). The equation is called an integral equation of 
the second kind if g(a) = 1 as in (11.2b), (11.2d). 





b b 
0= f(x) + A f Ki yet) dy, (11.22) p(x) = f(x) + A f KG iet) dy, (11.2b) 
0— f(x) -A f K(,ety)dy, (130) plz) = f(a) +A f K(s.)et) dy. (11.24) 
2. Relations with Differential Equations 
The problems of physics and mechanics relative rarely lead directly to an integral equation. These 


problems can be described mostly by differential equations. The importance of integral equations is 
that many of these differential equations, together with the initial and boundary values, can be trans- 
formed into integral equations. 

W From the initial value problem y'(x) — f(x, y) with x 7 zo and y(vo) = yo by integration from xo 
to x one gets 


wie) = mwt f FENO). (19) 


The unknown function y(x) appears on the left-hand side of (11.3) and also under the integral sign. The 
integral equation (11.3) is linear if the function f(£, y(£)) has the form f(£,n(£)) = a(£) (6) -- 06), 
i.e., the original differential equation is also linear. 

Remark: This chapter 11 deals with only integral equations of the first and second kind of Fredholm 
and Volterra types, and also with some singular integral equations. 


© Springer-Verlag Berlin Heidelberg 2015 621 
I.N. Bronshtein et al., Handbook of Mathematics, 
DOI 10.1007/978-3-662-46221-8_11 


622 11. Linear Integral Equations 





11.2 Fredholm Integral Equations ofthe Second Kind 
11.2.1 Integral Equations with Degenerate Kernel 


Ifthe kernel K (x,y) of an integral equation is the finite sum of products of two functions of one variable, 
i.e., one depends only on x and the other one only on y, it is called a degenerate kernelor a product kernel. 


1. Solution in the Case of a Degenerate Kernel 
The solution of a Fredholm integral equation of the second kind with a degenerate kernel leads to the 
solution of a finite-dimensional system of equations. Consider the integral equation 


ple) = fla) +A f Keyp) dy with (11.4a) 


K(z,y) = a(£)bı(y) + az(x)b2(y) - ... 4- os (x)8s (y). (11.4b) 


The functions o4 (x), ..., o4 (x) and fy(x), ..., B (x 9 are given on the interval [a, b] and are supposed to 
be continuous. Furthermore, the functions oi (x), ..., o, (x) are supposed to be linearly independent 
of one another, i.e., the equality 


Y Cj ag (m) = 0 (11.5) 
k=1 


with constant coefficients c, holds for every a in [a,b] only if cy = @ =... = c, — 0. Otherwise, 
K (v, y) can be expressed as the sum of a smaller number of products. 
From (11.4a) and (11.4b) follows: 


p(x) = f(x) - Aon (x af» y)ply) dy +... + Aa (x jf tne (y) dy. (11.64) 


The integrals are no longer functions of the variable x, they are constant values. Let's denote them by 
Ay: 


b 
Ay = / Bx(y)e(y)dy, k-l,..,n. (11.6b) 


The solution function (xz), if any exists, is the sum of the perturbation function f(x) and a linear 
combination of the functions o4(x), ..., a (x): 


p(x) = f(x) -- AA1o1(x) + λΑχαρ(α) +... + AAnan(z). (11.6c) 
2. Calculation of the Coefficients of the Solution 


The coefficients A1, . . . , An are calculated as follows. Equation (11.6c) is multiplied by 6;() and its 
integral is calculated with respect to x with the limits a and b: 


b 
fao s= f Blof x) dx + AA, fas x)oa(x) dz 4- ... -- AA, fas Ώ)αη(α) dx. (11.7a) 
The left-hand side of this equation is equal to Ax according to (11.6b). Using the following notation 
b b 
b= [Ale x)f(x)dx and ckj = J oae) da: (11.7b) 
there is for k = 1,...,n: 


Ax = bk + AcE Ay + Aco Ag Fst ACkn Án. (11.7c) 
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It is possible that the exact values of the integrals in (11.7b) cannot be calculated. When this is the 
case, their approximate values must be calculated by one of the formulas given in 19.3, p. 963. The 


linear system of equations (11.7c) contains n equations for the unknown values Aj,..., An: 
(1 = Aci) Ai —Ac12 Ag == ag —XCin An = bi 5 
--λοοι A1 +(1 — Àc223) 42 — ... — ÀC Ån = b2, 
21 Ay +( 22) Ag 2 2 (11.74) 
--λομι Αι -λόπο Αο-- ... ΕΙ -- λοππ) Απ = bn 


3. Analyzing the Solution, Eigenvalues and Eigenfunctions 
It is known from the theory of linear systems of equations that (11.7d) has one and only one solution 
for A1,..., A, if the determinant of the matrix of the coefficients is not equal to zero, i.e., 


(1— Ae) --λειο ... —XCin 
--λο 1— Ac T — ÀCon, 

D(A) = a ( 22) EI (11.8) 
-λομι -λόπο ... (l— Aena) 


Obviously D(A) is not identically zero, as D(0) = 1 holds. So there is a number R > 0 such that 
D(A) z: 0if |A| « R. For further investigation two different cases are considered. 
Case D(A) z 0: 
The integral equation has exactly one solution in the form (11.6c), and the coefficients A1,..., A; are 
given by the solution of the system of equations (11.7d). If (11.4a) is a homogeneous integral equation, 
i.e., f(x) = 0, then bj — b — ... — b, — 0. Then the homogeneous system of equations (11.7d) has 
only the trivial solution Ay = Ap =... = A, = 0. In this case only the function (x) = 0 satisfies the 
integral equation. 
Case D(A) = 0: 
D(A) is a polynomial of no higher than n-th degree, so it can have at most n roots. For these values 
of À the homogeneous system of equations (11.7d) with b} = by = ... = b, = 0 also has non-trivial 
solutions, so besides the trivial solution y(x) = 0 the homogeneous system of equations has other 
solutions of the form 

p(a) = C- (Ajay (") + Agae(%) +...+ Andn(x)) (C is an arbitrary constant.) 
Because o4(x),..., o (x) are linearly independent, y() is not identically zero. The roots of D(A) 
are called the eigenvalues of the integral equation. The corresponding non-vanishing solutions of the 
homogeneous integral equation are called the eigenfunctions belonging to the eigenvalue A. Several 
linearly independent eigenfunctions can belong to the same eigenvalue. If the integral equation has a 
general kernel, then are be to considered all values of À eigenvalues, for which the homogeneous integral 
equation has non-trivial solutions. Some authors call the A with D(A) = 0 the characteristic number, 


1 b 
and p= 3 is called the eigenvalue corresponding to an equation form jup(x) — Í K (v, y)e(y) dy. 
Ja 


4. Transposed Integral Equation 

Now it is necessary to investigate the conditions under which the inhomogeneous integral equation will 
have solutions if D(A) = 0. For this purpose the transposed integral equation (or adjoint in the complex 
case) of (11.4a) is introduced: 


p(x) = g(x) + A f Kio) dy. (11.9a) 


Let À be an eigenvalue and y() a solution of the inhomogeneous integral equation (11.4a). It is easy 
to show that A is also an eigenvalue of the adjoint equation. Now multiply both sides of (11.4a) by any 
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solution (x) of the homogeneous adjoint integral equation and evaluate the integral with respect to x 
between the limits a and b: 


[Ῥων x) dx - fre )ψ(α) ἀν ZI [i 2, y) (x η p(y) dy. (11.9b) 


Assuming that ¢(y) = f K (x, y) (x) dz, then f” f(x)w(x) dz = 0 holds. 


That is: The inhomogeneous integral equation (11.4a) has a solution for some eigenvalue A if and only 
if the perturbation function f(x) is orthogonal to every non- vanishing solution of the homogeneous ad- 
joint integral equation belonging to the same A. This statement is valid not only for integral equations 
with degenerate kernels, but also for those with general kernels. 


41 
m A: le) = κο ο 2, al) = z, asle) = -z Al) = 
y, Baly) = y, Bzly) = y. The functions az(x) are linearly dependent. Therefore one transforms the 
+1 
integral equation into the form y(x) = x + M [a?y -- z(y? — y)|e(y) dy. For this integral equation 


αι(α) -- αἲ, αρα) =x, fh(y) — y, Bo(y) — y? — y holds. If any solution p(x) exists, it has the form 
φί(α) μμ + Ag x 


+1 2 +1 2 
amf 2an anf dan, nof tanh, 


























2 2 2 
C21 E (ο΄ a?) da 8" C22 fv. (a? ax?) da 3 by 1 (z? ax?) da 3 
With these values the system of equations to determinate A; and As has the form: A1 — 24» — 
2 2 2 2 1 2 
3? -g^ + (1 + 5) Ag = a which in turn yield that A, = i Ag = -7 and y(x) = z + 
10, 2 105 5 


g= =g = =g FT: 


m^ T OU T7 
WB B: g(x) =2+A [ sin(x + y)y(y) dy, i.e.: K(x, y) = sin(x + y) = sin x cosy + cos xsin y, y(x) 
Jo 


—a Asin [^ cosye(y) dy cose γ᾽ sin y p(y) dy. 
0 0 








ene [ sinrcosrdr —0, Cy = i cos? x dx = a by = [ xcosxdx = —2, 
Jo Jo 2 Jo 
a= f sin? zdr = 2, on = | cos x sin x da: — 0, b= | esinedr =. 
0 2 0 0 
π π 
With these values the system (11.7d) is Ay — Li = —2, AGA + A = T. It has a unique solution 
2 
π π 
ὶ 1 AZ| aT Ay 8 n(1— A) 
for any \ with D(A) = T Ξ1-λ-- μα 80 ΑἹ -- -----ῃ, As — —— ——5, and 
“Az 1 1-422 1—22 
τ 4 4 
λ 2 
the solution of the integral equation is y(a) = «4 5 [e 2) sinz 4- z(1— A) cosa}. The 
1. yom 2 
. ] : 2 2 
eigenvalues of the integral equation are Ay = —, λο----. 
π π 


T 
The homogeneous integral equation p(x) = Ax / sin(r 4 y)g(y) dy has non-trivial solutions of the 
0 
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2 
form ye(a) = Ag(Arsina + Agcosa) (k = 1,2). For 4; — — holds A1 — Αρ, and with an arbitrary 
T 


2 

constant A it follows yı(x) = A(sin x + cos x). Similarly for åg = —— holds y2(x) = B(sin x — cos x) 
T 

with an arbitrary constant B. 


Remark: This previous solution method is fairly simple but it only works in the case of a degenerate 
kernel. This method can be used to get a good approximate solution in the case of a general kernel too 
if it is possible to approximate the general kernel by a degenerate one closely enough. 


11.2.2 Successive Approximation Method, Neumann Series 


1. Iteration Method 


Similarly to the Picard iteration method (see 9.1.1.5, 1., p. 549) for the solution of ordinary differential 
equations, an iterative method needs to be given to solve Fredholm integral equations of the second 
kind. Starting with the equation 


b 
p(x) = f(x) A f KG. )eQ) dv, (11.10) 
one defines a sequence of functions yo(x), yi(x), Yo(x),.... Let the first be wo(a) = f(x). The subse- 


quent y,(x) can be get by the formula 


b 
ο ο Οσο pole) = Ὁ (1.19) 
Following the given method the first step is 
b 
vir) = f(a) +A f Klev) f) dv. (11.11b) 


According to the iteration formula this expression of (y) is substituted into the right-hand side of 
(11.10). To avoid the accidental confusion of the integral variables, y is denoted by 7) in (11.11b). 








b b 
palz) = f(x) +A f Kley) f(y) +A KG. fo) a dy (1.116) 
* ° bb 
= f(x)+ af K(z,y) fly) dy +A? [[ KeK dy dn. (11.11d) 


b 
Introducing the notation of K4(r, y) — K(x, y) and Ko(r, y) — Í K (x, €) K (£, y) d£, and renaming η 


as y, one can write p(x) in the form 


b b 
pala) = F(E) +A f Kaley) Fly) dy + f Kolo.) Fy) dy. (11.116) 


Denoting 


b 
(z, y) = | KG 2,£)Kya(E,y) dé (n= 2,3,...) (11.11) 
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then there is the representation of the n-th iterated yn (£x): 
b b 
pala) = F(E) +A f Kiley) fl) dy + -+X f Kaley) F(U) dy. (11.118) 


a 
Ky (c, y) is called the n-th iterated kernel of K(x, y). 
2. Convergence of the Neumann Series 
To get the solution (x), it is to be discussed the convergence of the power series of À 


E b 
FE) +Y f Kaley) fO) dy (11.12) 
n=l a 


which is called the Neumann series. If the functions K (x, y) and f(x) are bounded, i.e., the inequalities 
|K(z,y)| <M (a<xr<b, a<y<b)and |f(z) « N (a x € b), (11.13a) 
hold, then the series 


N 3 |AM(b— a)l" (11.13b) 
n=0 


is a majorant series for the power series (11.12). This geometric series is convergent for all 


1 
A| € LÁ. 
Al M(b — a) 
The Neumann series is absolutely and uniformly convergent for all values of A satisfying (11.13c). By 
a sharper estimation of the terms of the Neumann series one can give the convergence interval more 
precisely. According to this, the Neumann series is convergent for 


1 


bb : 
"TH K (a, y)|? dx dy 


This restriction for the parameter A does not mean that there are no solutions for any |A| outside the 
bounds set by (11.13d), but only that one cannot get it by the Neumann series. The expression 


(11.13c) 


|A| < (11.13d) 





D(z,y; A) = 35A Ks (x, y) (11.14a) 


n=1 


is called the resolvent or solving kernel of the integral equation. Using the resolvent one gets the solution 
in the form 


v(x) = f(x) + Af I'(z,y; 3) f (y) dy. (1.145) 


1 
W Fortheinhomogeneous Fredholm integral equation of the second kind e(x) = «+A / xy p(y) dy fol- 
0 


and from 





. 1 1 
lows Ki(r, y) — zy, Ko(x,y) = / znny dy = 2zy, Ks(s,y) — gw. Kaley) = 3 
0 
oo AP 
this F(x,y; À) = xy (Σ τ) . With the limit (11.13c) the series is definitely convergent for |\| < 1, 


n=0 
Ty 
(5) 
με 
9 


because 





K(x,y)| € M = 1 holds. The resolvent I(x, y; A) = is a geometric series which is 
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y? Ὁ 
τ η". 

E 1—- 
ΠΝ 


Remark: If for a given A the relation (11.13d) does not hold, then any continuous kernel can be de- 
composed into the sum of two continuous kernels K(x, y) = K(x, y) + K?(z, y), where Kl(r, y) is a 
degenerate kernel, and K?(x, y) is so small that for this kernel (11.13d) holds. In this way one has an 
exact solution method for any À which is not an eigenvalue. 


11.2.3 Fredholm Solution Method, Fredholm Theorems 
11.2.3.1 Fredholm Solution Method 


1. Approximate Solution by Discretization 
A Fredholm integral equation of the second kind 





convergent even for |A| < 3. Thus from (11.14b) follows y(x) — x + À [ 


p(x) = fla) +A { Κία.υλρ(ν) dy (11.15) 


can be approximately represented by a linear system of equations. It should be assumed that the func- 
tions K (r, y) and f (x) are continuous for a € x € b, a € y € b. 

The integral in (11.15) should be approximated by the so-called left-hand rectangular formula (see 
19.3.2.1, p. 964). It is also possible to use any other quadrature formula (see 19.3.1, p. 963). An equidis- 
tant partition 








b— 
yk —a-c-(k—1)h (k-—1,2,-.,m; h— E (11.164) 
n 
yields the approximation 
pla) ~ f(x) - Ah [K& (z, y1)g(yi) +... + K(x, yn) e(Yn)) - (11.16b) 
Replacing p(x) in this expression by a [μίας p(x) exactly satisfying (11.16b) yields: 
D(a) = fle) + MLK (2, s oua) +... +E (@, ψε)φίψα)]. (1.165) 
To determine this approximate solution, it is necessary to know the substitution values of p(x) at the 
interpolation nodes x, = a+ (k — 1)h. Substituting « = x4, x — 23,..., x — Lp into (11.16c), yields 
a linear system of equations for the required n substitution values of P(x). Using the short-hand 


notations 


Ki = Klaj, y), or = Plan), fe = Fae) (11.17a) 
this system has the form 
α = AhK 1) 1 — Ah. K15 Qa— ... —AhK in Pn = fi, 
— Ah. Ko, Q1 +(1 = AhK2)~2 ES e —AhKon Pn = fa. (11.176) 
—AhK, qi —AhK 2 — ... +(1— ARKan) Gn = fn- 
This system has the determinant of the coefficients 
(1 = Ah) — Ah K15 bés —AhKin 
Dn(A) . —Ah Ka α = AhK 2) ο — Ah Ko, . (11.170) 
—AhK, —AhKn2 ... (1— Ah Ks) 


This determinant has the same structure as the determinant of the coefficients in the solution of an 
integral equation with a degenerate kernel. The system of equations (11.17b) has a unique solution 
for every \ where D,(A) 4 0. The solution gives the approximate substitution values of the unknown 
function y (x) at the interpolation nodes. The values of A with D,(A) = 0 are approximations of the 
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eigenvalues of the integral equations. The solution of (11.17b) can be written in quotient form (see 
Cramer rule, 4.5.2.3, p. 311): 

DE(A 

Pk = nl ) 

D4(A) 

Here follows DF (A) from D,(A) by replacing the elements of the k-th column by fi, f2,..., fs. 


2. Calculation of the Resolvent 
If n tends to infinity, so does the number of rows and columns of the determinants DF(A) and D,(A), 
as well. The determinant 





S plti) k-l,..,n. (11.18) 


D(A) = Jim D,() (11.19a) 
is used to get the solution kernel (resolvent) T (x,y; A) (see 11.2.2, p. 625) in the form 
D(x, y; A) 
Γία,ψ;λ) -- ------- 11.191 
(esa) = RE (11.190) 


It is true that every root of D(A) is a pole of I(x, y; ^). Exactly these values of A, for which D(A) = 0, 
are the eigenvalues of the integral equation (11.15), and in this case the homogeneous integral equation 
has non-vanishing solutions, the eigenfunctions belonging to the eigenvalue A. In the case of D(A) 4 0, 
knowing the resolvent I(x, y; A), an explicit form of the solution is: 


b b 
λ 
ola) = F(E) +A f Plew AA) dy = Fa) + izy | PEAU) dv- (11.190) 
To get the resolvent, one needs the power series of D(x, y; A) and D(A) with respect to A: 


Disi. 25 CU" Kan, y) N* (11.20) 
D(A) 3 (-1yd,- A 
n=0 


where dy = 1, Ko(x,y) = K(ax,y). One gets the further coefficients from the recursive formula: 





Γία, μ; λ) -- 


b b 
d; zd faa) dx, Kn(x,y) = K(a,y)+ dn - [ K@,0K alt, y) dt. (11.20b) 
n 


ΒΑ: oc(r) 2 sinz A [ * sing cos y y(y) dy. The exact solution of this integral equation is 
Jo 





2 
(x)= σα sinz. Forn — 3 with z4 —0, πο τ z3 = τ h= x gives 


























3 
1 0 0 
D(A) = aX -ᾱ 1 Vm)" Xm VSR 2 aang 
aes |e 2 24 24 192 12 ^ Wim ^ 
Υ3λπ 3λπ Υ3λπ 
12 24 24 


is an approximation of the exact eigenvalue \ = 2. From the first equation of the system of equations 
(11.17b) for f, — 0 follows the solution v, — 0. Substituting this result into the second and third equa- 


EP: AT 1 3AT Tg ν3λπ 
2. | 93499 9 Ty A? 24 | 8 
1 v3 


, P3 = . IfA = 1, then yı = 0, p2 = 


tion gives the system of equations: (: 











3 ; : 
a This system has the solution y2 
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0.915, p3 = 1.585. The substitution values of the exact solution are: y(0) = 0, y (=) -1,φ (2) = 
1.732. 


In order to achieve better accuracy, the number of interpolation nodes needs to be increased. 


1 1 
ΒΒ: (0) -τ- af (day — x?)p(y) dy; do =1, Ko(z,y) 2 4vy — à?, dy = j 32? dx — 1, 
0 0 














l 4 4 1 i it 
Ki (x,y) = Avy — a? 1 (4at — 2?) (4ty —#?) dt = a 1-20 -- -ᾱ--- -αγ, de / Μ(α, Ὁ) ἐσ ----, 
0 9 3 2 Jo 18 


1 1 
Ko(x,y) = ig v) - / K (x, t) Ki(t, y) dt — 0. With these the values da, Ks(v, y) and all the fol- 











4 4 
Ary — 2? — ls t 2x?y — 2x? — L| A 
lowing values of dp and K(x, y) are equal to zero. T(x, y; A) a 3 j 
1-A+— 
: + 18 
From 1— À+ 18 ^ 0 the two eigenvalues Ay. = 9 + ὃν τ follow. If À is not an eigenvalue, then the 
1 x(2\ — 3Ax 
solution is y(x) = x + af I(x, y; A) f(y) dy = — : 
11.2.3.2 Fredholm Theorems 
For the Fredholm integral equation of the second kind 
b 
ele) = Fla) +A f Kle, yo) dy (11.21a) 
the correspondent transposed integral equation is given by 
b 
(x) = g(x) + / K (y, x) (y) dy. (11.21b) 


For this pair of integral equations the following statements are valid (see also 11.2.1, p. 622). 

1. A Fredholm integral equation of the second kind can only have finite or countably infinite eigenval- 
ues. The eigenvalues cannot accumulate in any finite interval, i.e., for any positive R there are only a 
finite number of À for which |A| < R. 


2. If A is not an eigenvalue of (11.21a), then both of the inhomogeneous integral equations have a 
unique solution for any perturbation function f(a) or g(a), and the corresponding homogeneous inte- 
gral equations have only trivial solutions. 

3. If A is an eigenvalue of (11.21a), then A is also an eigenvalue of the transposed equation (11.21b). 
Both homogeneous integral equations have non-vanishing solutions, and the number of linearly inde- 
pendent solutions are the same for both equations. 

4. For an eigenvalue \ the homogeneous integral equation can be solved if and only if the perturbation 
function is orthogonal to every solution of the homogeneous transposed integral equation, i.e., for every 
solution of the integral equation 


b b 
p(z) = af K (y, x)w (y) dy, (11.22a) / f(x)v(x)dr — 0 holds. (11.22b) 


The Fredholm alternative theorem follows from these statements: Either the inhomogeneous integral 
equation can be solved for any perturbation function f(x) or the corresponding homogeneous equation 
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has non-trivial solutions. 


11.2.4 Numerical Methods for Fredholm Integral Equations ofthe 
Second Kind 


Often it is either impossible or takes too much work to get the exact solution of a Fredholm integral 
equation of the second kind 


p(x) = fa) + [ K(w,y)olu) dy (11.23) 


by the solution methods given in 11.2.1, p. 622, 11.2.2, p. 625 and 11.2.3, p. 627. In such cases certain 
numerical methods can be used for approximation. Three different methods are given below to get the 
numerical solution of an integral equation of the form (11.23). 


11.2.4.1 Approximation of the Integral 


1. Semi-Discrete Problem 
Working on the integral equation (11.23) often one replaces the integral by an approximation formula. 
These approximation formulas are called quadrature formulas. They take the form 


b n 
f iede = Qual f) = E wef (a), (11.24) 
z k=1 


i.e., instead of the integral there is now a sum of the substitution values of the function at the inter- 
polation nodes x, weighted by the values cw. The numbers wj should be suitably chosen (so as to be 
independent of f). Equation (11.23) can be written in the approximate form: 


p(x) 54 f(r) + AQ (IK (v, )](-)) — f(x) Ελ S wy K (x, yx)ouk)- (11.25a) 


k=1 
The quadrature formula Qja4j(K (2, -)y(-)) also depends on the variable x. The dot in the argument of 
the function means that the quadrature formula will be used with respect to the variable y. Defining 
the relation 


τι 
P(x) = f(x) HAY? wn Kk (x, yk) gu). (11.25b) 
k=1 
P(x) is an approximation of the exact solution y(x). One considers (11.25b) as a semi-discrete problem, 
because the variable y is turned into discrete values while the variable x can still be arbitrary. 
If the equation (11.25b) holds for a function p(x) for every x € [a,b], it must also be valid for the 
interpolation nodes x = xz: 


n 
Plrk) = f(k) AX wK (zr y Plu) ε--1.3,...νη. (11.25c) 

j=l 
This is a linear system of equations containing n equations for the n unknown values P(x). Substituting 
these solutions into (11.25b) yields the solution of the semi-discrete problem. The accuracy and the 
amount of calculations of this method depend on the quadrature formula used. For example using the 
left-hand rectangular formula (see 19.3.2.1, p. 964) with an equidistant partition y, = x, = a 4- h(k — 

1, h—(b—a)/n, (k — 1,...,n) yields: 


b n 
| κα.υρίώάν e Y hi Gs yeu (11.263) 
ἃ k=1 


With the notations 
Kir = Κίαγ νυν), fe = ας). vx wu) (11.26b) 
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the system (11.25c) has the form: 


(1 E AhlKGai)i — Ah. K15 Q3— ... —Ah Ki, Pn = fi 
—AhKy φι +(1 = AhK 202) E a —AhKon Pn = fa. (11.360) 
—Ah Kg qi —AhKn2 p2— ... +(1— AhKnn)Pn = fn- 


The same system was involved in the Fredholm solution method (see 11.2.3, p. 627). As the rectangular 
formula is not accurate enough, for a better approximation of the integral one can increase the number 
of interpolation nodes, along with an increase in the dimension of the system of equations. Hence one 
gets the idea of looking for another quadrature formula. 


2. Nyström Method 
In the so-called Nyström method the Gauss quadrature formula is to be used for the approximation of 
the integral (see 19.3.3, p. 965). In order to derive this, one considers the integral 


r= f f) dz. (11.272) 


The integrand is replaced by a polynomial p(x), namely the interpolation polynomial of f(x) at the 
interpolation nodes xz: 


ρα) = x Ly(x) f (xx) with 


(z—21) ... (z zy 1i)(Y — xg) ... (2 — Tn) 
(zy — 21)... (Xy — zy i) (ux — za). (z& — v4) 
For this polynomial 

p(rx) — f(zxx), k — 1,...,n. (11.27c) 
holds. The replacement of the integrand f(x) by p(x) results in the quadrature formula 





Ly(x) = (11.27b) 


b b n b | b 
z)dz tz z)dz = i L(x) dx = wef (ap) with wp = | Dp(x) dx. (11.27d 
[ fe) fro Lre f s(a) de = ¥en lee) with oa [ tale) (11.274) 


For the Gauss quadrature formula the interpolation nodes cannot be chosen arbitrarily but they must 
be chosen by the formula: 
a+b b-a 

+ 

2 2 

The n values t, are the n roots of the Legendre polynomial of the first kind (see 9.1.2.6, 3., p. 565) 

1 d"[(t? — 1)”| 

aren! dt” 
These roots are in the interval [—1,+1]. The coefficients w, can be calculated by the substitution 








te, k=1,2,...,n. (11.28a) 


Tk = 





P,(t) = (11.28b) 


b— 
T— Tk = zu — f), so: 





- --- 1f (t-t)...(t- ta) tg)... (£5) 
x [ Tas) de (è 3) (th — tr)... (te — tha) (te — tei) --. (tk — tn) " 
— (b— a)A;. (11.29) 


In Table 11.1 are given the roots of the Legendre polynomial of the first kind and the weights Aj for 
SB ace: 
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Table 11.1 Roots of the Legendre polynomial of the first kind 

n t A n t A 
il t2 0 41-1 5 t; — —0.9062 A1 — 0.1185 
2 ty = —0.5774 A; = 0.5 t2 = —0.5384 A» — 0.2393 
t2 = 0.5774 A> = 0.5 t= 0 As = 0.2844 
3 tı = —0.7746 A, = 0.2778 t4= 0.5384 A4 — 0.2393 
t= 0 A» — 0.4444 ts = 0.9062 As = 0.1185 
13 — 0.7746 A3 = 0.2778 6 14 — —0.9324 A, = 0.0857 
4 t; — —0.8612 A; = 0.1739 to = —0.6612 Ay = 0.1804 
19 — —0.3400 Ay = 0.3261 tz = —0.2386 Aa — 0.2340 
tz = 0.3400 As — 0.3261 t4-— 0.2386 A4 — 0.2340 
ty = 0.8612 Ag = 0.1739 ts = 0.6612 As = 0.1804 
te = 0.9324 Ag = 0.0857 

E Solve the integral equation y(x) = cos rx + απο (εὖ --- 1) -- J - e™ p(y) dy by the Nystróm method 


for n = 3. 
τοι oe = 0.1127, 22 0.5, 13 — 0.8873, 


Ai—0.2778, — 4g — 04444, A; = 0.2778, 
fi = 0.96214, f» — 0.13087, fs = —0.65251, 
Ay, = 1.01278, Koo = 1.28403, — K33 = 2.19746, 
{12 = Ky = 1,05797, Aig = Ka, = 1.10517, Ko3 = K32 = 1.55838. 
The system of equations (11.25c) for y1, Y2, and vs is 
0.71864; — 0.47016y2 — 0.307023 = 0.96214, 
—0.29390y1 + 0.42938 p2 — 0.43292y3 = 0.13087, 
—0.30702p1 — 0.69254 y_ + 0.38955y3 = —0.65251. 


The solution of the system is: gy; = 0.93651, 2 = —0.00144, v3 = —0.93950. The substitution values 
of the exact solution at the interpolation nodes are: (x1) = 0.93797, (£2) =0, (£3) = —0.93797. 


11.2.4.2 Kernel Approximation 
Replace the kernel K(x, y) by a kernel K(x, y) so that K(x, y) ~ K(x,y) fra <x <b,a<y<b. 
Try to choose a kernel making the solution of the integral equation 


Ple) = Fle) +A | Kle, yP) dy (11.30) 


the easiest possible. 
1. Tensor Product Approximation 
A frequently-used approximation of the kernel is the tensor product approximation in the form 


K(a, y) & K(z,y) =X X dir aj(£)bk(y) (11.31a) 
j=0 k=0 
with given linearly independent functions og(z), .. ., o, (r) and fo(y),.... Baly) whose coefficients d;;. 


must be chosen so that the double sum approximates the kernel closely enough in a certain sense. 
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Rewriting (11.31a) in a degenerate kernel gives: 


X jx: dj Bi ( v| aw = Y duel) = Youle) (a)6;(y). (11.31b) 


Now, the ο... method 11.2.1, p. 622 can be used for the integral En A 


P(x) = f(x) + Af E sei) Py) dy. (11.316) 


Functions ag(z), ... o (x) and fo(y), .. ., &5(y) should be chosen so that the coefficients dj; in (11.31a) 
can be calculated easily and also that the solution of (11.31c) isn't too difficult. 

2. Special Spline Approach 

One chooses 





l-n 








n τ nc (11.32) 


0 otherwise 


an (x) = B(x) = 


for a special kernel approximation on the interval of integration [a, b] — [0,1]. The function o; (x) has 


k-1 k+1 
non-zero values only in the so called carrier interval (Ez ; n , (Fig. 11.1). 
n n 


To calculate the coefficients dj; in (11.31a), consider K (x, y) 
(x) at the points x = l/n, y = i/n (l,i = 0,1,...,n). Then 


l i 1 forj=l, k=i, 
2j B Ay (^) = b otherwise (11.33) 


holds, and consequently K (l/n, i/n) = du. Hence, one sub- 


MT Dd 
stitutes du = K E 3 =K (G , j Now (11.31a) has 





nn n n 





the form 


Figure 11.1 K(z,y) = 22, «(1 η) αγ(α)θκ(). (11.34) 


As known, the solution of (11.31c) has the representation 


p(z) = f(x) + Αραρ(α) +... + Anan(z). (11.35) 
The expression Apag(x) +... + Arale) is a piecewise linear function with substitution values A; at 
the points xp = k/n. Solving (11.31c) by the method given for the degenerate kernel, gives a linear 
system of equations for the numbers Ao, ..., A4: 


(1 — Acoo) Ao -λο Αἱ -- ... --λόρι An — bo, 
—Ac19 Ap 4- (1 = Aci Ai) = ub --λόιη Απ = bi, (11.362) 
--λ6η0 Ap -λοπι Αι τ (1 = Cnn) A = Dii; 
where 
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y 1 1 
_x(i 9 jn 4 ; 
Ξ (2.7) j ag(r)og (x) da 4- . ex (2, z) fo α)ακ(ᾳ) da. (11.365) 
0 0 
For the integrals it holds 1 
-- for j=0,k =Oandj=n,k=n, 
3n 
1 2 . ; 
— for j=k,1l<j<n, 
Ik= fuaa) dy — 4 3n P ο. (11.969) 
1 
τ — for j=k+1,j=k-1, 
6n 
0 otherwise. 


The numbers bg in (11.36a) are given by 


1 ς 
by = EO S(t) αι(α)] dz. (11.36d) 
1 fà Ann 


Taking a matrix C with numbers cj; from (11.36a), a matrix B with the values K(j/n, k/n) and a 





matrix A with the values /;; respectively, a vector b from the numbers bo, . . . , b, and a vector a from 
the unknown values Ao, ... , A;, the system of equations (11.36a) has the form 

(I— AC)a — (I — ABAJa — b. (11.966) 
In the case when the matrix (I— ABA) is regular, this system has a unique solution a = (Ao,..., An). 


11.2.4.3 Collocation Method 


Suppose the n functions y1(x),..., n(x) are linearly independent in the interval [a,b]. They can be 
used to form an approximation function P(x) of the solution y(x): 

p(z) ~ p(z) = αιφι(α) + agye(x) +... + augu (x). (11.37a) 
The problem is now to determine the coefficients a1,...,a4. Usually, there are no values ay1,..., Gn 
such that the function (x) given in this form represents the exact solution y(x) = (x) of the integral 
equation (11.23). Therefore, n interpolation points z4, ... , 2, are defined in the interval of integration, 
and it is required that the approximation function (11.372) satisfies the integral equation at least at 
these points: 


(xx) = api (te) +... + ann (xx) (11.37b) 
b 
— fu) A | Klen) larypily) +. -. +anpnly)] dy (Kk =1,...,n). (11.37c) 
With some transformations this system of equations takes the form: 


joco {Καν νι) al asso ponen) — A f KGris env | an 





= flt E= len) (11.37d) 
Defining the matrices 
yi(21) +++ Yn(x1) Bu +++ Bin b 
Ep í 0 Bob 0: p wthéa-[ KGssya()dy (13το 
1629) "eee PnlTn) Bri VES Bos m 


and the vectors 
a=(ay,...,a,)', b=(f(21),..-, f(tn))'- (11.371) 
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then the system of equations to determine the numbers a1,..., a; can be written in matrix form: 


(A—AB)a-b. (11.378) 


T 1 
E p(x) = B + | Ty e(y) dy. The approximation function is g(z) = a,x? + agr + a3, yi(x) = 


z?, qo(x) — v, qa(x) — 1. The interpolation nodes are z; — 0,9 — 0.5, 3 — 1. 
001 ΕΠ 0 
i i v2 v2 v2 ET 
Ἂν ΒΞ Fe 3 ae p» 
i1 2 m. - τ 
T 5 3 2 
The system of equations is 
a3 = 0, 
1 v2 1 v2 v2 1 
ro e T ag Sa EE ishare 
(; EIE =) aa +( 3 az 2J8' 
5 4 3 ài 1 1 
=a τα 408 — = 
τα 5? 3 8 z’ 


whose solutions are a? — —0.8197, a5 — 1.8092, a3 — 0 and with these g(x) — —0.8197 £? + 1.8092 x, 
and so (0) — 0, q(0.5) = 0.6997, B(1) = 0.9895. 

The exact solution of the integral equation is y(x) = yx with the values y(0) = 0, y(0.5) = 0.7071, 
y(1) =1. 

In order to improve the accuracy in this example, it is not a good idea to increase the degree of the 
polynomial, as polynomials of higher degree are numerically unstable. It is much better to use different 
spline approximations, e.g., a piecewise linear approximation G(x) = a1yi(x) +a2y2(x) +: ++ +anYn(x) 
with the functions introduced in 11.2.4.2 

k k—1 k+1 


for <g< 
n n n 











φιία) Ξ 
0 otherwise. 
In this case, the solution y(x) is approximated by a polygon p(x). 
Remark: There is no theoretical restriction as to the choice of the interpolation nodes for the colloca- 


tion method. In the case, however, when the solution function oscillates considerably in a subinterval 
the number of interpolation points in this interval should be increased. 


11.3 Fredholm Integral Equations ofthe First Kind 
11.3.1 Integral Equations with Degenerate Kernels 


1. Formulation of the Problem 
Consider the Fredholm integral equation of the first kind with degenerate kernel 
b 
f(z) = [awa +... +an(t)Ba(y)) py) dy (c< <d), (11.38a) 
and introduce the notation similar to that used in 11.2, p. 622, 
b 


A; = [ 80e) y (j =1,2,...,n). (11.38b) 


a 
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Then (11.384) has the form 


f(x) 2 Aion (x) - ... - Anon(x), (11.38c) 
i.e., the integral equation has a solution only if f(x) is a linear combination of the functions o (x), ..., 
om (x). If this assumption is fulfilled, the constants Ay,..., An are known. 


2. Initial Approach 
Looking for the solution in the form 
φ(α) -Ξ 1 Bi (2) +... + Ga Bn(x) (11.39a) 


where the coefficients c),...,¢, are unknown, substituting in (11.38b) 


b b 

Ai = cı f BB) dy... es [ G8.) dy (i—1,2,...,n), (11.39b) 
and introducing the notation 

b 

Ky = / Bi(y)Bs Qu) du (11.390) 
gives the following system of equations for the unknown coefficients c, ..., c: 

Kyuig +...+ Kincen A, 

: : : (11.39d) 
γιοι auis E Kimen = An. 





3. Solutions 

The matrix of the coefficients is non-singular if the functions f4(y), ..., B,(y) are linearly independent 
(see 12.1.3, p. 656). However, the solution obtained in (11.39a) is not the only one. Unlike the integral 
equations of the second kind with a degenerate kernel, the homogeneous integral equation belonging 
to (11.38a) always has a non-trivial solution. Suppose y"(x) is such a solution of the homogeneous 
equation and y(z) is a solution of (11.38a). Then (x) + y" (2) is also a solution of (11.38a). 

To determine all the solutions of the homogeneous equation, consider the equation (11.38c) with f(x) — 


0. If the functions a3 (x), ... , o, (x) are linearly independent, the equation holds if and only if 
b 
A;— [Bet dy - 0. G — 12,0), (1140) 


i.e., every function q^" (y) orthogonal to every function £(y) is a solution of the homogeneous integral 
equation. 


11.3.2 Analytic Basis 


1. Initial Approach 
Several methods for the solution of Fredholm integral equations of the first kind 


b 
fe) = [ K(ayely)dy (CS <d) (11.41) 


determine the solution y(y) as a function series of a given system of functions (8n(y)) = {81(y), Ge(y), 
...}, Le., looking for the solution in the form 


e) = esil) (11.42) 


where there are to determine the unknown constants c;. Choosing the system of functions it is to be 
considered that the functions (/,(y)) should generate the whole space of solutions, and also that the 
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calculation of the coefficients c; should be easy. 

For an easier survey only real functions are discussed in this section. All of the statements can be 
extended to complex-valued functions, too. Because of the solution method there are to establish cer- 
tain requirements for properties of the kernel function K (x, y) (see [11.3], [11.11], [11.12]). It is to be 
assumed that these requirements are always fulfilled. Next, there are to discuss some relevant informa- 
tion. 

2. Quadratically Integrable Functions 

A function (y) is quadratically integrable over the interval [a, 0] if 


b 
f wont ay « oo (11.43) 


holds. For example, every continuous function on [a, b] is quadratically integrable. The space of quadrat- 
ically integrable functions over [a, b] will be denoted by L?[a, b]. 

3. Orthonormal System 

Two quadratically integrable functions B;(y), 85(y), y € [a,b] are considered orthogonal to each other 
if the equality 


b 
J 88) dy =0 (11.44a) 


holds. A system of functions (8,(y)) in the space L?[a, ] is called an orthonormal system if the following 
equalities are true: 


b 
c 1 f PF $—17 
[ 68 - [1 m x (1.440) 


An orthonormal system of functions is complete if there is no function f(y) Z 0 in L?{a, b] orthogonal 
to every function of this system. A complete orthonormal system contains countably many functions. 
These functions form a basis of the space L?[a, b]. 'To transform a system of functions (f, (y)) into an 
orthonormal system (87 (y)) the Schmidt orthogonalization procedure can be used. This determines the 
coefficients 551,555, ... , 0,4 for n — 1,2,... successively so that the function 


Bn(y) = Do Ong By (y) (11.449) 
ja 


is normalized and orthogonal to every function ft (y), .... 05 4(y). 
4. Fourier Series 
If (8, (y)) is an orthonormal system and v(y) € L?[a, b], then one calls the series 


oo 


$ ibu) — vv) (11.450) 


ja 


the Fourier series of v(y) with respect to (8,(y)), and the numbers d; are the corresponding Fourier 
coefficients. Based on (11.44b) 


πο ο ο (11450) 


holds. If (8,(y)) is complete, the Parseval equality holds: 


b oo 
[wo ay |”. (11.459) 
a j=l 
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11.3.3 Reduction of an Integral Equation into a Linear System of 
Equations 

A linear system of equations is needed in order to determine the Fourier coefficients of the solution func- 

tion y(y) with respect to an orthonormal system. First, a complete orthonormal system (8,(y)),y € 


(a, b] is chosen. A corresponding complete orthonormal system (a (x)) can be chosen for the interval 
x € [c, d]. With respect to the system (a; (x)) the function f(x) has the Fourier series 


d 


fio(x) with f;— / oír) f (x) da. (11.463) 


c 


f(z) = 


Me 


ll 
Ri 


If the integral equation (11.41) is multiplied by o;(x) and the integral is evaluated for x running from 
c to d alone, one gets: 


d b 
- f [ KG netos) du dz 


ae 


= J [j K(x, y)ai(x) a} ply)dy (i=1,2,...). (11.46b) 


The expression in braces is a function of y with the Fourier representation 


[κι (z,y)os(z) dz — K;(y -Σ κο (y) with (11.46c) 
b d 
Ky = | | Kle, voile) BU) dx dy. 
With the Fourier series eat 
p(y) = Lal) (11.46d) 


= IDE ΠΡΙ )l« 


2 Ka fao) Jy)dy (i =1,2,...). (11.46e) 
Because of the orthonormal property (11.44b) the system of equations 
= Kye; (6 =1,2,...) (11.46f) 


holds. This is an infinite system of equations to determine the Fourier coefficients c4, c9, .... The matrix 
of coefficients of the system of equations 


Kyu KK 


Και Ko R3 +--+ 
K=] Ka Κω kar (41466) 


11.3 Fredholm Integral Equations of the First Kind 639 





is called a kernel matriz. The numbers f; and Kij (i,j — 1,2,...) are known, although they depend 
on the orthonormal system chosen. 


uL ima 
ο. [ Ee οἷῳ dy, 0 <a <7. The integral is considered in the sense of the Cauchy 
πο cosy — cosa 


principal value. As a complete orthogonal system can be used: 


1 2 2 
1. a(x) = πι a(z) = E (151,2. 2. B;(y) — ΠῚ (gj m2. i 


By (11.46d), the coefficients of the kernel matrix are 


- ™ sinysin jy : 
Ko; 1 =f i ix dy =0 (j =1,2,.-.), 
1 y 0 cosy —cosz * 6 νῷ 
sin iy cos iz 2 [τ T cosis 
Ky T i: sin y sin iy cos ia dx dy = ΣΙ saysmiyf / ae} dy (1—1,2,...). 
am Jo Jo iT 7? Jo Jo cos a 


COS jj — COs x COS jJ — COS £ 
For the inner integral the equation 














p —m"——- (11.47) 


[ cos iz d sin iy 
0 cosy— cosg siny 


0 forizj, 


2 π 
holds. Consequently K;; — —— / sin jy sin iy dy = i 
π Jo -1 for i=j. 


The Fourier coefficients of f (x) from (11.46a) are f; — / f(x)os(x) dz (i — 0,1,2,...). The system of 
0 


d diede, fay D 
Si Gee C2 1 
equations is 0-10 fa] fed. According to the first equation, the system can have any 


solution only if the equality fo = ie f (z)ao(x) dz — B Γ f(x)dr — 0 holds. Then c; 2 —f; (j = 
T Jo 


1,2,...), and y(y) --/25 f;sinjy — T UEM fü dx holds. 
dcs Sm 


COS y — cos x 


11.3.4 Solution ofthe Homogeneous Integral Equation ofthe First 
Kind 


If y(y) and y"(y) are arbitrary solutions of the inhomogeneous and the homogeneous integral equation 
respectively, i.e., 


b 
= f Kee) dy (11.484) and 0= [κ c, y)" (y) dy, (11.48b) 


then the sum y(y) + ¢"(y) is a solution of the inhomogeneous integral equation. Therefore there are 
to determine all the solutions of the homogeneous integral equation. This problem is the same as de- 
termining all the non-trivial solutions of the linear system of equations 


Y Ke; —0 (i—1,2,...). (11.49) 
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As sometimes this system is not so easy to solve, the following method can be used for the calculations. 
If there is a complete orthonormal system (a,(x)), take the functions 


d 
E [ Kent) dx (i—1,2,..). (11.504) 

If q^ (y) is an arbitrary solution of the homogeneous equation, i.e., 
[κ 1,y)o (y) dy 2 0 (11.50b) 


holds, then multiplying this equality by a;(a) and performing an integration with respect to x, gives 


b d b 
0= few [ KG. y)ay (2) dedy = [| ^99) dy (¢=1,2,..), (1.509) 


ie., every solution v" (y) of the homogeneous equation must be orthogonal to every function K;(y). 
Replacing the system (KK, (y)) by an orthonormal system (7(y)) and using an orthogonalization pro- 
cedure, instead of (11.50c) follows 


b 
j e" (y) K (y) dy — 0. (11.50d) 


Extending the system (K7(y)) into a complete orthonormal system, the conditions (11.50d) are obvi- 
ously valid for every linear combination of the new functions. If the orthonormal system (7(y)) is 
already complete, then only the trivial solution y”(y) = 0 exists. 
The solution system of the adjoint homogeneous integral equation can be calculated in exactly the same 
way: 

d 

f Kevela) dr =0. (11.50e) 


c 


l[* six ' 2 2n 
Li -f ————— — —e(y) dy — 0, 0 € v € s. An orthonormal system is: ao;(z) — 4| —siniz (i = 
0 π 


cos y — cos & 


21 [7" sinrsinir 2 Ἡ wm qe 1 
1,2,...), Kily) el sinzsmir μ, ΠΕΙ cos(i — 1)z — cos(i ἠ- 1)r du. Applying 


0 COSY — COST cos y — COS T 


21 ('sin(i — 1)y — sin(i 1 2 
(11.47) twice yields Ki(y) = -\/=5 (e v i ») d cosiy (i = 1,2,...). The 
π π 




















sin y 

system (€, (y)) is already an orthonormal system. The function Ko(y) — Vm completes this system. 
π 

Consequently the homogeneous equation has only the solution: y"(y) = c—= = é, (cis arbitrary). 


υπ 


11.3.5 Construction of Two Special Orthonormal Systems for a 
Given Kernel 


1. Preliminaries 

The solution of infinite systems of linear equations (see 11.3.3, p. 638) is not usually easier than the 
solution of the original problem. Choosing suitable orthonormal systems (a@,,(a)) and (6,(y)) one can 
change the structure of the kernel matrix K in such a way that the system of equations can be solved 
easily. By the following method two orthonormal systems can be constructed such that the coefficients 
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K;ij of the kernel matrix are non-zero only for i — j and i — j + 1. 


Using the method given in the previous paragraph, first two orthonormal systems (^ (y)) and (ab (x)) 
are to determine, i.e., the solution systems of the homogeneous and the corresponding adjoint homo- 
geneous integral equations respectively. This means that it can be given all the solutions of these two 
integral equations by a linear combination of the functions A^ (y) and a^ (x). These orthonormal sys- 
tems are not complete. By the following method these systems are to be completed step by step into 
complete orthonormal systems o;j(x), 8;(y) (j — 1,2,...). 


2. Procedure 
First a normalized function a(x) is determined, which is orthogonal to every function (o()). Then 


the following steps are performed for j = 1,2,...: 
1. Determination of the function 6;(y) and the number v; from the formulas 


d 

νιβι(η) -- f Keno) dx or (11.51a) 
a 

vipit) = f Kæ yole) de- ujb) GA D, (11.51b) 


so that v; is never equal to zero and §;(y) is normalized. Then 5;(y) is orthogonal to the functions 
((Θ'(ν)) &(v). .... B5: (v)). 


2. Determination of the function a;4;(2) and the number ju; from the formula 


b 
μιάγα(α) - | Κία.ν)βγ(ω) ἀν — vios). (11.516) 


There are two possibilities: 

a) pt; £0: The function a;41(x) is orthogonal to the functions ((ah(z), oa(z),..., oj(z)). 

b) 4; — 0: Then the function a;,;(x) is not uniquely defined. Here again there are two cases: 

bı) The system (lat 2)), αι(α),... «αγ(ο)) is already complete. Then the system (610), πο 
Bj(y)) is also complete, and the procedure is finished. 

bz) The system (ων 1)),o1(x),...,oj (ο) is not complete. Then again one chooses an arbitrary func- 
tion aj44 (x) orthogonal to the previous functions. 


This procedure is repeated until the orthonormal systems are complete. It is possible that after a cer- 
tain step the case b) does not occur during a countable number of steps, but the system of this countable 


number of functions ((ah(z)), oa(z),.. ;) is still not complete. Then again one can start the procedure 
by a function a(x), which is orthogonal to every function of the previous system. 


If the functions a; (a), 6;(y) and the numbers vj, jjj are determined by the procedure given above, the 
kernel matrix K has the form 





00 0-.- wm 0 0 
οκ ο... (m) jm g ... 
K=|0 0 K2. with  K"- ρω |. (11.52) 


0 pu vs 
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(m 


The matrices K” (m — 1,2,...) are finite if during the procedure µ] ) = 0 holds after a finite number 


of steps. They are infinite if for countably many values of j, i # 0 holds. The number of zero rows 


and zero columns in K corresponds to the number of functions in the systems (o^(x)) and (8^(y)). A 
very simple case happens if the matrices K™ contain one number νι) = Vm only, i.e., all numbers ij" 
are equal to zero. 

Using the notation of 11.3.3, p. 638, for the solution of the infinite system of equations under the as- 


sumptions f; — 0 for a;(x) € (ad (x)) it holds: 


fi h 
= fo ( : 
.- 7 r Bly) ¢ (BR), (11.53) 
arbitrary for f;(y)€ (85y)) - 
11.3.6 Iteration Method 
To solve the integral equation 
b 
f(a) = [Keane dy (c<a<d), (11.54a) 
starting with ag(r) — f(x) one determines the functions 
d b 
Baly) = f Ke noit) as (11.54b) and απ) - [ KG.) dv. (11.54c) 
for n = 1,2,.... If there is a quadratically integrable solution y(y) of (11.54a) then the following 
equalities hold: 
b bd 
[ 48. dy — | [PORK enone) da dy 
d 
= EO dr  (n-12,..). (1.544) 


Orthogonalization and normalization of the function systems (11.54b,c) give the orthonormal systems 
(a (x)) and (87(y)). Using the Schmidt orthogonalization method then 57 (y) has the form 


ROEE mhU) | (n—1,2,...). (11.546) 
j=l 


Now it is assumed that the solution y(y) of (11.54a) has the representation by the series 


pa) =È cnbi). (11.541) 
j=l 


In this case for the coefficients c, regarding (11.54d) 


b a b i d 
e = f OBO) dy = Z bri f ABO) dy = Z br | Λίσα 1ο) ἀ.. (11.548) 
a j=1 a j=l 


c 
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holds. To have a solution in the form (11.54f) the following conditions are both necessary and sufficient: 


d oo 
1. fw (x)? dx XL )da[, (1550) 2 X les «ov. (11.55b) 


11.4 Volterra Integral Equations 
11.4.1 Theoretical Foundations 


A Volterra integral equation of the second kind has the form 


p(x) = fla) + | Kle, yol) dy. (11.56) 


The solution function (a) with the independent variable x from the closed interval J = [a, b| or from 
the semi-open interval J = [a,0o) is required. There is the following theorem about the solution of 
a Volterra integral equation of the second kind: If the functions f(r) for x € I and K(v, y) on the 
triangular region x € I and y € [a,x] are continuous, then there exists a unique solution y(x) of the 
integral equation such that it is continuous for x € I. For this solution 


p(a) = f(a) (11.57) 


holds. In many cases, the Volterra integral equation of the first kind can be transformed into an equation 
of the second kind. Hence, theorems about existence and uniqueness of the solution are valid with some 
modifications. 


1. Transformation by Differentiation 
Assuming q(x), K (x, y), and K;(r, y) are continuous functions, the integral equation of the first kind 


f(x) - [κυρῶ dy (11.582) 
can be transformed into the form 
f(a) = K(a, x)p(x ΠΕ K(x, y)e(y) dy (11.58b) 


by differentiation with respect to x. If K(x, x) Z 0 for all x € J, then dividing the equation by K (x, x) 
gives an integral equation of the second kind. 


2. Transformation by Partial Integration 


Assuming that y(x), K(x, y) and K(x, y) are continuous, one can evaluate the integral in (11.58a) by 
partial integration. Substituting 


fewa = v(x) (11.59a) 


gives 


f(x) 


ll 

A 
~ 

5 
το 
= 
κ... 
p 
es 
= 


oz - [ (5,560) we ay 


(11.59b) 


ll 
A 
~ 
A 
8 
d 
Ἐ- 
= 
R 
AE 
| 
$e 
x 
~ 
8 
< 
—— 
NIIT uA 
e 
=A 
< 
os 
a 
REY 
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If K(x, x) Z 0 for x € I, then dividing by K(x, x) gives an integral equation of the second kind: 


| f(z) 1 f(a 

wo- EL ges ] (a9) ma (11.596) 
Differentiating the solution ~() yields the solution y(x) of (11.58a). 

11.4.2 Solution by Differentiation 


In some Volterra integral equations the integral vanishes after differentiation with respect to x, or it 
can be suitably substituted. Assuming that the functions K(x, y), Kz(x, y), and p(x) are continuous 
or, in the case of an integral equation of the second kind, (x) is differentiable, and differentiating 


1(α) - [tanen dy (11.604) or v(x) = f(x) + f iae) dy (11.60b) 
with respect to x yields 

fila) ^ K(z,zyo(z) + [5 -K(x,y) p(y) dy or (11.600) 

P(t) = f(a) + Κα, αγρ(α) -- a+ | a, Ks) e) dy (11.604) 


T 
1 
W Find the solution y(x) for x € n 5) of the equation [ cos(x — 2y)g(y) dy = 37 sing (I). Differ- 
Jo 
T 
1 
entiating it twice with respect to x gives y(x) cos z — Í sin(x — 2y)y(y) dy = 5 (sin x + x cos x) (IIa), 
Jo 
T 
1 
and y'(x) cosx — / cos(x — 2y)g(y) dy = cos x — z sin x (IIb). The integral in the second equation 
is the same as that in the original problem, so one can substitute it. This yields y(x) cos x = cos x and 


0,7). eG) =1, so y(#)=2£+C. 


To determine the constant C substitute x — 0 in (Ila) to obtain y(0) = 0. Consequently C — 0, and 
the solution of (I) is y(x) = 


because cos x z 0 for x € 





Remark: If the kernel of a Volterra integral equation is a polynomial, then one can transform the in- 
tegral equation by differentiation into a linear differential equation. Suppose the highest power of z in 
the kernel is n. After differentiating the equation (n + 1) times with respect to x follows a differential 
equation of n-th order in the case of an integral equation of the first kind, and of the order n + 1 in the 
case of an integral equation of the second kind. Of course it is to be assumed that (x) and f(x) are 
differentiable as many times as necessary. 


E ^ [2(r — y? + ljy(y)dy = α) (1). After differentiating three times with respect to x holds 

ex) f z—y)e(y) dy ^ 32? (II*a), eG) f^ e y)dy = 6r (I*b), | qe" (z)--4p(z) — 6 (II*c). 
The general solution of this differential equation is y(a) = Asin 2x + B cos 2x + 5 Substituting x — 0 
in (II*a) and (II*b) results in o(0) = 0, y’(0) = 0, so the solution is A — 0, B = —1.5. The solution 
of the integral equation (I*) is e(x) — λα — cos 2r). 
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11.4.3 Solution of the Volterra Integral Equation of the Second 
Kind by Neumann Series 


The solution of the Volterra integral equations of the second kind can be represented by using a Neu- 
mann series (see 11.2.3, p. 627). If the equation has the form 


p(t) = fla) +A [ K(w,yolu) dy, (11.61) 


so one formally substitutes 





τη. K(r,y) fory&rz, 
K(a,y) = la fory >z. (11.62a) 
With this transformation (11.61) is identically to a Fredholm integral equation 
b 
p(x) = fx) A ['K(s, ey) dv, (11.62b) 
allowing b = oo as well. The solution has the representation 
- b 
p(x) = f(z) + >>” [ fon dy. (11.62c) 
n=1 a 
The iterated kernels K1, Ko,... are defined by the following equalities: 
b r 
Kı(z,y) = K(z,y) Κο) -- [Ke n)K(n,y) dn = [κα η)Κ(η,υ)άη,... (11.624) 
a y 
and in general: 
παν) -- | Αα. η) Κι τί, y) du. (11.629) 


y 
The equalities K;(r,y) 2 0 for y > x (j — 1,2,...) are also valid for iterated kernels. Contrary to 
Fredholm integral equations if (11.61) has any solution, the Neumann series converges to it regardless 
of the value of A. 


Β v(x) =14+ λ/ e" "p(y)dy. Ki(x,y) — K(x,y) — e**, Ko(v,y) — / e? Πε 9 dy = ία -- 
Jo Jy 


Jo Kales) = 2 ole yy 

1 — Ant = — (rt —- $ 

yos King cy 9 
oo AT 

Consequently the resolvent is: P(x,y; A) = e™ 4 στα — y)" — e€*-900*D, Tt is well-known that 
aon 


this series is convergent for any value of the parameter A. 

T T 
One gets y(x) = 1 + af ευ να dy = 14 | e OD! dy. in particular if A — —1: q(z) — 

0 0 
1 
Lm L3. a cm (Qr 1)r 

1—z,Àz-L ola) = x74 (1 +¢ ). 
11.4.4 Convolution Type Volterra Integral Equations 
If the kernel of a Volterra integral equation has the special form 


k(r—y) for 0<y<a, 
)-10 


K(z,y) — cr (11.63a) 
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can be used the Laplace transformation to solve the equations 
[ ρω) ία) 0163) oœ ola) = fa) + fk —e()dy. — QL63) 
0 0 
If the Laplace transforms L{y(x)} = P(p), L{f(x)} = F(p), and L{k(x)} = K(p) exist, then the 
12, 11., p. 773) 


transformed equations have the form (see 15.2.1.2, 


K(p)d(p) — F(p) (11.64a) or (p) = F(p) 4- K(p)d(p) resp. (11.64b) 





From these follows 
ip) -= EP) ime (p) = P) resp. (11.64d) 


~ K(p) I-K 


The inverse transformation gives the solution y(x) of the original problem. Rewriting the formula for 
the Laplace transform of the solution of the integral equation of the second kind yields 


F(p) K(p) 


(p) = 1- K(p) = F(p) + T-K@ (11.64e) 
The formula 
K(p) 

I- K(p) H(p) (11.64f) 


depends only on the kernel, and denoting its inverse by h(x), the solution is 
p(x) = FG) + f hle- 9) f) dy. (1.646) 
0 


The function h(a — y) is the resolvent kernel of the integral equation. 


Β p(x) = f(x) + f e* “y(y) dy: (p) = F(p) + ο. i.e., (p) = E Flr). The inverse 
0 p- p- 





transformation gives y(x). From H(p) = 


elo) = f(a) + | EF) dy. 


1 
5 it follows that h(x) = e?”. By (11.64g) the solution is 


11.4.5 Numerical Methods for Volterra Integral Equation of 
the Second Kind 


The problem is to find the solution for the integral equation 


p(x) = f(x) + | Kley) dy (11.65) 


for x from the interval J = [a,b]. The purpose of numerical methods is somehow to approximate the 
integral by a quadrature formula: 


J Klevel) du = Quail K (E, JHO): (11.66a) 


Both the interval of integration and the quadrature formula depend on x. This fact is emphasized by 
the index [a, x] of Qja,cj(...). One gets the following equation as an approximation of (11.65): 


Ble) = f(x) + Qa (Ke, JBC). (11.605) 
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The function g(x) is an approximation of the solution of (11.65). The number and the arrangement of 
the interpolation nodes of the quadrature formula depend on z, so as to allow little choice. If £ is an 
interpolation node of Qu (KK (v, .)g(.)) , then (K (v, £)g(£)) and especially z(£) must be known. For 
this purpose, the right-hand side of (11.66b) should be evaluated first for x = £, which is equivalent to a 
quadrature over the interval [a, £]. As a consequence, the use of the popular Gauss quadrature formula 
is not possible. 

The problem is to be solved by choosing the interpolation nodes as a = £o < £1 < ... < Lp <... 
and using a quadrature formula Qja,z„] with the interpolation nodes zo, £1,...,&n. The values of the 
function at the interpolation nodes are denoted by the brief notation Yk = P(£k) (k = 0,1,2,...). 
For oo follows (see 11.3.1, p. 635) 


po = f(xo) — f(a),  (11.66c) and with this: pı = f(zi)- Qus (X (1,.)](.)). (11.664 
Q5, has the interpolation points zo and x; and consequently it has the form 
Ομ ναι, .)P(.) = wo (v1, 20) G0 + WK (a1, 1): (11.66e 


with suitable coefficients wo and w1. Continuing this procedure, the values pp are successively deter- 
mined from the general relation: 


Pr = f (Tk) - Ωαεμ(Κίακ,.)φ(-)), &=1,2,3,.... (11.661 
"The quadrature formulas Qt,;,; have the following form: 


k 
Qlan] (K (£r, Je.) = 3 wj K (v. ;)o;. (11.66g 


j-0 
Hence, (11.66f) takes the form: 





k 
Pr = (αν) Ἐν wg K (xi, vj)o;. (11.66h 


j-0 


The simplest quadrature formula is the left-hand rectangular formula (see 19.3.2.1, p. 964). For this 
the coefficients are 


WUjk-— cmja-—r; for j<k and wy = 0. (11.66i) 
With this follows the system 

yo = f(a), 

pı = f (a1) t (X1 — xo) K (21; 0) 0, (11.67a) 





P2 = f (v2) + (T1 — xo) C (2, 9)io -- (xa — v1) K (12, 21) 
and generally 


ο 
Pr = f (ee) - Y (jaa — m) K (1, 25)9;. (11.67b) 


10 
More accurate approximations of the integral can be obtained by using the trapezoidal formula (see 
19.3.2.2, p. 964). To make it simple, one chooses equidistant interpolation nodes 1, = a + kh, k = 
0,122 2t 
b h k-1 

EO απ το a glzo) +2 X. g(z;) + g(zr)| - (11.67c) 

a jai 
Using this approximation for (11.66f) one gets: 

po = f(a), (11.674) 


Pk = flax) +8 





k-1 
K (ap, U0) G0 + K (xy, Ue) eR + 2 y K (xp, e| ; (11.67e) 
j=l 





648 11. Linear Integral Equations 





Although the unknown values also appear on the right-hand side of the equation, they are easy to 
express. 

Remark: With the previous method one can approximate the solution of non-linear integral equations 
as well. Using the trapezoidal formula to determine the values v one has to solve a non-linear equation. 
To avoid this one can use the trapezoidal formula for the interval [a,2,_1], and use the rectangular 
formula for the interval [zj..1, x]. If h is small enough, this quadrature error does not have a significant 
effect on the solution. 


W The problem is to solve the integral equation y(x) = 2 + J (x — y)yl(y) dy by the formula (11.668 
0 


using the rectangular formula. The interpolation nodes are the equidistant points x, = k- 0.1, and 
hence h = 0.1. 














x exact | rectangular | trapezoidal 
yo =2, | formula formula 
WE = fla) t ο 02. 0.2 | 2.0401 2.0602 2.0401 
a c fün)4 h(K(mamo)qod K(m,z)e) 04 | 21021 |  2:2030 2.1620 

— 2 4- 0.1(0. E 2 + 0.1 - 2.02) = 2.0602 0.6 | 2.3709 2.4342 2.3706 
etc. 0.8 | 2.6749 2.7629 2.6743 
1.0 | 3.0862 3.2025 3.0852 


In the table the values of the exact solution are given, as well as the approximate values calculated by 
the rectangular and the trapezoidal formulas, respectively, so the accuracies of these methods can be 
compared. The step size used is h = 0.1. 


11.5 Singular Integral Equations 

An integral equation is called a singular integral equation if the range of the integral in the equation 
is not finite, or if the kernel has singularities inside of the range of integration. It is to be supposed 
that the integrals exist as improper integrals, or as Cauchy principal values (see 8.2.3, p. 506ff.). The 
properties and the conditions for the solutions of singular integral equations are very different from 
those in the case of “ordinary” integral equations. In the following sections only some special problems 
are discussed. For further discussions see [11.2], [11.3], [11.7], [11.8]. 


11.5.1 Abel Integral Equation 

One of the first applications of integral equations for a physical prob- 
lem was considered by Abel. A particle is moving in a vertical plane 
along a curve under the influence only of gravity from the point 
Po(xo, yo) to the point P,(0,0) (Fig. 11.2). 

The velocity of the particle at a point of the curve is 


Po (Xo -Y0 ) 





1 o 
P; (0,0) Xo x v= "t = V29(yo — y). (11.68) 
Figure 11.2 The time of fall as a function of yo is calculated by integration: 
l 
T (yo) -/——“—— (11.69a) 
0 29( (Yo p y) 


If s is considered as a function of y, i.e., s = f(y), then 


lf) 
ò ν0 ννο-ν 


yo 


T(yo) = dy. (11.69b) 
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The next problem is to determine the shape of the curve as a function of yo if the time of the fall is 
given. By substitution 
V29:T(yo) — F(yo) and Ρ(ν) - φ(υ) (11.69c) 


and changing the notation of the variable yo into x, a Volterra integral equation of the first kind is 
obtained: 





= dy. (11.694 
| Vi-3 ) 

It is to be considered now the slightly more general equation 
(x)= HL wih 0-cacl. (11.70) 


(a — y)* 


The kernel of ia equation is not bounded for y = x. In (11.70), the variable y is formally replaced 
by € and the variable x by y. By these substitutions the solution is obtained in the form  — q(x). If 


1 
both sides of (11.70) are multiplied by the term Gy and integrated with respect to y between 
2 


— y) 


the limits a and x, it yields the equation 


= t_v(é) V [LIO : 
/ TIT (/ G-E «) dy / (s- yrs dy. (11.71a) 


Changing the order of integration on the left-hand side gives 











f f dy f FW) 
ρίε f de = f — dy. (11.71) 
[ο (z — y)'-*(y — £)* 1 (α-ν)-- 
The inner ΜΝ can be evaluated by the substitution y = € + (a — &)u: 
1 
dy du T 
= = ; 11.71 
i (x — y)t-*(y — £)* l u*(l— u)i-*  sin(ar) (Le) 


This result is to be substituted into (11.71b). After differentiation with respect to x one gets the func- 
tion y(x): 


p(x) = Bion) d. [oet (11.71d) 


m drj (x-y) 








ply) νὴ =e 
-h uy e =e fe dy = πνα. 
11.5.2 Singular Integral Equation with Cauchy Kernel 
11.5.2.1 Formulation of the Problem 


Consider the following integral equation: 


a(r)g(x) 4- | AED) of) dy = f(z), wel. (11.72) 


πι — 
p oJ 


T is a system consisting of a finite number of smooth, simple closed curves in the complex plane such 
that they form a connected interior domain $* with 0 € S* and an exterior domain S~. Driving along 
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the curve there is S* always on the left-hand side of I. A function u(x) is Hólder continuous (or satisfies 
the Hélder condition) over I’ if for any pair 21, £2 € I the relations 

lu(zi) — u(a2)| < Klay — 22/8, O< Ps <1, K>0 (11.73) 
are valid. It is supposed that the functions a(x), f(x), and y(x) are Hölder continuous with exponent 
£1, and K (z, y) is Hólder continuous with respect to both variables with the exponents 82 » βι. The 
kernel K (z, y)(y — x)-! has a strong singularity for 2 = y. The integral exists as a Cauchy principal 
K(r,y) — K(v,c) 

ντα 


value. With K(x, £) = b(x) and k(x, y) = (11.72) holds in the form 


Coa) = alo)ola) + E f PW) α EL = f(e) ser. (14) 


mis y— 


The expression (£y)(2) denotes the left-hand side of the integral equation in abbreviated form. £ is 
a singular operator. The function k(x, y) is a weakly singular kernel. It is assumed that the normality 
condition a(x)? — b(x)? # 0, £ € I’ holds. The equation 


gly) 


(Ορ) (α) -- αἴα)φ(α) - E peda fe), zen (11.74b) 





is the characteristic equation pertaining to (11.74a). The operator Lo is the characteristic part of the 
operator £. The adjoint integral equation of (11.74a) yields the equality 


ο ο ο πο ο yas 


ae Ti $—1 
y T y 


=g(y), ver. (11.74c) 


11.5.2.2 Existence of a Solution 
The equation (Ly)(x) = f(x) has a solution (x) if and only if for every solution (1) of the homoge- 
neous transposed equation (£'y)(y) = 0 the condition of orthogonality 


EO) dy =0 (11.75a) 


is satisfied. Similarly, the transposed equation (£'w)(y) = g(y) has a solution if for every solution 
(a) of the homogeneous equation (Ly) (a) = 0 the following is valid: 


EO dz — 0. (11.75) 

T 
11.5.2.3 Properties of Cauchy Type Integrals 
The following function is called a Cauchy type integral over I: 

1 
(z) = — PY) ay z€C, (11.76a) 
amid y-—z 
F 

For z ¢ I the integral exists in the usual sense and represents a holomorphic function (see 14.1.2, 
p. 732). Also (o0) = 0 holds. For z — x € TI in (11.76a) is to be considered the Cauchy principal 
value 


Hole = z [ £0) 


yY =F 





dy, wel. (11.76b) 
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The Cauchy type integral ®(z) can be extended continuously over I from S% and from S^. The limits 
when approaching z the point x € I are denoted by +(x) and ~ (x), respectively. The formulas of 
Plemelj and Sochozki are valid: 


$*()- $o()- (Moy), — d-() - - 5e) - Hela). (1.760) 


11.5.2.4 The Hilbert Boundary Value Problem 


1. Relations 

The solution of the characteristic integral equation and the Hilbert boundary value problem are strongly 
correlate. If (x) is a solution of (11.74b), then (11.76a) is a holomorphic function on $* and S^ with 
(co) = 0. Because of the formulas of Plemelj and Sochozki (11.76c) 


p(x) = @t(r)- D(x), 2Hy)(x)= Ot (a) +O (x), rer (11.77a) 
holds. With the notation 
a(z) — b(a) Κο) 
2) -- ---------- and r) —————. VTE 
e) a(x) 4- b(x) aud. gen) a(x) 4- b(x) ' am) 
the characteristic integral equation has the form: 
P(x) = G(x) (x) + g(x), zer. (11.77c) 
2. Hilbert Boundary Value Problem 
Looking for a function ®(z) which is holomorphic on S* and S~, and vanishes at infinity, and satisfies 
the boundary conditions (11.77c) over I. A solution &(z) of the Hilbert problem can be given in the 
form (11.76a). So, as a consequence of the first equation of (11.77a), a solution y() of the characteristic 
integral equation is determined. 
11.5.2.5 Solution of the Hilbert Boundary Value Problem 
(in short: Hilbert Problem) 


1. Homogeneous Boundary Conditions 
@* (x) =G(xr)@ (x), vel. (11.78) 


During a single circulation of the point x along the curve T, the value of log G(x) changes by 27 id), 
where A; is an integer. The change of the value of the function log G(x) during a single traverse of the 
complete curve system I" is 


So arid, = Qik. (11.79a) 
1=0 
The number «K = Y A is called the index of the Hilbert problem. Now it is to compose a function 
i-o 
Go(r) = (x — ao) ^Il(zx)G(x) (11.79b) 
II(z) ^ (x — a1) * (z — a3) --- (x — as, (11.79c) 
where ag € S* and aj (I — 1,...,n) are arbitrarily fixed points inside I}. If P = Tọ is a simple closed 
curve (n — 0), then one defines II() — 1. With 
1 l 1 
dy [BSD ay (11.79d) 
27i Row 


the following particular solution of the homogeneous Hilbert problems is obtained, which is called the 
fundamental solution: 
. fH-(z)expl(z) for zest, 


X(2)= { (z— ag) *expl(z) for ze S". (11.790) 
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This function doesn’t vanish for any finite z. The most general solution of the homogeneous Hilbert 
problem, which vanishes at infinity, for & > 0 is 

®,(z) = X(z)Pe-i(z), 2EC (11.80) 
with an arbitrary polynomial P,_1(z) of degree at most (« — 1). For « < 0 there exists only the trivial 
solution ®;,(z) = 0 which satisfies the condition ®;(00) = 0, so in this case P,_1(z) = 0. For K > 0 the 
homogeneous Hilbert problem has & linearly independent solutions vanishing at infinity. 


2. Inhomogeneous Boundary Conditions 
The solution of the inhomogeneous Hilbert problem is the following: 


d(z) — X(z)R(z) + d (z) (11.81) with R(z) "ail x82 A (11.82) 


If « « 0 holds, for the existence of a solution vanishing at infinity the following necessary and sufficient 
conditions must be fulfilled: 


υ"α(ν) αμ... m 
1. (Kem 0, esas 1). (11.83) 


11.5.2.6 Solution of the Characteristic Integral Equation 


1. Homogeneous Characteristic Integral Equation 
If a(z) is the solution of the corresponding homogeneous Hilbert problem, from (11.772) follows the 
solution of the homogeneous integral equation 


pr(x) = Of (x) — B(x), «eT. (11.84a) 
For & € 0 only the trivial solution yp(x) = 0 exists. For & > 0 the general solution is 
pn(x) = [X+ (x) — X^ (x)| Pa i(x) (11.84b) 


with a polynomial P,_; of degree at most & — 1. 


2. Inhomogeneous Characteristic Integral Equation 
If (z) is a general solution of the inhomogeneous Hilbert problem, the solution of the inhomogeneous 
integral equation can be given by (11.77a): 








p(x) = &* (x) — D(x) (11.85a) 
= X*(x)Rt (x) — X~(a)R (a) + S$ (x) — P, (z), «eT. (11.85b) 
Using the formulas of Plemelj and Sochozki (11.76c) for R(z) holds 
Do li gm) 9 X64 pol 9) 9. : 
R'e) = i t (u5) ὦ, Ra) =- * (x4) ln. (1.850) 


Substituting (11.85c) into (11.85a) and considering (11.76b) and g(x) — f(x)/(a(x) 4- b()) finally 


results in the the solution: 


909) 7 grs) -6(2))X* (5) ^) 


1 fy) a ae 
στι} {το χω Tay ee sero Qna) 





F(X* (2) - X^ (2) 


According to (11.83) in the case & « 0 the following relations must hold simultaneously in order to 
ensure the τ E a solution: 





[ag DE cu jn (k 20,1,...,—&— 1). (11.87) 
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W The characteristic integral equation is given with constant coefficients a and b: 


b t 
ag(x) 4 m / 2w dy = f(x). Here I’ is a simple closed curve, i.e., /' — I9 (n — 0). From (11.77b) 
ντα 




















—b z 
follows G — ^ πα and g(a“) = a . G is a constant, consequently & — 0. Therefore, I(x) = 1 and 
a a 
—b 
-b -b 1 1 A geg 
Gu -G--—. I(z) log 5 -J dy = log lp4955 
a+b at+b2mif yz 0, zes. 
a—b 
2 B —b 
Vij) eee ee i, RS ek, 
1 A atb 


Since & = 0 holds, the homogeneous Hilbert boundary value problem has only the function $,(2) = 0 
as the solution vanishing at infinity. From (11.86) follows 
Xt + X7 X'—-X- 1 f(y) a b 1 fy) , 
y. 


ict ys dc "T 
vis a tX O ' 2(a 4-b)X* nij y-c y gg a? — b? mid ya 





12 Functional Analysis 


1. Functional Analysis 

Functional analysis arose after the recognition of a common structure in different disciplines such as 
the sciences, engineering and economics. General principles were discovered that resulted in a common 
and unified approach in calculus, linear algebra, geometry, and other mathematical fields, showing their 
interrelations. 

2. Infinite Dimensional Spaces 

There are many problems, the mathematical modeling of which requires the introduction of infinite 
systems of equations or inequalities. Differential or integral equations, approximation, variational or 
optimization problems could not be treated by using only finite dimensional spaces. 

3. Linear and Non-Linear Operators 

In the first phase of applying functional analysis — mainly in the first half of the twentieth century 
— linear or linearized problems were thoroughly examined, which resulted in the development of the 
theory of linear operators. More recently the application of functional analysis in practical problems 
required the development of the theory of non-linear operators, since more and more problems had 
to be solved that could be described only by non-linear methods. Functional analysis is increasingly 
used in solving differential equations, in numerical analysis and in optimization, and its principles and 
methods became a necessary tool in engineering and other applied sciences. 

4. Basic Structures 

In this chapter only the basic structures will be introduced, and only the most important types of 
abstract spaces and some special classes of operators in these spaces will be discussed. The abstract 
notion will be demonstrated by some examples, which are discussed in detail in other chapters of this 
book, and the existence and uniqueness theorems of the solutions of such problems are stated and 
proved there. Because of its abstract and general nature it is clear that functional analysis offers a large 
range of general relations in the form of mathematical theorems that can be directly used in solving a 
wide variety of practical problems. 


12.1 Vector Spaces 
12.1.1 Notion of a Vector Space 


A non-empty set V is called a vector space or linear space over the field F of scalars if there exist two 
operations on V — addition of the elements and multiplication by scalars from F — such that they have 
the following properties: 

1. for any two elements x,y € V, there exists an element z = x + y € V, which is called their sum. 

2. For every x € V and every scalar (number) o € F there exists an element ax € V, the product of x 
and the scalar a so that the following properties, the axioms of vector spaces (see also 5.3.8.1, p. 365), 
are satisfied for arbitrary elements x, y, z € V and scalars q, 8 € F: 





V7 
V8 


(a 4- B)x — ox 4- Bx. (12.7 
alz +y) = azr + ay. (12.8 


(V1) z+(y+z)=(x+y)+z. (12.1) 
(V2) There exists an element 0 € V, the zero element, such that x +0 = x. (12.2) 
(V3) To every vector x there isa vector — a such that x + (--π) Ξ- 0. (12.3) 
(V4) r+y=y+r. (12.4) 
(V5) l-v=2, 0-α-0. (12.5) 
(V6) a(Gx) = (αβὴα. (12.6) 
(V7) ) 
(V8) ) 
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V is called a real or complex vector space, depending on whether F is the field R of real numbers or 
the field C of complex numbers. The elements of V are also called either points or, according to linear 
algebra, vectors. The vector notation i or x is not used in functional analysis. 

The difference x — y of two arbitrary vectors x,y € V also can be defined in V as « — y = x + (—y). 
From the previous definition, it follows that the equation x +y = z can be solved uniquely for arbitrary 
elements y and z. The solution is x — z — y. Further properties follow from axioms (V1)-(V8): 

e the zero element is uniquely defined, 

© ax= Px andx £0, imply a= £, 

e az= ay and a z 0, imply x = y, 

e —(ar) — a- (—zr). 


12.1.2 Linear and Affine Linear Subsets 


1. Linear Subsets 
A non-empty subset Vo of a vector space V is called a linear subspace or a linear manifold of V if together 
with two arbitrary elements x,y € Vo and two arbitrary scalars o, f. € F, their linear combination 
ax + By is also in Vo. Vo is a vector space in its own right, and therefore satisfies the axioms (V1)— 
(V8). The subspace Vo can be V itself or only the zero point. In these cases the subspace is called 
trivial. 
2. Affine Subspaces 
A subset of a vector space V is called an affine linear subspace or an affine manifold if it has the form 
{to+y:y € Vo}, (12.9) 
where zo € V is a given element and Vo is a linear subspace. It can be considered (in the case ao Æ 0) 
as the generalization of the lines or planes not passing through the origin in R?. 
3. The Linear Hull 
The intersection of an arbitrary number of subspaces in V is also a subspace. Consequently, for every 
non-empty subset E C V, there exists a smallest linear subset lin(E) or [E] in V containing E, namely 
the intersection of all the linear subspaces, which contain E. The set lin(E) is called the linear hull of 
the set E, or the linear subspace generated by the set E. It coincides with the set of all (finite) linear 
combinations 
O42 o- O39 + . .. + AnTn, (12.10) 
comprised of elements £1, £2, . . . ,&n € E and scalars a1, Q2, . . . , Qn E F. 
4. Examples for Vector Spaces of Sequences 
W A Vector Space IF": Letn bea given natural number and V the set of all n-tuples, i.e., all finite 
sequences consisting of n scalar terms ((£1,...,£,) : & € F, i = 1,...,n). The operations will be 


defined componentwise or termwise, i.e., if x — (£1,..., £4) and y — (ri... m) are two arbitrary ele- 
ments from V and a is an arbitrary scalar, a € F, then 
e+y=(&4+m,---,& +m), (12.11a) ax — (a&,..., a£). (12.11b) 


In this way, the vector space F” is defined. The linear spaces R or C are special cases for n = 1. 

This example can be generalized in two different ways (see examples B and C). 

HB Vector Spaces of all Sequences: Considering the infinite sequences as elements x — (65) 22.4. 
En € F and defining the operations componentwise, similar to (12.11a) and (12.11b), the vector space 
S of all sequences are obtained. 


E C Vector Space (also coo) of all Finite Sequences: Let V be the subset of all elements of 
s containing only a finite number of non-zero components, where the number of non-zero components 
depends on the element. This vector space — the operations are again introduced termwise — is denoted 
by or also by Cop, and it is called the space of all finite sequences of numbers. 
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Bi D Vector Space m (also 1?) of all Bounded Sequences: —A sequence x = {&,}°2, belongs 
to m if and only if there exists C, 7 0 with |£,| X C;, Vn — 1,2,.... This vector space is also denoted 
by 1°. 

E E Vector Space c of all Convergent Sequences: A sequence x = {&,}°2, belongs to c if and 
only if there exists a number £9 € F such that for Ve > 0 there exists an index no = no(£) such that 
for all n > ng one has |£, — £o| « € (see 7.1.2, p. 458). 

E F Vector Space co of all Null Sequences: The vector space co of all null sequences, i.e., the 
subspace of c consisting of all sequences converging to zero (£o — 0). 

HG Vector Space I’: The vector space of all sequences x = {&,}°2, such that °°, |&,|? is conver- 
gent, is denoted by I? (1 € p « oo). 

It can be shown by the Minkowski inequality that the sum of two sequences from lI? also belongs to I”, 
(see 1.4.2.13, p. 32). 


Remark: For the vector spaces introduced in examples A-G, the following inclusions hold: 
pCescecmcs and yC Pc] C co, where 1<p<q<oo. (12.12) 
5. Examples of Vector Spaces of Functions 
E A Vector Space F(T): Let V be the set of all real or complex valued functions defined on a given 
set T, where the operations are defined point-wise, i.e., if v = x(t) and y = y(t) are two arbitrary 
elements of V and a € F is an arbitrary scalar, then we define the elements (functions) «+ y anda-«x 
by the rules 
(x 4-y)(t) 2 x(t) - t) Vte T, (12.13a) 
(ax)(t)=a-a(t) Vte T. (12.13b) 
This vector space is denoted by F(T). 
Some of the subspaces are introduced in the following examples. 
HB Vector Space B(T) or M(T): The space B(T) is the space of all functions bounded on T. This 
vector space is often denoted by M(T'). In the case of T = N, one gets the space M(N) = m from 
example D of the previous paragraph. 
B C Vector Space C([a, b]): The set C([a, b]) of all functions continuous on the interval [a, b] (see 
2.1.5.1, p. 58). 
Bi D Vector Space C(?([a,b]): Letk € N, k > 1. The set C({a,d]) of all functions k-times 
continuously differentiable on [a, b] (see 6.1, p. 432-437) is a vector space. At the endpoints a and 
b of the interval [a,b], the derivatives have to be considered as right-hand and left-hand derivatives, 
respectively. 
Remark: For the vector spaces of examples A-D of this paragraph, and T = [a,b] the following 
subspace relations hold: 
C'? ([a, 5]) C C([a, b]) C B([a. b]) C -F([a, b). (12.14) 
WB E Vector Subspace of C({a, b]): For any given point t € [a,b], the set (x € C([a,6]): x(to) = 0} 
forms a linear subspace of C([a, b]). 


12.1.3 Linearly Independent Elements 


1. Linear Independence 
A finite subset {21,...,2,} of a vector space V is called linearly independent if 

Qatı +*+ antn =0 implies ay =---=a, =0. (12.15) 
Otherwise, it is called linearly dependent. If ay = --- = a, = 0, then for arbitrary vectors x1,...,%p 
from V, the vector a,x, +--+ + Qp2%p is trivially the zero element of V. Linear independence of the 
vectors 21,...,r, means that the only way to produce the zero element 0 = aya, +--+ + o2, is when 
all coefficients are equal to zero ay = --- = a, = 0. This important notion is well known from linear 
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algebra (see 5.3.8.2, p. 366) and was used e.g. for the definition of a fundamental system of solutions of 
linear homogeneous differential equations (see 9.1.2.3, 2., p. 553). An infinite subset E C V is called 
linearly independent if every finite subset of E is linearly independent. Otherwise, E is called linearly 
dependent. 
E ifthesequence whose k-th term is equal to 1 and all the others are 0 is denoted by e; , then belongs e; 
to the space y and consequently to any space of sequences. The set {€1, €2, .. .) is linearly independent 
in every one of these spaces. In the space C([0, 7]), e.g., the system of functions 

1, sinnt, cosnt (n= 1,2,3,...) 
is linearly independent, but the functions 1, cos 2t, cos? t are linearly dependent (see (2.97), p. 81). 
2. Basis and Dimension of a Vector Space 
A linearly independent subset B from V, which generates the whole space V, i.e., lin(B) = V holds, is 
called an algebraic basis or a Hamel basis of the vector space V (see 5.3.8.2, p. 366). B = {re : € € E] 
is a basis of V if and only if every vector x € V can be written in the form x = 7 org, where the 

ξεξ 


coefficients a are uniquely determined by x and only a finite number of them (depending on x) can be 
different from zero. Every non-trivial vector space V, i.e., V # {0}, has at least one algebraic basis, 
and for every linearly independent subset E of V, there exists at least one algebraic basis of V, which 
contains this subset of E. 


A vector space V is m-dimensional if it possesses a basis consisting of m vectors. That is, there exist 
m linearly independent vectors in V, and every system of m + 1 vectors is linearly dependent. 

A vector space is infinite dimensional if it has no finite basis, i.e., if for every natural number m there 
are m linearly independent vectors in V. 

The space F” is n-dimensional, and all the other spaces in examples B-E are infinite dimensional. The 
subspace lin({1, t, t?}) C C([a, b]) is three-dimensional. 

In the finite dimensional case, every two bases of the same vector space have the same number of ele- 
ments. Also in an infinite dimensional vector space any two bases have the same cardinality, which is 
denoted by dim(V). The dimension is an invariant quantity of the vector space, it does not depend on 
the particular choice of an algebraic basis. 


12.1.4 Convex Subsets and the Convex Hull 
12.1.4.1 Convex Sets 


A subset C of a real vector space V is called conves if for every pair of vectors x,y € C all vectors of 
the form Ax + (1 — A)y, 0 < A < 1, also belong to C. In other words, the set C is convex, if for any 
two elements x and y, the whole line segment 


{Ac + (1—A)y: O<AK< 1}, (12.16) 


(which is also called an interval), belongs to C. (For examples of convex sets in IR? see the sets denoted 
by A and B in Fig. 12.5, p. 684.) 

The intersection of an arbitrary number of convex sets is also a convex set, where the empty set is agreed 
to be convex. Consequently, for every subset E C V there exists a smallest convex set which contains 
E, namely, the intersection of all convex subsets of V containing E. It is called the conver hull of the 
set E and it is denoted by co (E). co (E) is identical to the set of all finite convez linear combinations 
of elements from E, i.e., co (E) consists of all elements of the form A12 Εντ. 4- A424, where 21,... η 
are arbitrary elements from E and A; € [0,1] satisfy the equality A1 4- --- -- A, — 1. Linear and affine 
subspaces are always convex. 


12.1.4.2 Cones 
A non-empty subset C of a (real) vector space V is called a convex cone if it satisfies the following 
properties: 





658 12. Functional Analysis 





1. Cisaconvex set. 
2. From zv € C and À 7 0, it follows that Ax € C. 
3. From x € Cand —x € C, it follows that x — 0. 
A cone can be characterized also by 3. together with 
r,y€ C and A,u 20 imply Ar-cgye C. (12.17) 
Wl A: The set IR of all vectors z — (£1,...,6,) with non-negative components is a cone in IR". 
WB B: The sct C, of all real continuous functions on [a, b] with only non-negative values is a cone in the 
space C(|a, b]). 


Wi C: The set of all sequences of real numbers (£,)7?., with only non-negative terms, i.e., £, > 0, Vn, 


is a cone in s. Analogously, cones are obtained in the spaces of examples C-G in 12.1.2, p. 655, if the 
sets of non-negative sequences are considered in these spaces. 


HD: The set CC I? (1 € p « oo), consisting of all sequences (£, 9, such that for some a > 0 
Σ E <a (12.18) 
n=1 


is a convex set in I”, but obviously, not a cone. 
E E: Examples from R? see Fig. 12.1: a) convex set, not a cone, b) not convex, c) convex hull. 


(0,1) 





a) X bp) | 


Figure 12.1 


12.1.5 Linear Operators and Functionals 
12.1.5.1 Mappings 
A mapping T: D —> Y from the set D C X into the set Y is called 


eè injective, if 


Τ(α) --Τ(υ) --- «=y, (12.19) 
e surjective, if for 
Vy€Y there exists anelement x€D = suchthat T(x) =y, (12.20) 


eè bijective, if T is both injective and surjective. 
D is called the domain of the mapping T and is denoted by Dr or D(T), while the subset (y € Y: 
Jx € Dr with T(x) = y} of Y is called the range of the mapping T and is denoted by R(T) or Im(T). 


12.1.5.2 Homomorphism and Endomorphism 
Let X and Y be two vector spaces over the same field F and D a linear subset of X. A mapping T': 
D — Y is called linear (or a linear transformation, linear operator or homomorphism), if for arbitrary 
x,y E€ D anda, p EF, 

T(ax + by) = aTz + BTy. (12.21) 
For a linear operator T' the notation T'r is preferred, which is similarly used for linear functions, while 
the notation T(x) is used for general operators. 
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The range R(T) is the set of all y € Y such that the equation Tr — y has at least one solution. 
N(T) = {a € X: Tx = 0} is the null space or kernel of the operator T and is also denoted by ker(T). 
A mapping of the vector space X into itself is called an endomorphism. If T is an injective linear 
mapping, then the mapping defined on R(T’) by 
yı v, suchthat Tr — y, y € R(T) (12.22) 
is linear. It is denoted by T7: (T) —» X and is called the inverse of T. If Y is the vector space F, 
then a linear mapping f: X —9 F is called a linear functional or a linear form. 
12.1.5.3 Isomorphic Vector Spaces 


A bijective linear mapping T: X —> Y is called an isomorphism of the vector spaces X and Y. Two 
vector spaces are called isomorphic provided an isomorphism exists. 


12.1.6 Complexification of Real Vector Spaces 


Every real vector space V can be extended to a complex vector space V. The set V consists of all pairs 
(x,y) with x,y € V. The operations (addition and multiplication by a complex number a + ib € C) 
are defined as follows: 


(£1, Y1) + (£2, Y2) = (£1 + £2, Y1 + y2), (12.23a) (a+ib)(x, y) = (ax — by, bx + ay).(12.23b) 
Since the special relations 
(x,y) = (z,0) + (0,y) and i(y,0) = (0 +i1)(y,0) = (0-y — 1-0, 1y +0- 0) = (0,y) (12.24) 


hold, the pair (x, y) can also be written as x + iy. The set V is a complex vector space, where the set 











V is identified with the linear subspace Vo = {(x,0): a € V}, i.e., x € V is considered as (x,0) or as 
x +i0. 

This procedure is called the complexification of the vector space V. A linearly independent subset in V 
is also linearly independent in V. The same statement is valid for a basis in V, so dim(V) = dim(V). 


12.1.7 Ordered Vector Spaces 
12.1.7.1 Cone and Partial Ordering 


If a cone C is fixed in a vector space V, then an order can be introduced for certain pairs of vectors in 
V. Namely, if x — y € C for some x,y € V then one writes x > y or y € x and say x is greater than 
or equal to y or y is smaller than or equal to x. The pair (V, C) is called an ordered vector space or a 
vector space partially ordered by the cone C. An element x is called positive, if x 7 0 or, which means 
the same, if z € C holds. Moreover 

C={xeEV:x> 0}. (12.25) 


If the vector space IR? ordered by its first quadrant as the cone C(— Ri) is under consideration, then a 
typical phenomenon of ordered vector spaces will be seen. This is referred to as “partially ordered” or 
sometimes as “semi-ordered”. Namely, only certain pairs of two vectors are comparable. Considering 
the vectors x — (1, —1) and y — (0,2), neither the vector r — y — (1, 3) nor y — x — (—1,3) isin C, 
so neither x > y nor x < y holds. An ordering in a vector space, generated by a cone, is always only a 
partial ordering. 

It can be shown that the binary relation > has the following properties: 


(O1) r2rVrcV (reflexivity). (12.26) 
(O2) x> y and y> z imply z> z (transitivity). (12.27) 
(03) «>y and a>0, a€R, imply az > ay. (12.28) 


(04) a, > y and 29> y imply 71+ %2 > yi + yo. (12.29) 
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Conversely, if in a vector space V there exists an ordering relation, i.e., a binary relation > is defined 
for certain pairs of elements and satisfies axioms (O1)—(O4), and if one puts 

Vi = {x EV: «> 0}, (12.30) 
then it can be shown that V, is a cone. The order >y, in V induced by V+ is identical to the original 
order >; consequently, the two possibilities of introducing an order in a vector space are equivalent. 
A cone C C V is called generating or reproducing if every element x € V can be represented as x = 
u — v with u,v € C. It can be written in the form V = C-C. 
W A: Anobvious order in the space s (see example B, p. 655) is induced by means of the cone 

C = {x = {én}: En >20 Vn) (12.31) 
(see example C, p. 658). 


In the spaces of sequences (see (12.12), p 656) usually the natural coordinate-wise order is considered. 
This is defined by the cone obtained as the intersection of the considered space with C (see (12.31), 
p. 660). The positive elements in these ordered vector spaces are then the sequences with non-negative 
terms. It is clear that other orders can be defined by other cones, as well. Then orderings different from 
the natural ordering can be obtained (see [12.17], [12.19]). 


E B: In the real spaces of functions F(T), B(T), C({a,b]) and C“({a, b]) (see 12.1.2, 5., p. 656), the 
natural order x > y for two functions x and y is defined by z(t) > y(t), Yt € T, or Vt € [a,b]. 
Then x > 0 if and only if x is a non-negative function in T. The corresponding cones are denoted by 
F(T), B4(T), etc. Also C} = C4 (T) = F4 (T) A C (T) can be obtained if T = [a,b]. 


12.1.7.2 Order Bounded Sets 


Let E be an arbitrary non-empty subset of an ordered vector space V. An element z € V is called 
an upper bound of the set E if for every x € E, r € z. An element u € V is a lower bound of E if 
u € z, V r € E. For any two elements x,y € V with x < y, the set 

ry] 2 (ve V: v € y) (12.32) 
is called an order interval or (0)-interval. 
Obviously, the elements x and y are a lower bound and an upper bound of the set [z, y], respectively, 
where they even belong to the set. A set E C V is called order bounded or simply (o) bounded, if E is a 
subset of an order interval, i.e., if there exist two elements u, z € V such that u € x € z, Vr € E or, 
equivalently, Æ C |u, z]. A set is called bounded above or bounded below if it has an upper bound, or a 
lower bound, respectively. 


12.1.7.3 Positive Operators 
A linear operator (see [12.2], [12.17]) T: X —9 Y from an ordered vector space X — (X, X,) into an 
ordered vector space Y = (Y, Y+) is called positive, if 

T(X,)CY,, ie, Tr 20 forall «>0. (12.33) 


12.1.7.4 Vector Lattices 


1. Vector Lattices 

In the vector space R! of the real numbers the notions of (o)-boundedness and boundedness (in the 
usual sense) are identical. It is known that every set of real numbers which is bounded from above 
has a supremum: the smallest of its upper bounds (or the least upper bound, sometimes denoted by 
lub). Analogously, if a set of reals is bounded from below, then it has an infimum, the greatest lower 
bound, sometimes denoted by glb. In a general ordered vector space, the existence of the supremum 
and infimum cannot be guaranteed even for finite sets. They must be given by axioms. An ordered 
vector space V is called a vector lattice or a linear lattice or a Riesz space, if for two arbitrary elements 
x,y € V there exists an element z € V with the following properties: 

1. rXz and y&€z, 

2. ifuc€ V with z € u and y € u, then z < u. 
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Such an element z is uniquely determined, it is denoted by x V y, and it is called the supremum of x 
and y (more precisely: supremum of the set consisting of the elements x and y). In a vector lattice, 
there also exists the infimum for any x and y, which is denoted by x ^ y. For applications of positive 
operators in vector lattices see, e.g., [12.2], [12.3] [12.15]. 

A vector lattice is called Dedekind complete or a K-space (Kantorovich space) if every non-empty subset 
E that is order bounded from above has a supremum lub(£) (equivalently, if every non-empty subset 
that is order bounded from below has an infimum glb(£)). 


E A: In the vector lattice F([a, b]) (see 12.1.2, 5., p. 656), the 
supremum of two functions x,y is calculated pointwise by the for- 


mula 
(x V y)(t) 2 max(z(t),u(t)) Vte [a,b]. (12.34) 
In the case of [a,b] = [0, 1], z(t) = 1 — št and y(t) = ? (Fig. 12.2), 
1— 3h af edel 
(xV y)(t) = { 2. if <t< D (12.35) 


is obtained. 

E B: The spaces C(|a,]) and B([a,b]) (see 12.1.2, 5., p. 656) are 
also vector lattices, while the ordered vector space CC ([a, b]) is not 
a vector lattice, since the minimum or maximum of two differentiable 
functions may not be differentiable on [a, b], in general. 

A linear operator T: X — Y from a vector lattice X into a vector 
lattice Y is called a vector lattice homomorphism or homomorphism 
Figure 12.2 of the vector lattice, if for all x,y € X 





T(xVy)=TxVTy and T(a@Ay)=TxrATy. (12.36 


2. Positive and Negative Parts, Modulus of an Element 
For an arbitrary element x of a vector lattice V, the elements 

z,—zV0, zr. —(-z)V0 and |2|=24+2_ (12.37 
are called the positive part, negative part, and modulus of the element x, respectively. For every elemen 
x € V, the three elements z., z..., |z| are positive, where for z, y € V the following relations are valid: 


rXcr,X€|vc, ξανα, rz,^z.-—0, |z| 2x V (-z), (13.384, 
(z +y)+ <t tys (w+y)-Sa_ty, dec zie. (12.38b 
x<y implies z4 <y, and x >y- (12.38c 


and for arbitrary a > 0 





(ar), =ar}, (ar) =ar, Ἰαπξ αι. (12.38d 


nf 


καθ 57 Ὁ 





τκπθν” 





t al / x(t) b =% al, 








Figure 12.3 


In the vector spaces F({a, b]) and C({a, b]), the positive part, the negative part, and the modulus of a 
function x(t) can be got by means of the following formulas (Fig. 12.3): 
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z(t) = [n ν᾿ 2 - ᾽ (12.392) 
z(= { δι). i τίν Z0 (12.39%) lel) =le] Yt E [a,b]. (12.39c) 


12.2 Metric Spaces 
12.2.1 Notion of a Metric Space 


Let X be a set, and suppose a real, non-negative function p(z, y) (x,y € X) is defined on X x X. If 
thisfunction p: Xx X > Ri satisfies the following properties (M1)-(M3) for arbitrary elements 
x,y, z € X, then it is called a metric or distance in the set X, and the pair X = (X, p) is called a metric 
space. The axioms of metric spaces are: 


(M1) p(a,y)>0 and p(x,y) =0 if and only if « = y (non-negativity), (12.40) 
(12). ρίαν) -- ρίν,α) (symmetry) (1241) 
(M3) p(x,y) € p(x, z) + p(z,y) (triangle inequality). (12.42) 


A metric can be defined on every subset Y of a metric space X = (X, p) in a natural way if the metric 
p of the space X is restricted to the set Y, i.e., if p is considered only on the subset Y x Y. The space 
(Y, p) of X x X is called a subspace of the metric space X. 

E A: The sets R” and C” are metric spaces with the Euclidean metric defined for points 

z—(&,... εξ) and y — (m, De sh) as 


p(z, y) = ec (12.43) 


E B: The function 


eo Tia = 12.44 

p(z, y) Be Ex — nel ( ) 
for vectors © = (&,...,&) and y — (m. ..., m) also defines a metric in R” and C”, the so-called 
mazimum metric. If & = (ἐν, bad En) is an approximation of the vector x, then it is of interest to know 


how much is the maximal deviation between the coordinates: qax [£x — κ]. 
<k<n 


The function 
n 


p(z. y) ^ l& — nel (12.45) 
k=1 


for vectors x,y € R” (or ©”) defines a metric in R” and C”, the so-called absolute value metric. The 
metrics (12.43), (12.44) and (12.45) are reduced in the case of n = 1 to the absolute value |x — y| in the 
spaces R and C (the sets of real and complex numbers). 

HMC: Finite 0-1 sequences, e.g., 1110 and 010110, are called words in coding theory. If the num- 
ber of positions is counted where two words of the same length n have different digits, i.e., for x — 
(1,---,&), y = (m,---,M), &, nk € (0,1), o(, y) is defined as the number of the k € (1,...,n] 
values such that k Æ ng, then the set of words with a given length n is a metric space, and the metric 
is the so-called Hamming distance, e.g., o((1110), (0100)) — 2. 

W D: Inthe set m and in its subsets c and co (see (12.12), p. 656) a metric is defined by 


p(z, y) = sup επ ην (£ = (£58 y m mim.) (12.46) 
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oo 
HE: Inthe set I? (1 € p « co) of sequences x — (£1, £2, ...) with absolutely convergent series 5 |£,|? 
n=1 


a metric is defined by 


play) = PY lén- lP, (zy € P). (12.47) 
n=1 


E F: In the set C([a,b]) a metric is defined by 
p(z, y) = max e(t) — v(t)]. (12.48) 


Bl G: In the set C ([a, b]) a metric is defined by 


x mur la ( OE), 


p Elab] 


where (see (12.14) CO ([a, b]) is understood as C (fa, b])). 
E H: Consider the set L^(0) (1 € p « oo) ofthe equivalence classes of Lebesgue measurable functions 





(12.49) 


which are defined almost everywhere on a bounded domain € C R” and / |x(t)|? di. « co (see also 
à 
12.9, p. 693). A metric in this set is defined by 


posa) — m fleo — OP dp. gean 
Q 


12.2.1.1 Balls, Neighborhoods and Open Sets 
In a metric space X = (X, p), whose elements are also called points, the following sets 
B(zo;r) = {x € X : p(x, £o) < r}, (12.51) B(zo;r) = {x € X : p(x, £0) < r} (12.52) 


defined by means of a real number r > 0 and a fixed point 29, are called an open and closed ball with 
radius r and center at £o, respectively. 

The balls (circles) defined by the metrics (12.43) and (12.44) and (12.45) in the vector space R? are 
represented in Fig. 12.4a,b with zọ = 0 and r = 1. 


MOT "ου “Jon 
{(x, yi y <1} 
(1,0) x (L0) x (L0) x 
b) c) 


Figure 12.4 








A subset U of a metric space X = (X, p) is called a neighborhood of the point xo if U contains xo together 
with an open ball centered at xo, in other words, if there exists an r > 0 such that B(xo;r) C U. A 
neighborhood U of the point x is also denoted by U (x). Obviously, every ball is a neighborhood of its 
center; an open ball is a neighborhood of all of its points. A point zo is called an interior point of a set 
A C X if xo belongs to A together with some of its neighborhood, i.e., there is a neighborhood U of £o 
such that r9 € U C A. A subset of a metric space is called open if all of its points are interior points. 
Obviously, X is an open set. 

The open balls in every metric space, especially the open intervals in R, are the prototypes of open sets. 
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The set of all open sets satisfies the following axioms of open sets: 
e If Ga is open for Va € J, then the set U Ga is also open. 
acl n 
e If G4, Gs», ..., G, are finitely many arbitrary open sets, then the set (| Gy is also open. 


e The empty set () is open by definition. B 
A subset A of a metric space is bounded if for a certain element o (which does not necessarily belong 
to A) and a real number & » 0 the set A is in the ball B(xo; R), i.e., p(z, xo) < R for all x € A. 


12.2.1.2 Convergence of Sequences in Metric Spaces 


Let X — (X, p) be a metric space, zo € X and (x4) 594, v4, € X a sequence of elements of X. 
The sequence [252 , is called convergent to the point xg if for every neighborhood U (xg) there is an 
index ng — no(U (xo)) such that for all n > no, £n € U (xo). The usual notation 
In —>% (n—oco) or lim r, — vo (12.53) 
n—oo 


is used and the point xg is called the limit of the sequence {xp }°2,. The limit of a sequence is uniquely 
determined. Instead of an arbitrary neighborhood of the point xo, it is sufficient to consider only open 
balls with arbitrary radii, so (12.53) is equivalent to the following: Ve > 0 (now thinking about the 
open ball B (x; £)), there is an index no = no(e), such that ifn > no, then p(an, 2%) < e. Notice that 
(12.53) means p(£n, z0) — 0. 

With these notions introduced in special metric spaces the distance between points can be calculated 
and the convergence of point sequences can be investigated. This has a great importance in numerical 
methods and in approximating functions by certain classes of functions (see, e.g., 19.6, p. 982). 

In the space R”, equipped with one of the metrics given above, convergence always means coordinate- 
wise convergence. 

In the spaces B([a, b]) and C([a, b]), the convergence introduced by (12.48) means uniform convergence 
of the function sequence on the set [a, b) (see 7.3.2, p. 468). 

In the space L?(Q) convergence with respect to the metric (12.50) means convergence in the (quadratic) 
mean, i.€., Un > Lo if 


[len — o| du — 0 for n— 00. (12.54) 
Q 


12.2.1.3 Closed Sets and Closure 


1. Closed Sets 
A subset F of a metric space X is called closed if X \ F is an open set. Every closed ball in a metric 
space, especially every interval of the form fa, b], [a, o0), (—00, a] in R, is a closed set. 
Corresponding to the axioms of open sets, the collection of all closed sets of a metric space has the 
following properties: 
e If F, are closed for Va € T, then the set N Fa is closed. 
acr n 

ο [If Fi,..., F, are finitely many closed sets, then the set U Fẹ is closed. 
e The empty set É is a closed set by definition. zn 
The sets Ø and X are open and closed at the same time. 
A point xo of a metric space X is called a limit point of the subset A C X if for every neighborhood 
U (ro), 

(απ) A z 0. (12.55) 
If this intersection always contains at least one point different from xo, then rp is called an accumulation 
point of the set A. A limit point, which is not an accumulation point, is called an isolated point. 
An accumulation point of A does not need to belong to the set A, e.g., the point a with respect to the 
set A = (a, b], while an isolated point of A must belong to the set A. 
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A point xo is a limit point of the set A if there exists a sequence {27 }°2, with elements x, from A, which 
converges to zo. If zo is an isolated point, then £n = £o, V n > no for some index no. 

2. 'The Closure of a Set 

Every subset A of a metric space X obviously lies in the closed set X. Therefore, there always exists a 
smallest closed set containing A, namely the intersection of all closed sets of X, which contain A. This 
set is called the closure of the set A and it is usually denoted by A. A is identical to the set of all limit 
points of A; A is obtained from the set A by adding all of its accumulation points to it. A is a closed 
set if and only if A = A. Consequently, closed sets can be characterized by sequences in the following 
way: A is closed if and only if for every sequence {2,}°2, of elements of A, which converges in X to an 
element zo(€ X), the limit zo also belongs to A. 

Boundary points of A are defined as follows: £o is a boundary point of A if for every neighborhood 
U (xo), U (vo) N A Æ Ý and also U (z0) N (X \ A) # 0. xo itself does not need to belong to A. Another 
characterization of a closed set is the following: A is closed if it contains all of its boundary points. (The 
set of boundary points of the metric space X is the empty set.) 


12.2.1.4 Dense Subsets and Separable Metric Spaces 

A subset A of a metric space X is called everywhere dense if A = X, i.e., each point x € X is a limit 
point of the set A. That is, for each x € X, there is a sequence {£n} £n € A such that x, —> 2. 

W A: According to the Weierstrass approximation theorem, every continuous function on a bounded 
closed interval [a,b] can be approximated arbitrarily well by polynomials in the metric space of the 
space C(|a, b]), i.e., uniformly. This theorem can now be formulated as follows: The set of polynomials 
on the interval [a, b] is everywhere dense in C([a, b]). 

W B: Further examples for everywhere dense subsets are the set of rational numbers Q and the set of 
irrational numbers in the space of the real numbers R.. 


A metric space X is called separable if there exists a countable everywhere dense subset in X. A count- 
able everywhere dense subset in R” is, e.g., the set of all vectors with rational components. The space 
1 =I is also separable, since a countable everywhere dense subset is formed, for example, by the set of 
its elements of the form x — (r1, r2, ..., ry, 0,0,...) , where r; are rational numbers and N = N(z) is 
an arbitrary natural number. The space m is not separable. 


12.2.2 Complete Metric Spaces 
12.2.2.1 Cauchy Sequences 


Let X = (X, p) be a metric space. A sequence {x,,}°2, with x, € X is called a Cauchy sequence if for 
Ve > 0 there is an index ng — no(£) such that for V n, m 7 ng there holds the inequality 

Plinn) LE (12.56) 
Every Cauchy sequence is a bounded set. Furthermore, every convergent sequence is a Cauchy se- 
quence. In general, the converse statement is not true, as is shown in the following example. 


Bl Consider the space I' with the metric (12.46) of the space m. Obviously, the elements «™ = 


11 1 
(i, PPOR 00... ) belong to l! for every n = 1,2,... and the sequence {2 }°°, is a Cauchy 
n 
sequence in this space. If the sequence (of sequences) {x}? converges, then it has to be convergent 
1. Ἱ a 1 


also coordinate-wise to the element 2 = (. PE 
2:3 n-1' 





; s However, x does not belong 


to Il, since Ya = +00 (see 7.2.1.1, 2., p. 459, harmonic series). 
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12.2.2.2 Complete Metric Spaces 
A metric space X is called complete if every Cauchy sequence converges in X. Hence, complete metric 
spaces are the spaces for which the Cauchy principle, known from real calculus, is valid: A sequence 
is convergent if and only if it is a Cauchy sequence. Every closed subspace of a complete metric space 
(considered as a metric space on its own) is complete. The converse statement is valid in a certain way: 
If a subspace Y of a (not necessary complete) metric space X is complete, then the set Y is closed in X. 
Bl Complete metric spaces are, e.g., the spaces: m, I? (1 € p « oo), c, B(T), C([a,0]), C9 ([a, b]), 
L?(a,b) (1€ p < oo). 
12.2.2.3 Some Fundamental Theorems in Complete Metric Spaces 
The importance of complete metric spaces can be illustrated by a series of theorems and principles, 
which are known and used in real calculus, and which are to be applied even in the case of infinite 
dimensional spaces. 
1. Theorem on Nested Balls 
Let X be a complete metric space. If 

B(z1;r1) D B(£2; r2) D 2 B(anjrn) D+: (12.57) 
is a sequence of nested closed balls with r, —> 0, then the intersection of all of those balls is non- 
empty and consists of only a single point. If this property is valid in some metric space for any sequence 
satisfying the assumptions, then the metric space is complete. 


2. Baire Category Theorem 


oo 
Let X be a complete metric space and {F,}?2, a sequence of closed sets in X with U Fẹ = X. Then 


there exists at least one index ko such that the set Fj, has an interior point. 
3. Banach Fixed-Point Theorem 
Let F be a non-empty closed subset of a complete metric space (X, p). Let T: X — X be a contracting 
operator on F, i.e., there exists a constant q € [0, 1) such that 

p(Txz, Ty) < qp(x,y) foral x,y €F. (12.58) 
Suppose, if x € F, then Tx € F. Then the following statements are valid: 
a) For an arbitrary initial point £o € F the iteration 

μι Te, (A= 02e) (12.59) 
is well defined, i.e., £n € F for every n. 
b) The iteration sequence {x,,}°2 converges to an element 2* € F. 
c) Ta* =a", ie., 2* isa fixed point of the operator T. (12.60) 
d) The only fixed point of T in F is x*. 
e) The following error estimation is valid: 

* q^ 

p(x*, m4) < 1- ^» 79). (12.61) 
The Banach fixed-point theorem is sometimes called the contraction mapping principle. 
12.2.2.4 Some Applications of the Contraction Mapping Principle 
1. Iteration Method for Solving a System of Linear Equations 


The given linear (n,n) system of equations 





Qj121 cajo12 +... + Qing = b1, 
2131 2232 +e Amn = b», (12 62a) 
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can be transformed according to 19.2.1, p. 955, into the equivalent system 





21 -(1--α[ι)αι +ay2%2 ++: +41n%n = by, 
22 casjz4 —(1 — az2)£2 +-+- +dantn = be, (12.62b) 
2 etd s boss σα, ues E A mne 


If the operator T: E" — F" is defined by 


n n T 
T= e — 5 yp + 01,... 2 — Σ [E s.) j (12.63) 


k=1 k=1 
then the last system is transformed into the fixed-point problem 
t= Tr (12.64) 
in the metric space F”, where an appropriate metric is considered: The Euclidean (12.43), the maxi- 


mum (12.44) or the absolute value metric p(x, y) = 5; |xy — yx| (compare with (12.45)). If one of the 
k=1 





numbers 
n n n 
2;lexP, mex?.laxl max $. laz (12.65) 
j,k=1 k=1 j=l 


is smaller than one, then T turns out to be a contracting operator. It has exactly one fixed point 
according to the Banach fixed-point theorem, which is the componentwise limit of the iteration sequence 
started from an arbitrary point of F”. 


2. Fredholm Integral Equations 
The Fredholm integral equation of second kind (see also 11.2, p. 622) 


e) - f KG yet) dy — Λίο). xE fat (12.66) 


with a continuous kernel K(x, y) and continuous right-hand side f(x) can be solved by iteration. By 
means of the operator T: C({a, b]) —> C([a, b]) defined as 


Τρ(α) -- [ K(e,yelw) dy + fe) Ve € C((a, 4), (12.67) 


it is transformed into a fixed-point problem Ty = ¢ in the metric space C({a, b]) (see example A in 
b 

12.1.2, 4., p. 655). If max | |K(x,y)| dy < 1, then T is a contracting operator and the fixed-point 
a-—r-0Ja 


theorem can be applied. The unique solution is now obtained as the uniform limit of the iteration 
sequence (qn) 24, where yn = Tyn-1, starting with an arbitrary function qo(x) € C(|a, 5]). It is clear 
that Yn = Tyo and the iteration sequence is {T" yo}. 


3. Volterra Integral Equations 
The Volterra integral equation of second kind (see 11.4, p. 643) 


(ο) -- | Κία.ν)ρ(ῳ) dy = fa), xE fat (12.68) 


with a continuous kernel and a continuous right-hand side can be solved by means of the Volterra 
integral operator 


(Via) --- | Κία.υ)ρίν)ἁν Ὑφ εο[αθ) (12.69) 
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and Ty = f + Vyas the fixed-point problem Ty = ¢ in the space C({a, 0]). 
4. Picard-Lindelof Theorem 
Consider the differential equation 

Bj) (12.70) 
with a continuous mapping f: I x G — R”, where J is an open interval of R and G is an open domain 
of R”. Suppose the function f satisfies a Lipschitz condition with respect to x (see 9.1.1.1, 2. p. 541), 
i.e., there is a positive constant L such that 

ol f(t, x1), f(t, £2)) < Lolz, x2) V (t, x1), (t, £2) E I X G, (12.71) 
where o is the Euclidean metric in IR". (Using the norm (see 12.3.1, p. 669) and the formula (12.81) 
olx, y) = ||x — y|| (12.71) can be written as || (t, x1) — f (t, x2)]| € L-]|x1 — a||.) Let (to, zo) € I x G. 
Then there are numbers 8 > Oandr > Osuch that the set Q = {(t, £) € RxR”: |t—to| < B, o(z, xo) € 


r) lies in I x G. Let M — maxo o( f(t, z),0) and a = min(, a 


that for each 2 € B — (x € R: o(z, zo) € 0], the initial value problem 

$-—f(tz) α(ίο) - 3 (12.72) 
has exactly one solution y(t, Z),i.e., @(t,@) = f(t, y(t, Z)) for Vt satisfying |t—to| € o and g(to, 2) — 3. 
The solution of this initial value problem is equivalent to the solution of the integral equation 


}. Then there is a number b > 0 such 


ρίε, 8) =a+ | F(s,9(s,2)) ds, t € [to — oœ, to + a]. (12.73) 


If X denotes the closed ball {y(t, x): d(y(t, £), £o) < r} in the complete metric space C ([to — a, to + 
a] x B; R”) with metric 


d(p, v) — o(ylt, x), (t, o), (12.74) 


then X is a complete metric space with the induced metric. If the operator T: X —9 X is defined by 


Inax 
(t.0)€{|tto|<a}xB 


Tolt,2) =%+ f f(s, o(s,4))ds (12.75) 


then T is a contracting operator and the solution of the integral equation (12.73) is the unique fixed 
point of T which can be calculated by iteration. 


12.2.2.5 Completion of a Metric Space 


Every (non-complete) metric space X can be completed; more precisely, there exists a metric space X 
with the following properties: 


a) X contains a subspace Y isometric to X (see 12.2.3, 2., p. 669). 

b) Y is everywhere dense in X. 

c) X is a complete metric space. 

d) If Z is any metric space with the properties a)-c), then Z and X are isometric. 

The complete metric space, defined uniquely in this way up to isometry, is called the completion of the 
space X. 

12.2.3 Continuous Operators 


1. Continuous Operators 
Let T: X — Y be a mapping of the metric space X = (X, p) into the metric space Y = (Y, o). T 
is called continuous at the point xo € X if for every neighborhood V = V (yo) of the point yo = T (zo) 
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there is a neighborhood U = U (xo) such that: 

T(r)€V foral seU. (12.76) 
T is called continuous on the set A C X if T is continuous at every point of A. Equivalent properties 
for T to be continuous on X are: 
a) For any point z € X and any arbitrary sequence [5 28 ,, z, € X with £n — x there always holds 
T(r,) — T(x). Hence p(v,, x) — 0 implies o(T (x4), T(x)) — 0. 
b) For any open subset G C Y the inverse image T~1(G) is an open subset in X. 
c) For any closed subset F C Y the inverse image T7! (F) is a closed subset in X. 


d) For any subset A C X one has T(A) C T(A). 
2. Isometric Spaces 
If there is a bijective mapping T: X —+ Y for two metric spaces X — (X, p) and Y — (Y, o) such that 


p(r,y) —- o(T(x),T(y)) Vo,ye€X, (12.77) 
then the spaces X and Y are called isometric, and T is called an isometry. 


12.3 Normed Spaces 


12.3.1 Notion of a Normed Space 
12.3.1.1 Axioms of a Normed Space 


Let X be a vector space over the field F. A function || - ||: X — RŻ is called a norm on the vector 
space X and the pair X — (X, || - ||) is called a normed space over the field F, if for arbitrary elements 
x,y E€ X and for any scalar a € F the following properties, the so-called axioms of a normed space, are 
fulfilled: 


(N1) |z| >0, and ||z|| =0 ifand only if x =0, (12.78) 
(Ν2) [αα| -Ξ |α|: ||] (homogenity), (12.79) 
(N3) | - vll € [lel] + Ilyll (triangle inequality). (12.80) 


A metric can be introduced by means of 

plz, y) =z- yl, x,y €X, (12.81) 
in any normed space. The metric (12.81) has the following additional properties which are compatible 
with the structure of the vector space: 

plz + z,y +z) = plz,y), zeX (132.824) 

plax, ay) — |o]p(x,y), «€F. (12.82b) 

So, in a normed space there are available both the properties of a vector space and the properties of a 
metric space. These properties are compatible in the sense of (12.82a) and (12.82b). The advantage is 
that most of the local investigations can be restricted to the unit ball 

B(0;1)={x €X: |z| <1} or B(06;1) 2 (ze X: |v| € 1) (12.83) 
since 

B(zor-(y€X:|y—-zl «rb 2x-rB(01, VxreX and Vr>0. (12.84) 
Moreover, the algebraic operations in a vector space are continuous, i.e., 

In >T, Ymy, n>a imply 

In +Yn voy, Antn om, [es] -» |3]||- (12.85) 
In normed spaces instead of (12.53) one may write for convergent sequences 

llen — zo| — 0 (n — oo). (12.86) 
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12.3.1.2 Some Properties of Normed Spaces 


Among the linear metric spaces, those spaces are normable (i.e., a norm can be introduced by means 
of the metric, if one defines |||] = p(x, 0)) whose metric satisfies the conditions (12.82a) and (12.82b). 








Two normed spaces X and Y are called norm isomorphic if there is a bijective linear mapping 7: X —> 
Y with ||Tx|| = ||2|| for all x € X. Let || - ||; and || - |a be two norms on the vector space X, and denote 
the corresponding normed spaces by X; and X», i.e., X; — (X, || - |;) and X2 ^ (X, || - ||2). 


The norm || - ||; is stronger than the norm || - ||2, if there is a number 7 > 0 such that ||a|]2 < y|I|I1, 
for all x € X. In this case, the convergence of a sequence {x,}°2, to x with respect to the stronger 
norm ||- |j, i.e., ||z, — x||i —9 0, implies the convergence to x with respect to the norm || - ||2, i.e. 


|z4 — x||a — 0. 

Two norms ||-|| and ||-||; are called equivalentifthere are two numbers q1 > 0, y2 > Osuch that Yx € X 
there holds 5i|||| € ||x||; X ylla||. In a finite dimensional vector space all norms are equivalent to 
each other. 





A subspace of a normed space is a closed linear subspace of the space. 


12.3.2 Banach Spaces 

A complete normed space is called a Banach space. Every normed space X can be completed into a 
Banach space X by the completion procedure given in 12.2.2.5, p. 668, and by the natural extension of 
its algebraic operations and the norm to X. 

12.3.2.1 Series in Normed Spaces 


In a normed space X infinite series can be considered. That means for a given sequence {£n}; of 
elements x, € X a new sequence {s;}?2, is constructed by 


81 — 44,89 — T1 + T2, ..., Sk = Tr t+ +E = Ska H Tk, (12.87) 
If the sequence {s, }?2, is convergent, i.e., ||s. — s|| — 0 (k — oo) for some s € X, then a convergent 
series is defined. The elements s1, $5, . .. , Sk, ... are called the partial sums of the series. The limit 
k 
s — lim x 12.88 
ie δη (12.88) 


oo oo 
is the sum of the series, and it is denoted by s = »; r,. A series > 2p is called absolutely convergent 
n=l n=1 
.. . co . . . 
if the number series 5; ||z,|| is convergent. In a Banach space every absolutely convergent series is 
n=1 


convergent, and ||s|| € y ||v,|| holds for its sum s. 
n=1 


12.3.2.2 Examples of Banach Spaces 





BA: F" with |z|| — p p)" , if l<p<oo; |z|] = max |&|, if p — oo. (13.894) 


E 1<k<n 
These normed spaces over the same vector space IF" are often denoted by P(n) (1 < p < oo). For 
1 € p « oo, they are called Euclidean spaces in the case of F = R, and unitary spaces in the case of 
F=C. 
B B: m with ||z|| ^ sup |£i]. (12.89b) 
k 





WB C: c and co with the norm from m. (12.89c) 


L 
BD: P with jz) =|[z||, = (x i") (1 € p < oo). (12.89d) 
n=1 
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B E: C(|a,b]) with |z|| — mas |a(t)]. (132.890) 
€|a, 1 
b Ῥ 
ΒΕ: Z((a,b) (1€ p «oo) with ||| — ||x|| = (frora) : (12.598 
k 
BG: CÓÜ([a,t]) with ||z|| — 2 max lx (£)|, where «(t) stands for x(t). (12.89g) 
i-o fete: 


12.3.2.3 Sobolev Spaces 


Let 2 C R" be a bounded domain, i.e., an open connected set, with a sufficiently smooth boundary 
ƏN. For n = 1 or n = 2,3 one can imagine 2 being something similar to an interval (a, b) or a bounded 
convex set. 

A function f: Q —+ R is k-times continuously differentiable on the closed domain €? if f is k-times 
continuously differentiable on €? and each of its partial derivatives has a finite limit on the boundary, 
i.e., if approaches an arbitrary point of OQ. In other words, all partial derivatives can be continuously 
extended on the boundary of Q, i.e., each partial derivative is a continuous function on €). In this vector 
space (for p € [1,0o)) and with the Lebesgue measure \ in R” (see example C in 12.9.1, 2., p. 695) the 
following norm is defined: 

i 


Il = rl ΠΟ E firea]. (12.90) 
. ΠΕ 
The resulting normed space is denoted by WF?(Q) or also by WF(Q) (in contrast to the space C9 ([a, b]) 


which has a quite different norm). Here a means a multi-indez, i.e., an ordered n-tuple (04,..., 04) of 
non-negative integers, where the sum of the components of a is denoted by |a| = ai + az +-+- + an- 


For a function f(x) — f(&,...,&) with a = (&,...,&,) € © the brief notation is used as in (12.90): 
alal f 
Ope 0g 


The normed space W*?(Q) is not complete. Its completion is denoted by W*?(Q) or in the case of 


per- (12.91) 


p — 2 by H*(Q) and it is called a Sobolev space. 
12.3.3 Ordered Normed Spaces 


1. Cones in a Normed Space 
Let X be areal normed space with the norm || - ||. A cone X, C X (see 12.1.4.2, p. 657) is called solid, 
if X, contains a ball (with positive radius), or equivalently, X; contains at least one interior point. 
E The usual cones are solid in the spaces R, C([a, b]), c, but in the spaces L?” ((a, b)) and P (1 < p < o) 
they are not solid. 
A cone X , is called normalif the norm in X is semi-monotonic, i.e., there exists a constant M > 0 such 
that 

0xz&y — ll € Ml. (12.92) 
If X is a Banach space ordered by a cone X,, then every (o)-interval is bounded with respect to the 
norm if and only if the cone X, is normal. 
W The cones of the vectors with non-negative components and of the non-negative functions in the 
spaces IR", m, c, co, C, I? and L7, respectively, are normal. 
A cone is called regular if every monotonically increasing sequence which is bounded above, 

Tı σσ (12.93) 
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is a Cauchy sequence in X. In a Banach space every closed regular cone is normal. 
E The cones in R”, I? and L? for 1 € p « oo are regular, but in C and m they are not. 
2. Normed Vector Lattices and Banach Lattices 
Let X be a vector lattice, which is a normed space at the same time. X is called a normed lattice or 
normed vector lattice (see [12.15], [12.19], [12.22], [12.23]), if the norm satisfies the condition 

|z| <|y| implies ||| € ||ull Vx.y € X (monotonicity of the norm). (12.94) 
A complete (with respect to the norm) normed lattice is called a Banach lattice. 


WB The spaces C([a,0]), L?, 1P, B([a,b]) are Banach lattices. 
12.3.4 Normed Algebras 


A vector space X over F is called an algebra, if in addition to the operations defined in the vector space 
X and satisfying the axioms (V1)-(V8) (see 12.1.1, p. 654), a product x-y € X is defined for every two 
elements z, y € X (or with a simplified notation by a product xy), such that for arbitrary x,y,z € X 
and a € F the following conditions are satisfied: 





(A1) a(yz) = (ay)z, (12.95) 
(A2) a(y+z2z)=ay+ 22, (12.96) 
(A3) (x-y)z — zz +yz, (12.97) 
(A4) a(ay) = (ax)y = x(ay). (12.98) 


An algebra is commutative if xy = yx holds for two arbitrary elements x,y. A linear operator (see 
(12.21), p. 658) T: X —> Y of the algebra X into the algebra Y is called an algebra homomorphism if 
for any x1, £2 E€ X: 

T(zi: 33) = Tz - Tx. (12.99) 
An algebra X is called a normed algebra or a Banach algebra if it is a normed vector space or a Banach 
space and the norm has the additional property 

læ -yll < lell Iyl- (12.100) 
In a normed algebra all the operations are continuous, i.e., additionally to (12.85), if zn —> x and 
Yn — y, then also £nyn — xy (see [12.20]). 
Every normed algebra can be completed to a Banach algebra, where the product is extended to the 
norm completion with respect to (12.100). 
E A: C([a,b]) with the norm (12.89e) and the usual (pointwise) product of continuous functions. 


E B: The vector space W ([0, 27]) of all complex-valued functions x(t) continuous on [0, 27] and having 
an absolutely convergent Fourier series expansion, i.e., 


oo 
v(t) 2 M ce", (12.101) 
n=—oo 
oo 
with the norm ||xz| = © |e,| and the usual multiplication. 
πς--οο 


MC: The space L(X) of all bounded linear operators on the normed space X with the operator norm 
and the usual algebraic operations (see 12.5.1.2, p. 677), where the product T'S of two operators is 
defined as the sequential application, i.e., T'S(x) — T(S(x)), x € X. 


Bl D: Thespace L!(—oo, oo) of all measurable and absolutely integrable functions on the real axis (see 
12.9, p. 693) with the norm 


TES Γ᾽ |x(t)| dt (12.102) 


oo 
is a Banach algebra if the multiplication is defined as the convolution (x * y)(t) = | x(t — s)y(s) ds. 
4J—00 
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12.4 Hilbert Spaces 


12.4.1 Notion ofa Hilbert Space 
12.4.1.1 Scalar Product 


A vector space V over a field F (mostly F = C) is called a space with scalar product or an inner product 
space or pre-Hilbert space if to every pair of elements x, y € V there is assigned a number (x, y) € F (the 
scalar product of x and y), such that the axioms of the scalar product are satisfied, i.e., for arbitrary 
1r,y,z€VandacF: 





(H1) (zr, x) Σ 0, (1.6., (υ, x) is real), and (x, 2) = 0 if and only if « = 0, (12.103) 
(H2) (ax, y) — a(x,y), (12.104) 
(H3) (x - y, 2) — (x, z) + (y, 2), (12.105) 
(HA) (x,y) — (y). (12.106) 


(Here w denotes the conjugate of the complex number w, which is denoted by w* in (1.133c). Sometimes 
the notation of a scalar product is (, y).) 

In the case of F = R, i.e., in a real vector space, (H4) means the commutativity of the scalar product. 
Some further properties follow from the axioms: 


(z,ay) =ā(z,y) and (z,y+ z)= (x,y) + (zx, z). (12.107) 
12.4.1.2 Unitary Spaces and Some of their Properties 
In a pre-Hilbert space H a norm can be introduced by means of the scalar product as follows: 

zl = y (x,x) (x €H). (12.108) 
A normed space H = (H, ||-||) is called unitary if there is a scalar product satisfying (12.108). Based on 
the previous properties of scalar products and (12.108) in unitary spaces the following facts are valid: 
a) Triangle Inequality: 

læ + yll < llæll + yl (12.109) 


b) Cauchy-Schwarz Inequality or Schwarz-Buniakowski Inequality (see also 1.4.2.9, p. 31): 


Ky) γα αγγ). (12.110) 


c) Parallelogram Identity: This characterizes the unitary spaces among the normed spaces: 


le + yl? + lle — vl? — 2 (Ic? - ll) - (12.111) 
d) Continuity of the Scalar Product: 
T4 v, y. — y. imply (zs, ya) — (x,y). (12.112) 


12.4.1.3 Hilbert Space 
A complete unitary space is called a Hilbert space. Since Hilbert spaces are also Banach spaces, they 
possess their properties (see 12.3.1, p. 669; 12.3.1.2, p. 670; 12.3.2, p. 670). In addition they have the 
properties of unitary spaces 12.4.1.2, p. 673. A subspace of a Hilbert space is a closed linear subspace. 
@ A: P(n), P and L?((a,b)) with the scalar products 

b ΕΕ 


(zy) = Y 6g; (x,y) = Y eine and (x,y) = [ x(t)y(t) dt. (12.113) 
k=1 k=1 oo 


B B: The space H?(O) with the scalar product 








(5509) = [ reso dra M ROO dx. (12.114) 
2 


ΠΕ 
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E C: Let y(t) be a measurable positive function on [a,b]. The complex space L?((a, b), v) of all mea- 
surable functions, which are quadratically integrable with the weight function y on (a,b), is a Hilbert 
space if the scalar product is defined as 


(x,y) = [ " e (Oy Dye t) dt. (12.115) 


12.4.2 Orthogonality 
Two elements x, y of a Hilbert space H are called orthogonal (denoted by x L y) if (x, y) = 0 (the notions 
of this paragraph also make sense in pre-Hilbert spaces and in unitary spaces). For an arbitrary subset 
A C H, the set 

At ={r EH: (x,y)=0 Vye A} (12.116) 
of all vectors which are orthogonal to each vector in A is a (closed linear) subspace of H and it is called 
the orthogonal space to A or the orthogonal complement of A. The notation A L B means that (x,y) = 0 
for all x € A and y € B. If A consists of a single element x, then the notation x L B is used. 
12.4.2.1 Properties of Orthogonality 
The zero vector is orthogonal to every vector of H. The following statements hold: 
a)z Lyandz.Lzimply z 1L (ay 4- 8z) for any a, 8 € C. 
b) From z L yn and yn > y it follows that x L y. 











c) z L Aif and only if z 1. lin(A), where lin(A) denotes the closed linear hull of the set A. 
d) If x L Aand A is a fundamental set, i.e., lin(A) is everywhere dense in H, then x = 0. 


e) Pythagoras Theorem: If the elements z4,..., x4 are pairwise orthogonal, that is 7, -L 2; for all 
k Al, then 


IX el? = X lel. (12.117) 
k=1 k=1 


f) Projection Theorem: If Ho is a subspace of H, then each vector x € H can be written uniquely 
as 


ατα απ αᾱ € Ho, z” L Ho. (12.118) 
g) Approximation Problem: Furthermore, the equation ||'|| — p(x, Ho) — infyen,(||x — y||) holds, 
and so the problem 

lc —y|| + inf, y € Ho (12.119) 


has the unique solution x’ in Ho. In this statement Ho can be replaced by a convex closed non-empty 
subset of H. 
The element z’ is called the projection of the element x on Ho. It has the smallest distance from x (to 


Ho), and the space H can be decomposed: H = Ho ® Hj. 
12.4.2.2 Orthogonal Systems 


A set (ze: € € E) of vectors from H is called an orthogonal system if it does not contain the zero vector 
and zg L x5, € z- rj, hence (xg, Fp) = dén holds, where 


0 for £ Z m 
denotes the Kronecker symbol (see 4.1.2, 10., p. 271). An orthogonal system is called orthonormal if 
in addition ||xe|| — 1 V €. 
In a separable Hilbert space an orthogonal system may contain at most countably many elements. 
Therefore = = IN is assumed from now on. 


iss { dor dh (12.120) 
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W A: The m 


1 1 1 1 
cos t, sin t, cos 2t, —— sin 2t, ... (12.121) 


van F vr ντ ντ 
in the real space L?((—7, 7)) and the system 
1 
van 


in the complex space L?((—7, 7)) are orthonormal systems. Both of these systems are called trigono- 
metric. 


W B: The € polynomials of the first kind (see 9.1.2.6, 2., p. 566) 


Pt) - 1. 


form an orthogonal system of elements in the space L?((—1, 1)). The corresponding orthonormal system 


P,(t) = \Jn+ : aa P,(t). (12.124) 


E C: The Hermite polynomials (see 9.1.2.6, 6., p. 568 and 9.2.4, 3., 602) according to the second 
definition of the Hermite differential equation (9.66b) 
απ 
e L eF 
dt" 
form an orthogonal system in the space L?((—oo, oo)). 
E D: The Laguerre polynomials form an orthogonal system (see 9.1.2.6, 5., p. 568) in the space 
L?((0,00)). 
Every orthogonal system is linearly independent, since the zero vector was excluded. Conversely, if 
U1,%,...%p,... is asystem of linearly independent elements in a Hilbert space H, then there exist vec- 
tors €1, 62, ...,€n; .. ., Obtained by the Gram—Schmidt orthogonalization method (see 4.6.2.2, 1., p. 316) 
which form an orthonormal system. They span the same subspace, and by the method they are deter- 
mined up to a scalar factor with modulus 1. 





e™ (n =0,+1,42,...) (12.122) 














"[(2—1)"] (n=0,1,...) (12.123) 


H,(t) = (n=0,1,...) (12.125) 


12.4.3 Fourier Series in Hilbert Spaces 
12.4.3.1 Best Approximation 


Let H be a separable Hilbert space and 
(e: n21,2,...) (12.126) 


a fixed orthonormal system in H. For an element x € H the numbers cp = (x, €n) are called the Fourier 
coefficients of x with respect to the system (12.126). The (formal) series 


Y Caen (12.127) 
n=1 


is called the Fourier series of the element x with respect to the system (12.126) (see 7.4.1.1, 1., p. 474). 
The n-th partial sum of the Fourier series of an element x has the property of the best approximation, 
i.e., for fixed n, the n-th partial sum of the Fourier series 

n 


On = M, Ck ek (12.128) 


k=1 
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n 
gives the smallest value of ||z — 35 axeg|| among all vectors of H, = lin({e1,...,én}). Furthermore, 
k=1 


Z— Op is orthogonal to H,, and there holds the Bessel inequality: 
co 


Lile sle, n= (Zen) (n =1,2,...). (12.129) 


n=1 


12.4.3.2 Parseval Equation, Riesz-Fischer Theorem 
The Fourier series of an arbitrary element x € H is always convergent. Its sum is the projection of 
the element a onto the subspace Ho = lin({en}2 1). If an element x € H has the representation 


oo 
© = X; ages, then a; are the Fourier coefficients of r (n — 1,2,...). If [o4 24 is an arbitrary 
n=1 


co 
sequence of numbers with the property >> |a;,|? < oo, then there is a unique element x in H, whose 
n=1 


Fourier coefficients are equal to a, and for which the Parseval equation holds: 


Σ \(x, en) |? = 3 α 


n=1 








2 = |æ]? (Riesz-Fischer theorem). (12.130) 


n 


An orthonormal system {ep } in H is called complete if there is no non-zero vector y orthogonal to every 


€; it is called a basis if every vector x € H has the representation « = Σ Qnen, 1.€., An = (, en) and x 
n=1 

is equal to the sum of its Fourier series. In this case, one also says that x has a Fourier expansion. The 

following statements are equivalent: 

a) {e,} is a fundamental set in H. 

b) {en} is complete in H. 

c) {en} is a basis in H. 


d) For V z,y € H with the corresponding Fourier coefficients c, and d, (n — 1,2,...) there holds 
(t,y) = do ends. (12.131) 
τις] 


e) For every vector x € H, the Parseval equation (12.130) holds. 
E A: The trigonometric system (12.121) is a basis in the space L?((—7, 7)). 


WB B: The system of the normalized Legendre polynomials (12.124) P,(t) (n — 0,1,...) is complete 
and consequently a basis in the space L?((—1, 1)). 


12.4.4 Existence of a Basis, Isomorphic Hilbert Spaces 


In every separable Hilbert space there exits a basis. From this fact it follows that every orthonormal 
system can be completed to a basis. 

Two Hilbert spaces H; and Ho are called isometric or isomorphic as Hilbert spaces if there is a linear 
bijective mapping T: H; —+ Hy with the property (Tr, T'y)g, = (x,y)u, (that is, it preserves the 
scalar product and because of (12.108) also the norm). Any two arbitrary infinite dimensional separable 
Hilbert spaces are isometric, in particular every such space is isometric to the separable space I”. 
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12.5 Continuous Linear Operators and Functionals 


12.5.1 Boundedness, Norm and Continuity of Linear Operators 


12.5.1.1 Boundedness and the Norm of Linear Operators 
Let X = (X, || - ||) and Y = (Y, || - ||) be normed spaces. In the following discussion the index X in 
the notation ||- ||% , which emphasizes being in the space X , is omitted, because from the text it will 
be always clear, which norms and spaces are considered. An arbitrary operator T: X —9 Y is called 
bounded if there is a real number \ > 0 such that 

|Z (z)|| < Aljel] VaeeX. (12.132) 
A bounded operator with a constant À “stretches” every vector at most A times and it transforms every 
bounded set of X into a bounded set of Y, in particular the image of the unit ball of X is bounded in Y. 
This last property is characteristic of bounded linear operators. A linear operator is continuous (see 
12.2.3, p. 668) if and only if it is bounded. 
The smallest constant A, for which (12.132) still holds, is called the norm of the operator T and it is 
denoted by ||Z'], i.e., 

ITI := inf{à > 0 : [[Tz|| € Al|z]|, x € X). (12.133) 
For a continuous linear operator the following equalities hold: 








ITI = sup ||Τα|| -- sup |[Tx|| = sup Τα] (12.134) 
σι llæl|<1 σι 
and, furthermore, the following estimation holds 
[Τα] x TI- lel VaeX. (12.135) 
WB Let T be the operator in the space C({a, b]) with the norm (12.89e), defined by the integral 
(Tx)(s) = y(s) = T. K(s,t)x(t)dt (s € [a,6]), (12.136) 


where K(s, t) is a (complex-valued) continuous function on the rectangle (a € s,t X b). Then T isa 
bounded linear operator, which maps C([a, b]) into C([a, b]). Its norm is 


b 
IIl = max [ |K(s,t)| dt. (12.137) 


12.5.1.2 The Space of Linear Continuous Operators 
The sum U = S + T and the multiple oT' of two linear (continuous) operators S, T: X —> Y are 
defined point-wise: 

U(x) = S(a)+T(a), (aT)(x)=a-T(x), Vee X and Vac F. (12.138) 
The set L(X, Y), often denoted by B(X, Y), of all linear continuous operators T' from X into Y equipped 
with the operations (12.138) is a vector space, where ||T'|| (12.133) turns out to be a norm on it. So, 
L(X, Y) is a normed space and even a Banach space if Y is a Banach space. So the axioms (V1)-(V8) 
and (IN1)-(N3) are satisfied (see 12.1.1, p. 654, 12.3.1, p. 669). 
If Y — X, then a product can be defined for two arbitrary elements S, T € L(X, X) — L(X) — B(X) as 

(ST)(r)-— S(Tvx) σεχ), (12.139) 
which satisfies the axioms (A1)-(A4) from 12.3.4, p. 672, and also the compatibility condition (12.100) 
with the norm. L(X) is in general a non-commutative normed algebra, and if X is a Banach space, then 
it is a Banach algebra. Then for every operator T € L(X) its powers are defined by 

P=], TST OE πι τι), (12.140) 
where J is the identity operator Jz =x, Vx € X. Then 

2" || < FP (n =0,1,...), (12.141) 
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and furthermore there always exists the (finite) limit 


r(T) = tim WIT", (12.142) 


which is called the spectral radius of the operator T and satisfies the relations 
κ πι, v(T") ο], r(aT) = |aļr(T), r(T) τα, (12.143) 
where T* is the adjoint operator to T' (see 12.6, p. 684, and (12.159)). If L(X) is complete, then for 
[A] » r(T), the operator (AI — T)-! has the representation in the form of a Neumann series 
(XI—TY) X TDEX CTGg aX "Pu. uu (12.144) 
which is convergent for |A| 7 r(T) in the operator norm on L(X). 


12.5.1.3 Convergence of Operator Sequences 


1. Point-wise Convergence 
of a sequence of linear continuous operators Th: X — Y to an operator T: X —+ Y means that: 
Tax — Tx inYforeachz € X. (12.145) 
2. Uniform Convergence 
The usual norm-convergence of a sequence of operators {T,,}°2, in a space L(X, Y) to T,ie., 
||T, —T|| = sup |The — Tel — 0 (n> oo) (12.146) 
lle] t 
is the uniform convergence on the unit ball of X. It implies point-wise convergence, while the converse 
statement is not true in general. 
3. Applications 


The convergence of quadrature formulas when the number n of interpolation nodes tends to oo, the 
performance principle of summation, limiting methods, etc. 


12.5.2 Linear Continuous Operators in Banach Spaces 

Now X and Y are supposed to be Banach spaces. 

1. Banach-Steinhaus Theorem (Uniform Boundedness Principle) 

The theorem characterizes the point-wise convergence of a sequence (T; ) of linear continuous operators 
Tn to some linear continuous operator by the conditions: 

a) For every element from an everywhere dense subset D C X, the sequence {Tax} has a limit in Y, 
b) there is a constant C such that ||7;,|| € C, V n. 

2. Open Mappings Theorem 


The theorem tells us that a linear continuous operator mapping from X onto Y is open, i.e., the image 
T(G) of every open set G from X is an open set in Y. 
3. Closed Graph Theorem 
An operator T: Dr — Y with Dr C X is called closed if x, € Dr, £n > roin X and Tr, > yg in Y 
imply 20 € Dr and yo = Txo. A necessary and sufficient condition is that the graph of the operator T 
in the space X x Y, i.e., the set 

Ir = {(x,Tx): x € Dr} (12.147) 
is closed, where here (a, y) denotes an element of the set X x Y. 
If T is a closed operator with a closed domain Dry, then T is continuous. 
4. Hellinger-Toeplitz Theorem 
Let T be a linear operator in a Hilbert space H. If (x, Ty) = (Tx, y) for every x,y € H, then T is 
continuous (here (x, Ty) denotes the scalar product in H). 
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5. Krein-Losanovskij Theorem on the Continuity of Positive Linear 

Operators 
If X = (X, X4,]] - |) and Y = (¥,Y4, || - ||) are ordered normed spaces, where X, is a generating 
cone, then the set L,(X, Y) of all positive linear and continuous operators T, i.e., T(X+) C Y4, isa 
cone in L(X, Y). The theorem of Krein and Losanovskij asserts (see [12.17]): If X and Y are ordered 
Banach spaces with closed cones X, and Y+, and X is a generating cone, then the positivity of a linear 
operator implies its continuity. 
6. Inverse Operator 
Let X and Y be arbitrary normed spaces and let T': X —9 Y be a linear, not necessarily continuous 
operator. T' has a continuous inverse T-!: Y —5 X, if T(X) — Y and there exists a constant m > 0 


general than that in (12.22) (see 12.1.5.2, p. 659), since there may be D # X and R(T) # Y. 

In the case of Banach spaces X, Y the following theorem is valid: 

7. Banach Theorem on the Continuity of the Inverse Operator 

If T is a linear continuous bijective operator from X onto Y, then the inverse operator T7! is also 
continuous. 


1 
such that ||T'x|| πια] for each x € X. Then ||T7}]| < —. The situation considered here is less 
m 


An important application is, e.g., the continuity of (AZ — T)~! given the injectivity and surjectivity of 
AI — T. This fact has importance in investigating the spectrum of an operator (see 12.5.3.2, p. 680). 
It also applies to the 

8. Continuous Dependence of the Solution 

on the right-hand side and also on the initial data of initial value problems for linear differential equa- 
tions. This fact is demonstrated by the following example. 


W Tho initial value problem 

#(t) 4- py (t) (£) + pa(t)z(t) — q(t), te€[a,b], x(to) =€, d(to) =€, to € [a, t] (12.148a) 
with coefficients p1(t), po(t) € C([a, b]) has exactly one solution zx from C?([a, b] for every right-hand 
side q(t) € C({a, b]) and for every pair of numbers £, £. The solution x depends continuously on q(t), € 
and £ in the following sense. If gn(t) € C([a,0]),£, &, € IR! are given and x, € C({a, b]) denotes the 
solution of 

Gn(t) + pi(t)en(t) + po(t)an(t) = gqn(t), Enla) = En, tnla) = ĉn, (12.148b) 
for n — 1,2,..., then: 

a(t) > g(t) in C([a, UJ), 

En =F E, implies 2» — z in the space C?({a, b]). (12.148c) 


& > & 


9. Method of Successive Approximation 
to solve an equation of the form 
α- Ταν (12.149) 
with a continuous linear operator T' in a Banach space X for a given y. This method starts with an 
arbitrary initial element xo, and constructs a sequence {xp } of approximating solutions by the formula 
Ing1 =YtTr, (n=0,1,...). (12.150) 
This sequence converges to the solution z* in X of (12.149). The convergence of the method, i.e., 
Xn —> x*, is based on the convergence of the series (12.144) with \ = 1. 
Let ||T|| < q < 1. Then the following statements are valid: 





1 
a) The operator J — T has a continuous inverse with ||(J — T)~1|| < = and (12.149) has exactly 
—4 


one solution for each y. 
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b) The series (12.144) converges and its sum is the operator (I — T)-1. 
c) The method (12.150) converges to the unique solution x* of (12.149) for any initial element xo, if 
the series (12.144) converges. Then the following estimation holds: 


|a — z*|| € [gl Pee —2o|| (n=1,2,...). (12.151) 


Equations of the type 
a-pTx=y, Atc-Txr=y, wrAECF (12.152) 
can be handled in an analogous way (see 11.2.2, p. 625, and. [12.8]). 


12.5.3 Elements ofthe Spectral Theory of Linear Operators 
12.5.3.1 Resolvent Set and the Resolvent of an Operator 


For an investigation of the solvability of equations one tries to rewrite the problem in the form 

(1—T)r—y (12.153) 
with some operator T' having a possible small norm. This is especially convenient for using a functional 
analytic method because of (12.143) and (12.144). In order to handle large values of ||T'|| as well, it is 
necessary to investigate the whole family of equations 

(AI— T)r—-y r€X, with λες (12.154) 
in a complex Banach space X. Let T' be a linear, but in general not a bounded operator in a Banach 
space X. The set o(T) of all complex numbers A such that (A7 — T)! € B(X) = L(X) is called the 
resolvent set and the operator Ry = R(T) = (AI — T)~} is called the resolvent. Let T now be a 
bounded linear operator in a complex Banach space X. Then the following statements are valid: 
a) The set o(T) is P More precisely, if Ag € o(T) and A € C satisfy the inequality 





PES TES (12.155) 
Wr 
then Ry exists and 
Ry = Ra + (À — λο) Ελ, + A = Ao)? RR, +. = 3 (A - AQ)! BS. (12.156) 
k=1 
b) {å € C: JA| > |ITI|} C e(T). More precisely, V À € € with |A| 7 ||T||, the operator R, exists and 
py. No pe 
Hee mus (12.157) 


c) || Ra — Rall — 0, if A —> Ao. (A, Ao € O(T)), and ||Ry|]| 50, i£A— coo. (A € e(T)). 


d) aa- R „if A > Ao. 








e) For an arbitrary functional f € X* (see 12.5.4.1, p. 681) and arbitrary x € X the function F(A) = 
f(Ry(x)) is holomorphic on o(T). 

f) For arbitrary A, i € o(T), and A Z p one has: 
Ry — R, 


FAR, = Ryley = E. 


(12.158) 


12.5.3.2 Spectrum of an Operator 


1. Definition of the Spectrum 

The set o(T) = C\ o(T) is called the spectrum of the operator T. Since I — T has a continuous inverse 
(and consequently (12.153) has a solution, which continuously depends on the right-hand side) if and 
only if 1 € o(Z), the spectrum o(T) must be known as well as possible. From the properties of the 
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resolvent set it follows immediately that the spectrum o(T) is a closed set of C which lies in the disk 
(A € €: |A| € I[T]|), however, in many cases c (T) is much smaller than this disk. The spectrum of any 
linear continuous operator on a complex Banach space is never empty and 

r(T) = sup [Àl]. (12.159) 

λεσ(Τ) 

It is possible to say more about the spectrum in the cases of different special classes of operators. 
If T is an operator in a finite dimensional space X and if the equation (AJ — T')x = 0 has only the trivial 
solution (i.e., AJ — Tis injective), then A € o(T) (ie., AZ — T is surjective). If this equation has a 
non-trivial solution in some Banach space, then the operator AJ — T is not injective and (AJ — T)~1 is 
in general not defined. 


The number A € C is called an eigenvalue of the linear operator T, if the equation Ax = Tx has a non- 
trivial solution. All those solutions are called eigenvectors, or in the case when X is a function space 
which occurs very often in applications), they are called eigenfunctions of the operator T associated 
to A. The subspace spanned by them is called the eigenspace (or characteristic space) associated to A. 
The set a, (T) of all eigenvalues of T is called the point spectrum of the operator T. 


~ 


2. Comparison to Linear Algebra, Residual Spectrum 

An essential difference between the finite dimensional case which is considered in linear algebra and the 
infinite dimensional case discussed in functional analysis is that in the first case o(T) = op(T) always 
holds, while in the second case the spectrum usually also contains points which are not eigenvalues of 
T. If AI — T is injective and surjective as well, then λ € o(T) due to the theorem on the continuity 
of the inverse (see 12.5.2, 7., p. 679). In contrast to the finite dimensional case where the surjectivity 
follows automatically from the injectivity, the infinite dimensional case has to be dealt with in a very 
different way. 

The set o-(T) of all A € o(T), for which AI — T is injective and Im(AJ — T) is dense in X, is called 
the continuous spectrum and the set o,(T) of all \ with an injective AJ — T and a non-dense image, is 
called the residual spectrum of operator T. 





For a bounded linear operator T' in a complex Banach space X 
o(T) = 0,(T) Uo(T) Vo,(T), (12.160) 


where the terms of the right-hand side are mutually disjoint. 


12.5.4 Continuous Linear Functionals 


12.5.4.1 Definition 


For Y = F a linear mapping is called a linear functional or a linear form. In the following discussions, 
for a Hilbert space the complex case is considered; in other situations almost every times the real case 
is considered. The Banach space L(X, F) of all continuous linear functionals is called the adjoint space 
or the dual space of X and it is denoted by X* (sometimes also by X"). The value (in F) of a linear 
continuous functional f € X* on an element x € X is denoted by f (x), often also by (x, f) — emphasizing 
the bilinear relation of X and X* — (compare also with the Riesz theorem (see 12.5.4.2, p. 682). 


ΒΛ: Let ty, t,...,tn be fixed points of the interval {a,b] and cy, c2,...,C, real numbers. By the 
formula 


f(a) = Y ash) (12.161) 


n 
a linear continuous functional is defined on the space C((a, b]); the norm of f is || f|| = > |ce|. A special 
k=l 


case of (12.161) for a fixed t € [a,b] is the ô functional 
d(x) = a(t) (x € C({a,d])). (12.162) 
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WB B: With an integrable function (t) (see 12.9.3.1, p. 696) on [a, b] 
b 
f(x) = | e(t)a(t) dt (12.163) 
is a linear continuous functional on C([a, 6]) and also on B([a, b]) in each case with the norm [|f|| — 


[ ena. 


12.5.4.2 Continuous Linear Functionals in Hilbert Spaces. 
Riesz Representation Theorem 


In a Hilbert space H equipped with the scalar product (*,-) every element y € H defines a linear 
continuous functional by the formula f(x) = (x,y), where its norm is || /|| — ||y]|. Conversely, if f is a 
linear continuous functional on H, then there exists a unique element y € H such that 


f(x) =(a,y) Va eH, (12.164) 
where || f|| = ||y||. According to this theorem the spaces H and H* are isomorphic and might be iden- 
tified. 


The Riesz representation theorem contains a hint on how to introduce the notion of orthogonality in 
an arbitrary normed space. Let A C X and A* C X*. The sets 


At ={feX:f(z)=0 Vee A} and AM ={x EX: f(z) =0 Vf e ΑΠ (12.165) 


are called the orthogonal complement or the annulator of A and A*, respectively. 


12.5.4.3 Continuous Linear Functionals in L? 


1 
Let p 7 1. The number q is called the conjugate exponent to pif — + — — 1, where it is assumed that 
D d 


q = œ in the case of p = 1. 
E Based on the Hólder integral inequality (see 1.4.2.12, p. 32) the functional (12.163) can be considered 


1. 1 

also in the spaces Z7([a, b]) (1 € p € oo) (see 12.9.4, p. 697) if v € L*([a, 0]) and — -- - — 1. Its norm 
D d 

is then 


1 
b q 
νομὴ .. αξ α-φέτα 
Ill = lvl = (/ 


ess. sup |y(t)|, if p=1 
te [a,b] 


(12.166) 


(with respect to the definition of ess. sup |p| see (12.221), p. 698). To every linear continuous functional 
f in the space L”((a, b]) there is a uniquely (up to its equivalence class) defined element y € L^(|a, 0]) 
such that 


q 


b b 
f) (9) - [ s(0y(Ddt, ver and [f= lola = / vora) : (12.167) 
For the case of p = co see [12.15]. 


12.5.5 Extension ofa Linear Functional 


1. Semi-Norm 

A mapping p: X — R of a vector space X is called a semi-norm or pseudonorm, if it has the following 
properties: 

(HN1) p(x) > 0, (12.168) 
(ΠΝ2) p(az) = |alp(z), (12.169) 
(HN3) ple +y) < ple) + p(y). (12.170) 
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Comparison with 12.3.1, p. 669, shows that a semi-norm is a norm if and only if p(z) — 0 holds only 
for x — 0. 

Both for theoretical mathematical questions and for practical reasons in applications of mathematics, 
the problem of the extension of a linear functional given on a linear subspace Xo C X to the entire 
space (and, in order to avoid trivial and uninteresting cases) with preserving certain * good" properties 
became a fundamental question. The solution of this problem is guaranteed by 

2. Analytic Form of the Hahn-Banach Extension Theorem 

Let X be a vector space over F and p a pseudonorm on X. Let Xo be a linear (complex in the case of 
F = C and real in the case of F = R) subspace of X, and let fọ be a (complex-valued in the case of 
F = C and real-valued in the case of F = R) linear functional on Xo satisfying the relation 


|fo(z)| € p(z) Vx € Xo. (12.171) 
Then there exists a linear functional f on X with the following properties: 
f(x) = fo(z) Vee Xo, |f(x)|< p(x) VaeX. (12.172) 


So, f is an extension of the functional fg onto the whole space X preserving the relation (12.171). 

If Xo is a linear subspace of a normed space X and fo is a continuous linear functional on Xo, then 
p(x) = || foll- ||2|| is a pseudonorm on X satisfying (12.171), so the Hahn-Banach extension theorem for 
continuous linear functionals is obtained. 





Two important consequences are: 
1. For every element x # 0 there is a functional f € X* with f(x) — |[z|| and ||f|| = 1. 
2. For every linear subspace Χρ ς X and 29 ¢ Xo with the positive distance d = infzcx, ||x — xo|| » 0 
there is an f € X* such that 
1 
f(r)—-0 Vr€ Xo, f(xo)-—1 and | - d (12.173) 
12.5.6 Separation of Convex Sets 


1. Hyperplanes 

A linear subset L of the real vector space X, L Z X, is called a hypersubspace or hyperplane through 0 
if there exists an £o € X such that X = lin(xo, L). Sets of the form x + L (L a linear subset) are affine- 
linear manifolds (see 12.1.2, p. 655). If L is a hypersubspace, these manifolds are called hyperplanes. 
There exist the following close relations between hypersubspaces, hyperplanes and linear functionals: 
a) The kernel f~1(0) = {x € X: f(x) = 0} ofa linear functional f on X is a hypersubspace in X, and 
for each number A € R there exists an element zy € X with f(z4) 2 A and f-1(A) = 2) + f71(0). 

b) For any given hypersubspace L C X and each x9 € L and A Z 0 (A € RR) there always exists a 
uniquely determined linear functional f on X with f-1(0) = L and f(x) = A. 

The closedness of f~1(0) in the case of a normed space X is equivalent to the continuity of the functional 
f: 

2. Geometric Form of the Hahn-Banach Extension Theorem 

Let X be a normed space, zo € X and La linear subspace of X. Then for every non-empty convex open 
set K which does not intersect the affine-linear manifold zo 4- L, there exists a closed hypersubspace H 
such that zo + L C H and HA K =f. 

3. Separation of Convex Sets 

Two subsets A, B of a real normed space X are called separated by a hyperplane if there is a functional 
f € X* such that: 


sup f(x) < inf f(y). (12.174) 
2ΕΑ ye 


The separating hyperplane is then given by f~1(a) with a = sup,c4 f(x), which means that the two 
sets are contained in the different half-spaces 
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AcCíreX: f(r)zxa) and B C (ze X: f(z) 2 aj. (12.175) 


In Fig. 12.5b,c two cases of the separation by a hyperplane are shown. 

Their disjointness is less decisive for the separation of two sets. In fact, Fig. 12.5a shows two sets E 
and B, which are not separated although E and B are disjoint and B is convex. The convexity of both 
sets is the intrinsic property for separating them. In this case it is possible that the sets have common 
points which are contained in the hyperplane. 


a) b) 





Figure 12.5 


If A is a convex set of a normed space X with a non-empty interior Jnt(A) and B C X is a non-empty 
convex set with Int(A) n B = 0, then A and B can be separated. The hypothesis Int(A) # 0 in that 
statement cannot be dropped (see [12.3], example 4.47). A (real linear) functional f € X* is called 
a supporting functional of the set A at the point £o € A, if there is a real number À € R such that 
f(vo) =A, and Ac {x € X: f(x) € A). f-1(A) is called the supporting hyperplane at the point zo. 
For a convex set A with a non-empty interior, there exists a supporting functional at each of its bound- 
ary points. 

Remark: The famous Kuhn-Tucker theorem (see 18.2, p. 925) which yields practical methods to de- 
termine the minimum of convex optimization problems (see [12.5]), is also based on the separation of 
convex sets. 


12.5.7 Second Adjoint Space and Reflexive Spaces 
The adjoint space X* of a normed space X is also a normed space if it is equipped with the norm ||f|| — 


sup |f(x)|, so (X*)* = X** — the second adjoint space to X can also be considered. The canonical 
lll t 


embedding 

J: X — X" with Jx=F,, where F,(f)= f(a) V fe xX* (12.176) 
is a norm isomorphism (see 12.3.1, p. 669), hence X is identified with the subset J(X) C X**. A Banach 
space X is called reflexive if J(X) = X**. Hence the canonical embedding is then a surjective norm 
isomorphism. 
W Every finite dimensional Banach space and every Hilbert space is reflexive, as well as the spaces 
L? (1 € p « oo), however C([a, b]), L([0, 1]), co are examples of non-reflexive spaces. 


12.6 Adjoint Operators in Normed Spaces 
12.6.1 Adjoint of a Bounded Operator 


For a given linear continuous operator T: X — Y (X,Y are normed spaces) to every g € Y* there 

is assigned a functional f € X* by f(x) = g(Tx), V x € X. In this way, a linear continuous operator 
T*: Y* — X*, (T*g)(zx) 2 g(Tx), VgeY* and VrzeX (12.177) 

is obtained which is called the adjoint operator of T and has the following properties: 

(T+S)* =T*+S5*, (ST)* = S*T*, ||T*|| = ||T||, where for the linear continuous operators 7: X > Y 

and S: Y + Z (X, Y, Z normed spaces), the operator $T: X — Z is defined in the natural way 
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as ST(x) — S(T(x)) (see 12.3.4, M C, p. 672). With the notation introduced in 12.1.5, p. 658, and 
12.5.4.2, p. 682, the following identities are valid for an operator T € B(X, Y): 

Im(T)-ker(T*), Im(T*)- ker(T)*, (12.178) 
where the closedness of /m(T') implies the closedness of /m(T*). 
The operator T**: X** — Y**, obtained as (7*)* from T*, is called the second adjoint of T. Due to 
(T*(F,))g — F,(1*g) — (1*g)(x) ^ g(Tv) — Frz(g) the operator T** has the following property: If 
F,c€X",then T* F, — Fr, € Y". Hence, the operator T**: X** + Y** is an extension of T. 
In a Hilbert space H the adjoint operator can also be introduced by means of the scalar product 
(Tx,y) = (a,T*y), x,y € H. This is based on the Riesz representation theorem, where the iden- 
tification of H and H** implies (AT)* = AT*, I* = I and even T*™* = T. If T is bijective, then the same 
holds for T*, and also (T*)-! — (T71)*. For the resolvents of T' and T* there holds 

[f (D = R(T), (12.179) 
from which o(T*) = {X: \ € o(T)} follows for the spectrum of the adjoint operator. 
@ A: Let T be an integral operator in the space L?([a, b]) (1 < p « oo) 


(Tx)(s) = [ ” K(s,t)x(t) dt (12.180) 


with a continuous kernel K (s, t). The adjoint operator of T is also an integral operator, namely 


b 

(T*g)(t) = Í K*(t,s)yq(s) ds (12.181) 
with the kernel K*(s,t) = K(t,s), where y, is the element from L! associated to g € (L”)* according 
to (12.167). 


HB: Ina finite dimensional complex vector space the adjoint of an operator represented by the matrix 
A = (aj;) is defined by the matrix A* with aj; = Gi. 


12.6.2 Adjoint Operator of an Unbounded Operator 
Let X and Y be real normed spaces and T' a (not necessarily bounded) linear operator with a (linear) 
domain D(T) C X and values in Y. For a given g € Y*, the expression g(T'x), depending obviously 
linearly on x, is meaningful. Now the question is: Does there exist a well-defined functional f € X* 
such that 

f(x)-—-9g(Tv) Vx € D(T). (12.182) 
Let D* C Y* be the set of all those g € Y* for which the representation (12.182) holds for a certain 


f € X*. If D(T) — X, then for given g the functional f is uniquely defined. So a linear operator T* is 
defined by f — T*g with D(T*) — D*. 'Then for arbitrary x € D(T) and g € D(T*) 

g(Tv) — (T*g)(x). (12.183) 
The operator T* turns out to be closed and is called the adjoint of T'. The naturalness of this general 


procedure stems from the fact that D(T*) — Y* holds if and only if T is bounded on D(T). In this case 
T* € B(Y*, X*) and ||T*|| = ||T|| hold. 


12.6.3 Self-Adjoint Operators 
An operator T € B(H) (H is a Hilbert space) is called self-adjoint if T* = T. In this case 
(Tv,y) 2 (x, Ty), x, y € H (12.1842) 
is valid and the number (Tx, x) is real for each x € H. Then the equality 
ITI = E |(Tx,x)| (12.184b) 
zl=1 
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holds and with m = m(T) = re (Pe, x) and M = M(T) = sup (Tx, x) also the relations 
m Ισ 
m(T)|z|? € (Tz, z) x M(T)|z|? and [ΤΙ -- τ(Τ) -- ππακ{[πα], ΜΠ (12.185) 
are valid. The equality (12.184a) characterizes the self-adjoint operators. The spectrum of a self-adjoint 
(bounded) operator lies in the interval [m, M] and m, M € o(T) holds. 


12.6.3.1 Positive Definite Operators 
A partial ordering can be introduced in the set of all self-adjoint operators of B(H) by defining 

T20 ifandonlyif (Tr,v) 20 VzeH. (12.186) 
An operator T with T > 0 is called positive (or, more exactly positive definite). For any self-adjoint 
operator T (with (H1) from 12.4.1.1, p. 673), (T?a,x) = (Tz,Tx) > 0, so T? is positive definite. 
Every positive definite operator T' possesses a square root, i.e., there exists a unique positive definite 
operator W such that W? = T. Moreover, the vector space of all self-adjoint operators is a K-space 
(Kantorovich space, see 12.1.7.4, p. 660), where the operators 


1 1 
Ir -vr?, T*-z(T|-T) T -s(rT|-T) (12.187) 


are the corresponding elements with respect to (12.37). They are of particular importance for the 
spectral decomposition and spectral and integral representations of self-adjoint operators by means of 
some Stieltjes integral (see 8.2.3.1, 2., p. 506, and. [12.1], [12.11], [12.12], [12.15], [12.18]). 

12.6.3.2 Projectors in a Hilbert Space 

Let Ho be a subspace of a Hilbert space H. Then every element x € H has its projection z’ onto Ho 
according to the projection theorem (see 12.4.2, p. 674), and therefore, an operator P with Px — z' is 
defined on H with values in Ho. P is called a projector onto Ho. Obviously, P is linear, continuous, and 
||P|| = 1. A continuous linear operator P in H is a projector (onto a certain subspace) if and only if: 
a) P = P*, i.e., P is self-adjoint, and 

b) P? = P, i.e., P is idempotent. 


12.7 Compact Sets and Compact Operators 
12.7.1 Compact Subsets ofa Normed Space 


A subset A of a normed space! X is called 

e compact, if every sequence of elements from A contains a convergent subsequence whose limit lies in 
A, 

e relatively compact or precompact if its closure (see 12.2.1.3, p. 664) is compact, i.e., every sequence of 
elements from A contains a convergent subsequence (whose limit does not necessarily belong to A). 
This is the Bolzano—Weierstrass theorem in real calculus for bounded sequences in IR", and one says 
that such a set has the Bolzano—Weierstrass property. 

Every compact set is closed and bounded. Conversely, if the space X is finite dimensional, then every 
such set is compact. The closed unit ball in a normed space X is compact if and only if X is finite 
dimensional. 

For some characterizations of relatively compact subsets in metric spaces (the Hausdorff theorem on 
the existence of a finite e-net) and in thespacess, C (Arzela-Ascoli theorem) and in the spaces L^(1 « 
p « oo) see [12.15]. 


12.7.2 Compact Operators 
12.7.2.1 Definition of Compact Operator 


An arbitrary operator T': X —> Y of a normed space X into a normed space Y is called compact if the 





TIt is enough that X is a metric (or an even more general) space. This generality is not used in what follows. 
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image TA) of every bounded set A C X is a relatively compact set in Y. If, in addition the operator 
T is also continuous, then it is called completely continuous. Every compact linear operator is bounded 
and consequently completely continuous. For a linear operator to be compact it is sufficient to require 
that it transforms the unit ball of X into a relatively compact set in Y. 


12.7.2.2 Properties of Linear Compact Operators 

A characterization by sequences of the compactness of an operator from B(X, Y) is the following: For 
every bounded sequence {x,,}°2, from X the sequence (T, 7? , contains a convergent subsequence. 
A linear combination of compact operators is also compact. If one of the operators U € B(W, X), T € 
B(X,Y), S € B(Y,Z) in each ofthe following products is compact, then the operators TU and ST are 
also compact. If Y is a Banach space, then the following important statements are valid. 

a) Convergence: If a sequence of compact operators (T; 9. is convergent in the space B(X, Y), then 
its limit is a compact operator, too. 

b) Schauder Theorem: If T is a linear continuous operator, then either both T and T* are compact 
or both are not. 


c) Spectral Properties of a Compact Operator T in an (Infinite Dimensional) 

Banach Space X: The zero belongs to the spectrum. Every non-zero point of the spectrum o(T) 
is an eigenvalue with a finite dimensional eigenspace X, = {a € X: (AI — T)x = O}, andVe > 0 
there is always only a finite number of eigenvalues of T outside the circle {|A| < £}, where only the zero 
can be an accumulation point of the set of eigenvalues. If À — 0 is not an eigenvalue of T', then T'71 is 
unbounded if it exists. 


12.7.2.3 Weak Convergence of Elements 

A sequence {x,,}°, of elements of a normed space X is called weakly convergent to an element xo if for 
each f € X* the relation f(a») + f (ao) holds (written as: z, — xo or as z, — a). 

Obviously: x, — xo implies x, — 2. If Y is another normed space and T: X —> Y is a continuous 
linear operator, then: 

a) v, — zo implies Tx, = T xo, 

b) if T is compact, then £n — xo implies Tzn > Txo. 

W A: Every finite dimensional operator is compact. From this fact it follows that the identity operator 
in an infinite dimensional space cannot be compact (see 12.7.1, p. 686). 

E B: Suppose X = P?, and let T' be the operator in P? given by the infinite matrix 


ti ti tig --- 

£31 122 t23 > E oo 

iac oc with Tr = (Σ ἔκαμ... Ὁ, ἑηικσκ,.. ) . (12.188) 
9 x Rt REl 


If ΣΣ [κε -- Μ « oo, then T is a compact operator from P into P with ||T|| « M. 
k n=l 

E C: The integral operator (12.136) is a compact operator in the spaces C([a, b]) and L^((a,b)) (1 < 

p < oo). 


12.7.3 Fredholm Alternative 


Let T be a compact linear operator in a Banach space X. The following equations (of the second kind) 
are considered with a parameter À Æ 0: 
Azy— Tr —y, λαο τα 50, 


Af-T'fog, M -T'f-0. (12.189) 
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The following statements are valid: 

a) dim(ker(AJ—T)) = dim(ker(AJ —7*)) < +00, i-e., both homogeneous equations always have the 
same number of linearly independent solutions. 

b) Im(Al — T) 2 ker(AJ — T*)*. and* Im(AI — T*) = ker(AJ — T)+. 

c) Im(AJ — T) — X if and only if ker(A7 — T) — 0. 

d) The Fredholm alternative (also called the Riesz-Schauder theorem): 


a) Either the homogeneous equation has only the trivial solution. In this case A € o(T), the operator 
(AI — T)-1 is bounded, and the inhomogeneous equation has exactly one solution z — (A1 — T)-1y for 
arbitrary y € X. 

B) Or the homogeneous equation has at least one non-trivial solution. In this case A is an eigenvalue of 
T,ie.,A € c(T), and the inhomogeneous equation has a (non-unique) solution if and only if the right- 
hand side y satisfies the condition f(y) = 0 for every solution f of the adjoint equation T* f = Af. 

In this last case every solution x of the inhomogeneous equation has the form x = £o + h, where xo is 
a fixed solution of the inhomogeneous equation and h € ker(AJ — T). 

Linear equations of the form Tx = y with a compact operator T are called equations of the first kind. 
Their mathematical investigation is in general more difficult (see [12.11],[12.18]). 


12.7.4 Compact Operators in Hilbert Space 


Let T : H — H be a compact operator. Then T is the limit (in B(H)) of a sequence of finite dimen- 
sional operators. The similarity to the finite dimensional case can be seen from the statements: 


If C is a finite dimensional operator and T = I — C, then the injectivity of T implies the existence of 
T^! and T-! € B(H). 

If C is a compact operator, then the following statements are equivalent: 

a) 3T and it is continuous, 

b) x #0 = Tzr #0, i.e., T is injective, 

c) T(H) = H, i.e., T is surjective. 


12.7.5 Compact Self-Adjoint Operators 


1. Eigenvalues 

A compact self-adjoint operator T Z 0 in a Hilbert space H possesses at least one non-zero eigenvalue. 

More precisely, T always has an eigenvalue à with |A| — ||T'|. 'The set of eigenvalues of T is at most 

countable. 

Any compact self-adjoint operator T has the representation T = }> A,P,, (in B(H)), where A, are 
k 


the different eigenvalues of T' and P4 denotes the projector onto the eigenspace H3. In this case the 
operator T is diagonalizable. From this fact it follows that Tx = 33 Ag(x, ex)ex for every x € H, where 
k 


(ex) is the orthonormal system of the eigenvectors of T. If \ ¢ o(T) and y € H, then the solution of 


the equation (AJ — T)« = y can be represented as £ = Ry(T)y = X` 
k 


KU ex )ek- 


2. Hilbert-Schmidt Theorem 

If T is a compact self-adjoint operator in a separable Hilbert space H, then there is a basis in H con- 
sisting of the eigenvectors of T. 

The so-called spectral (mapping) theorems (see [12.8], [12.10], [12.12], [12.13], [12.18]) can be con- 
sidered as the generalization of the Hilbert-Schmidt theorem for the non-compact case of self-adjoint 
(bounded or unbounded) operators. 





tHere the orthogonality is considered in Banach spaces (see 12.5.4.2, p. 682). 
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12.8 Non-Linear Operators 

In the theory of non-linear operator equations the most important methods are based on the following 
principles: 

1. Principle of the Contracting Mapping, Banach Fixed-Point Theorem (see 12.2.2.3, p. 666, 
and 12.2.2.4, p. 666). For further modifications of this principle see [12.8], [12.11], [12.12], [12.18]. 

2. Generalization of the Newton Method (see 18.2.5.2, p. 931 and 19.1.1.2, p. 950) for the infinite 
dimensional case. 

3. Schauder Fixed-Point Principle (see 12.8.4, p. 691) 

4. Leray-Schauder Theory (sce 12.8.5, p. 692) 

Methods based on principles 1 and 2 yield information on the existence, uniqueness, constructivity 
etc. of the solution, while methods based on principles 3 and 4, in general, allow “only” the qualitative 
statement of the existence of a solution. If further properties of operators are known then see also 


12.8.6, p. 692, and 12.8.7, p. 693. 


12.8.1 Examples of Non-Linear Operators 


For non-linear operators the relation between continuity and boundedness discussed for linear operators 
in 12.5.1, p. 677 is no longer valid in general. In studying non-linear operator equations, e.g., non- 
linear boundary value problems or integral equations, the following non-linear operators occur most 
often. Iteration methods described in 12.2.2.4, p. 666, can be successfully applied for solving non-linear 
integral equations. 
1. Nemytskij Operator 
Let €) be an open measurable subset from IR" (12.9.1, p. 693) and f: Q x R —+ Ra function of two 
variables f(x, s), which is continuous with respect to x for almost every s and measurable with respect 
to s for every x (Caratheodory conditions). The non-linear operator NV’ to F(Q) defined as 

(Nu)(x) — f[x,u(x) (a € Q) (12.190) 
is called the Nemytskij operator. It is continuous and bounded if it maps L?(Q) into L4(Q), where 
— -- - — ]. This is the case, e.g., if 
P q 

p 

|f(z,s)| < a(x) +b|s|3 with a(x) e L*(Q) (b>0) (12.191) 
or if f: Q x R — R is continuous. The operator M is compact only in special cases. 
2. Hammerstein Operator 
Let Q be a relatively compact subset of R”, f a function satisfying the Caratheodory conditions and 
K (x, y) a continuous function on 2 x Q. The non-linear operator H on F(Q) 


(uy) — f KG flu) dy. (x € Q) (12492) 
2 


is called the Hammerstein operator. H can be written in the form H =K -M with the Nemytskij 
operator M and the integral operator K determined by the kernel K 


(Ku)(z) — / K(z,y)u(y)dy (x € Q). (12.193) 
ἃ 
If the kernel K (x, y) satisfies the additional condition 
I |K (x, y)|1 dx dy < oo (12.194) 
ΩΧΩ 


and the function f satisfies the condition (12.191), then 4 is a continuous and compact operator on 
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3. Urysohn Operator 
Let Q C R” be an open measurable subset and K(z,y,s): Q x Q x R — R a function of three 
variables. Then the non-linear operator U on F(Q) 


Uul) = f Kley uu)dy œE (12.195) 
2 


is called the Urysohn operator. If the kernel K satisfies the appropriate conditions, then U is a contin- 
uous and compact operator in C(Q) or in L?(Q), respectively. 


12.8.2 Differentiability of Non-Linear Operators 
Let X, Y be Banach spaces, D C X be an open set and T: D —5 Y. The operator T is called Fréchet 
differentiable (or, briefly, differentiable) at the point x € D if there exists a linear operator L € B(X, Y) 
(in general depending on the point x) such that 

T(r--h)— T(r) 2 Lh --w(h) πίοι [ω(ἠ}|} — o(l[h]]) (12.196) 
or in an equivalent form 

5 |T(@ +h) — T(x) — Lal - 
\lal|0 [η 


ie., Ye » 0, 36 » 0, such that ||A|| « 9 implies ||T (x 4- h) — T(x) — Lh|| € e||h||. The operator L, 
which is usually denoted by T"(a), T'(x,-) or T'(x)(-), is called the Fréchet derivative of the operator T 
at the point x. The value dT (v; h) — T'(x)h is called the Fréchet differential of the operator T' at the 
point x (for the increment A). 

The differentiability of an operator at a point implies its continuity at that point. If T € B(X, Y), i,e., 
T itself is linear and continuous, then T is differentiable at every point, and its derivative is equal to 


T 


12.8.3 Newton’s Method 


Let X, D be as in the previous paragraph and T: D — Y. Under the assumption of the differentiability 
of T at every point of the set D an operator T': D — B(X, Y) is defined by assigning the element 
T'(x) € B(X,Y) to every point x € D. Suppose the operator 7" is continuous on D (in the operator 
norm); in this case T is called continuously differentiable on D. 
Suppose Y = X and also that the set D contains a solution x* of the equation 

T(x) =0. (12.198) 
Furthermore, it is assumed that the operator T’(x) is continuously invertible for each x € D, hence 
[Τ'(α)]Γ1 15 πι Β(Χ). Because of (12.196) for an arbitrary zo € D one conjectures that the elements 
T (xo) — T (ro) — T(x*) and T'(xo)(xo — 2*) are “not far” from each other and therefore the element x1 
defined as 

21 — v — [T'(xo)] T (vo) (12.199) 
is an approximation of z* (under the given assumptions). Starting with an arbitrary xo the so-called 
Newton approximation sequence 

En41 = Tn — [T (z4)| στα) (n =0,1,...) (12.200) 
can be constructed. There are many theorems known from the literature discussing the behavior and 
the convergence properties of this method. Here only the following most important result is mentioned 
which demonstrates the main properties and advantages of Newton's method: 
Ve € (0,1) there exists a ball B — B(vo;ó), ó — ó(e) in X, such that all points z; lie in P and the 
Newton sequence converges to the solution x* of (12.198). Moreover, ||, — zo|| € &"||xo — x*|| which 
yields a practical error estimation. 
The modified Newton’s method is obtained if the operator [T"(xo)]-? is used instead of [T'(x,)| ? Yn = 
0, 1,... in formula (12.200). For further estimations of the speed of convergence and for the (in general 
sensitive) dependence of the method on the choice of the starting point zo see [12.7], [12.12], [12.18]. 


0, (12.197) 
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W Jacobian or Functional Matrix Given a non-linear operator T' — F: D —+ R™ on an open set 


D C R" with m non-linear coordinate functions F1, F5, . . . , Fm and n independent variables x1, £2, . . . , En- 
Then 
F(a) 
F(x) . 
F(x) = : ER” Va = (21, %2,...,%n) € D (12.201) 
F(x) 


OF; 

Otk 
on D exist and are continuous, then the mapping (the operator) F in every point of D is differentiable 
and its derivative at the point x = (£1, £2,- . -, 8n) E D is the linear operator F'(x) : R” — R” with 
the matrix representation 





holds. If the partial derivatives (k — 1,2,...,n) of the coordinate functions F; (i — 1,2,...,m) 




















OF \(x) OF; (x) OF \ (x) 
Ox, Ox e Ox, 
Ε΄ (3) -- ὅσι ὅσα 7η 2 (12.202) 
OF.(x) OF y(t) OF n(x) 
Ox, Ox m δαν 


The derivative F'(x) is a matrix of the type (m, n). It is called the Jacobian or functional matrix of 
F . Special cases of occurrence are, e.g., the iterative solution of systems of non-linear equations by the 
Newton method (see 19.2.2.2, p. 962) or describing the independence of functions (see 2.18.2.6, 3., p. 
123). 

For m = n the so-called functional determinant or Jacobian determinant can be formed, which is de- 
noted shortly by 


D, Fa... Fu) (12.203) lE 


D(z1,2,..., t4) 


This determinant is used for the solution of (mostly inner-mathematical) problems (see also, e.g., 


8.5.3.2, p. 539). 


12.8.4 Schauder's Fixed-Point Theorem 


Let T : D —. X bea non-linear operator defined on a subset D of a Banach space X. The non-trivial 
question of whether the equation x = T(x) has at least one solution, can be answered as follows: If 
X = R and D = [-1, 1], then every continuous function, mapping D into D, has a fixed point in D. 
If X is an arbitrary finite dimensional normed space (dimX > 2), then Browwer’s fixed-point theorem 
holds. 


1. Brouwer’s Fixed-Point Theorem Let D be a non-empty closed bounded and convex subset of 
a finite dimensional normed space. If T is a continuous operator, which maps D into itself, then T has 
at least one fixed point in D. 

The answer in the case of an arbitrary infinite dimensional Banach space X is given by Schauder's 
fixed-point theorem. 

2. Schauder’s Fixed-Point Theorem Let D be a non-empty closed bounded and convex subset 
of a Banach space X. If the operator T: D —+ X is continuous and compact (hence completely 
continuous) and it maps D into itself, then T' has at least one fixed point in D. 

By using this theorem, it is proved, e.g., that the initial value problem (12.70), p. 668, always has a 
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local solution for t 7 0, if the right-hand side is assumed only to be continuous. 


12.8.5 Leray-Schauder Theory 


For the existence of solutions of the equations x — T' (x) and (I--T)(x) = y with a completely continuous 
operator T, a further principle is found which is based on deep properties of the mapping degree. It can 
be successfully applied to prove the existence of a solution of non-linear boundary value problems. Here 
only those results of this theory are mentioned which are the most useful ones in practical problems, 
and for simplicity a formulation is chosen which avoids the notion of the mapping degree. 
Leray-Schauder Theorem: Let D be an open bounded set in a real Banach space X and let T : 
D:—35 X bea completely continuous operator. Let y € D be a point such that x + AT (x) Æ y for each 
x € OD and A € [0, 1], where OD denotes the boundary of the set D. Then the equation (I 4- T)(x) — y 
has at least one solution. 
The following version of this theorem is very useful in applications: 
Let T be a completely continuous operator in the Banach space X. If all solutions of the family of 
equations 

2 --λΤ(ῳ) (A € (0,1) (12.204) 
are uniformly bounded, i.e., Jc > 0 such that VÀ and V x satisfying (12.204) the a priori estimation 
\|x|| < c holds, then the equation x = T(x) has a solution. 


12.8.6 Positive Non-Linear Operators 

The successful application of Schauder’s fixed-point theorem requires the choice of a set with appropri- 
ate properties, which is mapped into itself by the considered operator. In applications, especially in the 
theory of non-linear boundary value problems, ordered normed function spaces and positive operators 
are often considered, i.e., which leave the corresponding cone invariant, or isotone increasing operators, 
i.e., if x < y > T(x) < T(y). If confusions (see, e.g., 12.8.7, p. 693) are excluded, these operators are 
also called monotone. 

Let X = (X, X4, || - ||) be an ordered Banach space, X, a closed cone and [a, b] an order interval of 
X. If X, is normal and T is a completely continuous (not necessarily isotone) operator that satisfies 
T([a, ]) C [a, b], then T' has at least one fixed point in [a, b] (Fig. 12.6b). 


pty 





b) 


Figure 12.6 





Notice that the condition T({a, b]) C [a,b] automatically holds for any isotone increasing operator T, 
which is defined on an (o)-interval (order interval) [a, b] of the space X if it maps only the endpoints a, b 
into fa, b], i.e., when the two conditions T(a) 2 a and T(b) € b are satisfied. Then both sequences 

x9 — à and z444; — T(vr,) (n 2 0) and Ίος b and yii — T(y,) (n 2 0) (12.205) 
are well defined, i.e., £n, Yn € [a, 6], n 2 0. They are monotone increasing and decreasing, respectively, 
i.e., a = £o € zi €... € y, € ... and b — yg 2 y 2 ... ya 2. .... A fixed point z,, z* of the operator 
T'is called minimal, maximal, respectively, if for every fixed point z of T the inequalities r, € z, z € z* 
hold, respectively. 
Now,the following statement is valid (Fig. 12.6a)): Let X be an ordered Banach space with a closed 
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cone X, and T: D — X, D C X a continuous isotone increasing operator. Let [a,b] C D be such 
that T(a) > aand T(b) € b. Then T([a,?]) C [a,b], and the operator T' has a fixed point in [a, b] if one 
of the following conditions is fulfilled: 

a) X4 is normal and T is compact; 

b) X4 is regular. 

Then the sequences {x,}°29 and (y, £o. defined in (12.205), converge to the minimal and to the 
maximal fixed points of T' in [a, b), respectively. 

The notion of the super- and sub-solutions is based on these results (see [12.14]). 


12.8.7 Monotone Operators in Banach Spaces 


1. Special Properties 

An arbitrary operator T: D C X — Y (X, Y normed spaces) is called demi-continuous at the point 
xo € D if for each sequence {x,,}°, C D converging to xo (in the norm of X) the sequence {T(a,)}°, 
converges weakly to T(x) in Y. T is called demi-continuous on the set D if T is demi-continuous at 
every point of D. 

In this paragraph another generalization of the notion of monotonicity known from real analysis are 
introduced. Let X now be a real Banach space, X* its dual, D C X and T: D —>+ X* a non-linear 
operator. T is called monotone if V x,y € D the inequality (T(x) — T(y),x — y) > 0 holds. If X = H 
is a Hilbert space, then (-,-) means the scalar product, while in the case of an arbitrary Banach space 
one refers to the notation introduced in 12.5.4.1, p. 681. The operator T is called strongly monotone 
if there is a constant c > 0 such that (T(x) — T(y),x — y) » clx — yl? for V z,y € D. An operator 
T: X — X* is called coercive if lim oa) = 


le>% |x 





2. Existence Theorems 

for solutions of operator equations with monotone operators are given here only exemplarily: If the op- 
erator T, mapping the real separable Banach space X into X*, (Dr = X), is monotone demi-continuous 
and coercive, then the equation T(x) = f has a solution for arbitrary f € X*. 

If in addition the operator T is strongly monotone, then the solution is unique. In this case the inverse 
operator T7! also exists. 

For a monotone, demi-continuous operator T: H — H in a Hilbert space H with Dr = H, there holds 
Im(I +T) = H, where (Z + T)-! is continuous. If T is supposed to be strongly monotone, then T'7! is 
bijective with a continuous T1 . 

Constructive approximation methods for the solution of the equation T'(r) — 0 with a monotone oper- 
ator T' in a Hilbert space are based on the idea of Galerkin's method (see 19.4.2.2, p. 974, or. [12.10], 
[12.18]). By means of this theory set-valued operators T: X —5 2*' can also be handled. The notion 
of monotonicity is then generalized by (f — g,r — y) 2 0, Vr, y € Dr and f € T(x),g € T(y). 


12.9 Measure and Lebesgue Integral 
12.9.1 Set Algebras and Measures 


The initial point for introducing measures is a generalization of the notion of the length of an interval in 
R, of the area, and of the volume of subsets of IR? and R?, respectively. This generalization is necessary 
in order to “measure” as many sets as possible and to “make integrable” as many functions as possible. 
For instance, the volume of an n-dimensional rectangular parallelepiped |, 

Q={r ER": ap <a €by (kK =1,2,...,n)} hasthe value [] (b, — ax). (12.206) 


k=1 
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1. c Algebra or Set Algebra 
Let X be an arbitrary set. A non-empty system A of subsets from X is called a o algebra if: 


a) Ac A implies X\ Ae A and (12.207a) 
b) Aj, Ao,...,An,---€ A implies LJ An € A. (12.207b) 
n=1 


Every o algebra contains the sets Ý and X, the intersection of countably many of its sets and also the 
difference sets of any two of its sets. 


In the following R denotes the set R. of real numbers extended by the elements —oo and 4-oo (extended 
real line), where the algebraic operations and the order properties from R. are extended to R. in the 








οο 

natural way. The expressions (+00) + (F00) and — are meaningless, while 0 - (+00) and 0) - (—oo) are 
o0 

assigned the value 0. 


2. Measure = 
A function pp: A —> Ry = Ry U +00, defined on ao algebra A, is called a measure if 


8) µίΑ) 30 VAEA, (12.208a) 

b) μ(θ) -- 0, (12.208b) 

c) Aj, Ao,...An,-..€ A, Ap NAL =O (k Al) implies u (U ^.) = X u(n). (12.208c) 
n=1 n=1 


The property c) is called ø additivity of the measure. If jz is a measure on A, and for the sets A, B € 
A, A C B holds, then (A) < p(B) (monotonicity). If An € .A (n — 1,2,...and A41 C A3 C 5, 


then u ( ἢ An) . lim ICA) (continuity from below). 
n=1 POG 


Let .A bea c algebra of subsets of X and jj a measure on A. The triplet X = (X, A, ju) is called a measure 
space, and the sets belonging to A are called measurable or A-measurable. 

E A: Counting Measure: Let X be a finite set {21, v2,..., ty}, A the o algebra of all subsets of X, 
and let assign a non-negative number p; to each 7, (k = 1,...,N). Then the function jz defined on A 
for every set A € A, A= {%n,,2ng,---,%n,} by H(A) = Pri + Pno +*+- + Pn, iS a measure which takes 
on only finite values since (X) = pı +-+- + pn < co. This measure is called the counting measure. 
ΒΒ: Dirac Measure: Let .A bea c algebra of subsets of a set X and a an arbitrary given point from 
X. Then a measure (called Dirac Measure) is defined on A by 


l ifacA, . 
δι(Α) -- lo ev (12.2094) 


It is called the 6 function (concentrated on a). The characteristic function or indicator function of a 
subset A C X denotes the function xa: X —> (0,1) of X on (0, 1), which has the value 1 for x € A 
and for all other x the value 0: 
1, ifx € A, 
χαία) -- ls otherwise. (12.209b) 


Obviously ó4(4) = da(xa) — xA(a) (see 12.5.4, p. 681), where x4 denotes the characteristic function 
of the set. A. 

B C: Lebesgue Measure: Let X be a metric space and B(X) the smallest e algebra of subsets of X 
which contains all the open sets from X. B(X) exists as the intersection of all the ø algebras containing 


all the open sets, and is called the Borel o algebra of X. Every element from B(X) is called a Borel set 
(see [12.6]). 

Suppose now, X = R” (n 7 1). Using an extension procedure a ø algebra and a measure on it can be 
constructed, which coincides with the volume on the set of all rectangular parallelepipeds in IR". More 
precisely: There exists a uniquely defined o algebra A of subsets of R” and a uniquely defined measure 
A on A with the following properties: 
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a) Each open set from RR" belongs to .A, in other words: B(R") C .A. 
b) If A € A, \(A) 2 0 and P C Athen B € A and A(B) — 0. 


c) If Q is a rectangular parallelepiped, then Q € A, and A(Q) = αν — ax). 


d) À is translation invariant, i.e., for every vector x € R” and every set A € A one has x + A = {£ +y: 
y E€ A} € Aand A(x + A) = A(A). 

The elements of A are called Lebesgue measurable subsets of R”. A is the (n-dimensional) Lebesgue 
measure in R”. 

Remark: In measure theory and integration theory one says that a certain statement (property, or 
condition) with respect to the measure p is valid almost everywhere or u-almost everywhere on a set 
X, if the set, where the statement is not valid, has measure zero. It is denoted by a.e. or u-a.e.f For 
instance, if À is the Lebesgue measure on R and A, B are two disjoint sets with R = AU B and f isa 
function on R with f(x) — 1, Vr € A and f(x) — 0, Yx € B, then f = 1, à-a.e. on R if and only if 
A(B) — 0. 


12.9.2 Measurable Functions 
12.9.2.1 Measurable Function 


Let A be a ø algebra of subsets of a set X. A function f: X —5 R is called measurableif for an arbitrary 
a € R the set f ((a, +00]) = {x : x € X, f(x) >a} isin A. 
A complex-valued function g + ih is called measurable if both functions g and h are measurable. The 
characteristic function x 4 of every set A € A is measurable, because 

. A, if a € (—oo, 1), 

Xa ((a, +00]) = { dé. fw et ) (12.210) 
is valid (see Dirac measure, p. 694). If A is the ø algebra of the Lebesgue measurable sets of R” and 
f: R” — R is a continuous function, then the set f~!((a, +oo]) = f71((a, +00)), according to 12.2.3, 
p. 668, is open for every a € R, hence f is measurable. 
12.9.2.2 Properties of the Class of Measurable Functions 


The notion of measurable functions requires no measure but a ø algebra. Let A be a ø algebra of sub- 
sets of the set X and let f, g, fa : X — R be measurable functions. Then the following functions (see 
12.1.7.4, p. 660) are also measurable: 


a) af for every a € R; f-g; 
b) f+, f- |Fl; FV gand fA g; 
c) f + g, if there is no point from X where the expression (+00) + 


d) sup fn, inf fn, lim sup fnr (= Jim. sup fe), liminf fn; 
k>n 





=Foo) occurs; 





~ 


e) the point-wise limit lim fn, in case it exists; 
f) if f > 0 and p € R, p » 0, the f? is measurable. 
A function f: X — R is called elementary or simple if there is a finite number of pairwise disjoint sets 


n 
A1,..., À, € A and real numbers 05,...,0; such that f — »5 oy xx, where yj denotes the character- 
k=l 


istic function of the set Aj. Since each characteristic function of a measurable set is measurable (see 
(12.210)), so every elementary function is measurable. It is interesting that each measurable function 
can be approximated arbitrarily well by elementary functions: For each measurable function f > 0 
there exists a monotone increasing sequence of non-negative elementary functions, which converges 
point-wise to f. 





5Here and in the following parts *a.e." is an abbreviation for "almost everywhere". 





696 12. Functional Analysis 





12.9.3 Integration 

12.9.3.1 Definition of the Integral 

Let (X, A, p) be a measure space. The integral f du (also denoted by I fdu) for a measurable 
JX E 

function f is defined by means of the following steps: 


1. If f is an elementary function f = = Or Xx, then 
[ ran - Yong). (12.211) 
k=1 
2. If f: X — R (f 2 0), then 
[ta = sup { [gu : g isan elementary function with 0 € g(z) € f(x), Vx € x} . (12.212) 
3. If f: X — R and f+, f- are the positive and the negative parts of f, then 


fiw=f tha- fi- du (12.213) 


under the condition that at least one of the integrals on the right side is finite (in order to avoid the 
meaningless expression oo — oo). 

4. For a complex-valued function f = g + ih, if the integrals (12.213) of the functions g, h are finite, 
put 


Jtdu= fodu+i f hdn. (12.214) 


5. If for any measurable set A and a function f there exists the integral of the function fxa then put 


[1 e f fxadn. (12.215) 
A 


The integral of a measurable function is in general a number from R. A function f: X —> Ris called 


integrable or summable over X with respect to p if it is measurable and / |f| du « oo. 


12.9.3.2 Some Properties of the Integral 
Let (X,.A, μ) be a measure space, f, g : X — R be measurable functions and a, 8 € R. 
1. If f is integrable, then f is finite a.e., i.e., u(r € X: |f(x)| = +oo} = 0. 


f ros finas. 


3. If f is integrable and f > 0, then / f dp 2 0. 


N 





. If f is integrable, then 


4. If0 < g(x) < f(x) on X and f is integrable, then g is also integrable, and foan < fi du. 
5. If f, g are integrable, then af + 8g is integrable, and for c B8g)du—a / fdut+ B fu dp. 


6. If f,g are integrable on A € A, i.e., there exist the integrals 1 fdu and / gdu according to 
A A 


(12.215) and f = g p-a.e. on A, then f fdu =| gdp. 
A A 


If X = R” and A is the Lebesgue measure, then the introduced integral is the (n-dimensional) Lebesgue 
integral (see also 8.2.3.1, 3., p. 507). In the case n — 1 and A — [a, 0], for every continuous function 
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b 
f on [a,b] both the Riemann integral / f(x) dx (see 8.2.1.1, 2., p. 494) and the Lebesgue integral 
Ja 

f d^ are defined. Both values are finite and equal to each other. Furthermore, if f is a bounded 
M 
Riemann integrable function on [a,b], then it is also Lebesgue integrable and the values of the two 
integrals coincide. 
The set of Lebesgue integrable functions is considerably larger than the set of the Riemann integrable 
functions and it has several advantages, e.g., when passing to the limit under the integral sign and f, 
|f| are Lebesgue integrable simultaneously. 
12.9.3.3 Convergence Theorems 
Now Lebesgue measurable functions will be considered throughout. 
1. B. Levi’s Theorem on Monotone Convergence 
Let (f4 4 be an a.e. monotone increasing sequence of non-negative integrable functions with values 
in R. Then 


dim / Rh dus / lim f, dy. (12.216) 


2. Fatou's Theorem 
Let {f,}02, be a sequence of non-negative R-valued measurable functions. Then 


fim inf f, di € liminf Hh dp. (12.217) 


3. Lebesgue's Dominated Convergence Theorem 
Let { fn } be a sequence of measurable functions convergent on X a.e. to some function f. If there exists 
an integrable function g such that | f,| < ga.e., then f = jim, fn is integrable and there holds 


dim. [fan — f jim fadp. (12.218) 


4. Radon-Nikodym Theorem 

a) Assumptions: Let (X, A, 11) bea o-finite measure space, i.e., there exists a sequence ( A4), A, E€ A 
oo 

such that X = U A, and u(A,) « oo for V n. In this case the measure is called o finite. It is called 
n=1 

finite if (X) < œ, and it is called a probability measure if μ(Χ) -- 1. A real function y defined on A is 

called absolutely continuous with respect to u if (A) = 0 implies y(A) = 0. This property is denoted 

by e x p. 

For an integrable function f, the function y defined on A by (A) = f4 f duisc additive and absolutely 


continuous with respect to the measure ju. The converse of this property plays a fundamental role in 
many theoretical investigations and practical applications: 


b) Radon-Nikodym Theorem: Suppose a c-additive function y and a measure ju are given on ao 
algebra A, and let y < u. Then there exists a j-integrable function f such that for each set A € A, 


y(A) = Ji dp. (12.219) 
A 
The function f is uniquely determined up to its equivalence class, and y is non-negative if and only if 
1 3 0 µ-α.ο. 


12.9.4 L?Spaces 


Let (X,.A, j1) be a measure space and p a real number 1 < p < oo. For a measurable function f, 
according to 12.9.2.2, p. 695, the function | f|? is measurable as well, so the expression 
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Nf) — (f rag)! (12.220) 


is defined (and may be equal to -oo). A measurable function f: X —+ Ris called p-th power integrable, 
or an L?-function if N,(f) < +00 holds or, equivalent to this, if | f|? is integrable. 
For every p with 1 < p < +00, the set of all L?-functions, i-e., all functions p-th power integrable with 
respect to u on X, is denoted by £?(j:) or by £?(X) or in full detail £?(X, A, jz). For p = 1 the simple 
notation £(X) is used. For p = 2 the functions are called quadratically integrable. 
The set of all measurable ji-a.e. bounded functions on X is denoted by £L% (p) and the essential supre- 
mum of a function f is defined as 

Noo(f) = ess. sup f = inf{ae€ R: |f(x)| <a µ-α.ο.}. (12.221) 
L?(u) (1 € p € oo) equipped with the usual operations for measurable functions and taking into 
consideration Minkowski inequality for integrals (see 1.4.2.13, p. 32), is a vector space and N,(-) is a 
semi-norm on ΚΡ(μ). If f € g means that f(x) X g(x) holds p-a.e., then £?(j1) is also a vector lattice 
and even a K-space (see 12.1.7.4, p. 660). 'T'wo functions f, g € £?(u) are called equivalent (or they 
are declared as equal) if f = g p-a.e. on X. In this way, functions are considered to be identical if they 
are equal jr-a.e. The factorization of the set £?(X) modulo the linear subspace IN; ! (0) leads to a set 
of equivalence classes on which the algebraic operations and the order can be transferred naturally. So 
a vector lattice (K-space) is obtained again, which is denoted now by L?(X, jz) or L?(u). Its elements 
are called functions, as before, but actually they are classes of equivalent functions. 


It is very important that || Flp = Np(f) is now a norm on L^(y) (f stands here for the equivalence class 
of f, which will later be denoted simply by f), and (L?(j1), || f||p) for every p with 1 < p < +œ is a 
Banach lattice with several good compatibility conditions between norm and order. For p — 2 with 


(f.g) — [ t9 dp as ascalar product, L(j1) is also a Hilbert space (see [12.12]). 
Very often the space L?((2) is considered for a measurable subset €) C IR". Its definition is not a problem 
because of step 5 in (12.9.3.1, p. 696). 


The spaces L?(Q, A), where À is the n-dimensional Lebesgue measure, can also be introduced as the 
completions (see 12.2.2.5, p. 668 and 12.3.2, p. 670) of the non-complete normed spaces C (Q) of all 


i 
continuous functions on the set O C IR" equipped with the integral norm ||v||; — ( i ||? aa) "uas 


p « oo) (see [12.18]). 


Let X be a set with a finite measure, i.e., p(X) < +00, and suppose for the real numbers p1, p2, 1 < pı < 
po € coo. Then L”(X, p) C L”(X, u), and with a constant C = C (p1, p2, u(X)) > 0 (independent 
of x), there holds the estimation ||z|; € Cl|z||» for x € L7? (here |||; denotes the norm of the space 
L(X,u) (k — 1,2). 


12.9.5 Distributions 
12.9.5.1 Formula of Partial Integration 


For an arbitrary (open) domain Q C R”, C° (Q) denotes the set of all arbitrary many times in Q 
differentiable functions y with compact support, i.e., the set supp(y) = {a € Q : e(x) z 0) is compact 
in R” and liesin Q. The set of all locally summable functions with respect to the Lebesgue measure in R” 


is doted by Lj,.(Q), i-e., all the measurable functions f (equivalent classes) on Q such that f, |f| dA < 
-Foo for every bounded domain w C Q. 


Both sets are vector spaces (with the natural algebraic operations). 

There hold L?(Q) C LL,(0) for 1 € p € oo, and L1,(Q) = L!(Q) for a bounded Q. If the elements 
of C*(Q) are considered as the classes generated by them in L?(Q), then the inclusion C*(Q) Cc L?(Q) 
holds for bounded Q, where C*((Q) is at once dense. If 2 is unbounded, then the set Cg? (0) is dense (in 
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this sense) in L?((Q). 


For a given function f € C*(Q) and an arbitrary function v € Cg*(Q) the formula of partial integration 
has the form 


/ f(w)D% p(w) dd = (1) / ex) D? f (x) dÀ (12.222) 
n n 


Va with |a| < k (the fact that D%y|aq = 0 is used), and will be taken as the starting point for the 
definition of the generalized derivative of a function f € Lj,.(Q). 


12.9.5.2 Generalized Derivative 


Suppose f € Lj,,(Q). If there exists a function g € Lj,.(Q) such that Vy € C§°(Q) with respect to 
some multi-index a the equation 


/ {Κα Ώαφ(α) αλ -- (ΟΙ I g(x)p(x) dà (12.223) 
Q n 


holds, then g is called the generalized derivative (derivative in the Sobolev sense or distributional deriva- 
tive) of order a of f. It is denoted by g — D* f as in the classical case. 
The convergence of a sequence {pp} in the vector space C5? (Q) to o € Cg? (Q0) is defined as 


a) J a compact set K C Q with supp(x) C K for any k, 


Vi — ¢ if and only if { b) D? — D? uniformly on K for each multi-index a. 


(12.224) 
"The set Cg? (Q), equipped with this convergence of sequences, is called the fundamental space, and is 
denoted by D(Q). Its elements are often called test functions. 


12.9.5.3 Distributions 


A linear functional £ on D(Q) continuous in the following sense (see 12.2.3, p. 668): 
pe, y € D(Q) and vy, — vy imply Llyr) — Lop) (12.225) 
is called a generalized function or a distribution. 


B A: If f € LL(Q), then 


Ulp) = (Fy) = [, fee) ve DQ) (12.226) 


is a distribution. A distribution, defined by a locally summable function as in (12.226), is called regular. 
Two regular distributions are equal, i-e., 0¢(y) = lg(y) Vp € D(Q), ifand only if f = ga.e. with respect 
to A. 

E B: Leta €Q bean arbitrary fixed point. Then £5,(o) — v(a), v € D(Q) is a linear continuous 
functional on D(Q), hence a distribution, which is called the Dirac distribution, ô distribution or ð 
function. 

Since /;, cannot be generated by any locally summable function (see [12.11], [12.24]), it is an example 
for a non-regular distribution. 

The set of all distributions is denoted by D'(Q). From a more general duality theory than that discussed 
in 12.5.4, p. 681, D’(Q) can be obtained as the dual space of D(Q). Consequently, one should write 
D*(Q) instead. In the space D’(Q), it is possible to define several operations with its elements and with 
functions from C™(Q), e.g., the derivative of a distribution or the convolution of two distributions, 
which make D’(Q) important not only in theoretical investigations but also in practical applications in 
electrical engineering, mechanics, etc. 

For a review and for simple examples in applications of generalized functions see, e.g., {12.11],[12.24]. 





700 12. Functional Analysis 





Here, only the notion of the derivative of a generalized function is discussed. 
12.9.5.4 Derivative of a Distribution 


If £ is a given distribution, then the distribution D^/ defined by 


(D*0)(e) 2 (C1) "4(D^e), pE DO), (12.227 
is called the distributional derivative of order a of 6. 
Let f be a continuously differentiable function, say on R (so f is locally summable on R, and f can be 
considered as a distribution), let f" be its classical derivative and D! f its distributional derivative o 
order 1. Then: 


(D'f,y) = I f'(xye(z) dz, (12.2282. 
R 
from which by partial integration there follows 


(Df, p) = -f f(x)y'(x) de = —(f,y’). (12.228b 
R 


In the case of a regular distribution ls with f € Lj,,.(Q) by using (12.226) 
(DLlE) = (—1)!ey(D%y) = (1)! f Fle) D%~ aX (12.229 
à 





is obtained, which is the generalized derivative of the function f in the Sobolev sense (see (12.223)). 
W A: For the regular distribution generated by the (obviously locally summable) Heaviside function 
O(x) = { ; ο A - 7 (12.230) 
the non-regular 6 distribution is obtained as the derivative. 
WB D: In mathematical modeling of technical and physical problems one is faced with (in a certain sense 
idealized) influences concentrated at one point, such as a *point-like" force, needle-deflection, collision, 
etc., which can be expressed mathematically by using the 6 or Heaviside function. For example, mó, is 
the mass density of a point-like mass m concentrated at one point a (0 € a € l) of a beam of length I. 
The motion of a spring-mass system on which at time to there acts a momentary external force F is 
described by the equation ë + w?x = Fô. With the initial conditions x(0) = (0) = 0 its solution is 


x(t) = = sin(w(t — to))O(t — to). 


13 Vector Analysis and Vector Fields 


13.1 Basic Notions ofthe Theory of Vector Fields 


13.1.1 Vector Functions of a Scalar Variable 
13.1.1.1 Definitions 
1. Vector Function of a Scalar Variable t 
A vector function of a scalar variable is a vector à whose components are real functions of t: 

a = a(t) = a,(t)é, + ay(t)&, 4- a4(t)e;. (13.1) 
The notions of limit, continuity, differentiability are defined componentwise for the vector a(t). 
2. Hodograph of a Vector Function 
Considering the vector function a(t) as a position or radius vector r' — r(f) of a point P, then this 
function describes a space curve while ¢ varies (Fig. 13.1). This space curve is called the hodograph of 
the vector function a(t). 


P, 
p d 30 

a t 
τι P 2 

“a | LLL > 

0 e 
P, 0 r+Ar s 
Figure 13.1 Figure 13.2 Figure 13.3 


13.1.1.2 Derivative of a Vector Function 
The derivative of (13.1) with respect to t is also a vector function of t: 
dă lim a(t + At) —a(t) κ da, (t) = day(t) y _ da,(t) &. 


di — At» At di 7 d " ^ dti (13.2) 





dr 
dt 
tion of the tangent of the hodograph at the point P (Fig. 13.2). Its length depends on the choice of 
the parameter t. If t is the time, then the vector r(t) describes the motion of a point P in space (the 





of the radius vector is a vector pointing in the direc- 





The geometric representation of the derivative 





























space curve is its path), and 5 has the direction and magnitude of the velocity of this motion. Ift = s 
is the arclength of this space curve, measured from a certain point, then obviously τ | zu 
13.1.1.3 Rules of Differentiation for Vectors 
z (atb+é) A = - (13.34) 
: (ya) = E + o% (ip is a scalar function of t), (13.3b) 
i (ab) = A 4 A (13.3c) 
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d mo did > ib 
— (ax b)= Pops (the factors must not be interchanged), (13.3d) 
dt dt dt 
d dà d 
d a[e(t)] — a : a (chain rule). (13.3e) 
5 z a 5 3 . , da . dà 
If |a(t)| 7 const, i.e., a? (t) = a(t) - A(t) = const, then it follows from (13.3c) that à - mo 0, i.e., di 


and a are perpendicular to each other. Examples of this fact: 
W A: Radius and tangent vectors of a circle in the plane and 
W B: position and tangent vectors of a curve on the sphere. Then the hodograph is a spherical curve. 


13.1.1.4 Taylor Expansion for Vector Functions 


da da h” d'a 
a(t +h) = a(t) +h—4 d Eee 
dt 2! dt? n! dt^ 
The expansion of a vector function in a Taylor series makes sense only if it is convergent. Because the 
limit is defined componentwise, the convergence can be checked componentwise, so the convergence 
of this series with vector terms can be determined exactly by the same methods as the convergence of 
a series with complex terms (see 14.3.2, p. 751). So the convergence of a series with vector terms is 
reduced to the convergence of a series with scalar terms. 
The differential of a vector function a(t) is defined by: 
da 


dä = ΔΙ. (13.5) 








(13.4) 


13.1.2 Scalar Fields 
13.1.2.1 Scalar Field or Scalar Point Function 


If a number (scalar value) U is assigned to every point P of a subset of space, then one writes 
U =U(P) (13.6a) 
and one calls (13.6a) a scalar field (or scalar function). 
W Examples of scalar fields are temperature, density, potential, etc., of solids. 
A scalar field U = U(P) can also be considered as 
U — U(r), (13.6b) 
where r is the position vector of the point P with a given pole 0 (see 3.5.1.1, 6., p. 182). 
13.1.2.2 Important Special Cases of Scalar Fields 
1. Plane Field 
One speaks of a plane field, if the function is defined only for the points of a plane in space. 


2. Central Field 

If a function has the same value at all points P lying at the same distance from a fixed point C (ri), 
called the center, then it is called a central symmetric field or also a central or spherical field. 'The 
function U depends only on the distance CP = |f}: 


U = f(|r]). (13.7a) 
W The field of the intensity of a point-like source, e.g., the field of brightness of a point-like source of 
light at the pole, can be described with |r| — r as the distance from the light source: 


U= τ (c const) . (13.7b) 
7 
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3. Axial Field 

If the function U has the same value at all points lying at an equal distance from a certain straight line 
(axis of the field) then the field is called cylindrically symmetric or an axially symmetric field, or briefly 
an axial field. 


13.1.2.3 Coordinate Representation of Scalar Fields 
If the points of a subset of space are given by their coordinates, e.g., by Cartesian, cylindrical, or spher- 
ical coordinates, then the corresponding scalar field (13.6a) is represented, in general, by a function of 
three variables: 

U-d(r,y,z) | Uc-W(po,z) or U= x(r,0,9). (13.8a) 
In the case of a plane field, a function with two variables is sufficient. It has the form in Cartesian and 
polar coordinates: 

U = (x,y) or U = W (p,p). (13.8b) 
The functions in (13.8a) and (13.8b), in general, are assumed to be continuous, except, maybe, at some 
points, curves or surfaces of discontinuity. The functions have the form 





a) for a central field: U —U(qaz? - y? 4-22) 2 U(y g? 4- z2) - U() (13.94) 
with the origin of the coordinate system as the pole of the field, 
b) for an axial field: U = U(y x? +42) = U(p) = U(r sin ð) (13.9b) 


with the z-axis as the axis of the field. 
Dealing with central fields is easiest using spherical coordinates, with axial fields using cylindrical co- 
ordinates. 


13.1.2.4 Level Surfaces and Level Lines of a Field 


1. LevelSurface 
A level surface is the union of all points P in space where the function (13.6a) has a constant value 

U — U(P) — const. (13.10a) 
Different constants Up, Ui, U2,... define different level surfaces. There is a level surface passing through 
every point except the points where the function is not defined. The level surface equations in the three 
coordinate systems used so far are: 

U = (x,y,z) = const, U = W (p,p, z) = const, U — x(r, 9, p) = const. (13.10b) 
W Examples of level surfaces of different fields: 


A: U—cr-—cr-cyy-4 c;z: Parallel planes. 

B: U =a? + 2y? + 42?: Similar ellipsoids in similar positions. 
C: Central field: Concentric spheres. 

D: Axial field: Coaxial cylinders. 

2. Level Lines 


Level lines replace level surfaces in plane fields. They satisfy the equation 

U = const. (13.11) 
Level lines are usually drawn for equal intervals of U and each of them is marked by the corresponding 
value of U (Fig. 13.3). 
E Well-known examples are the isobaric lines on a synoptic map or the contour lines on topographic 
maps. 
In particular cases, level surfaces degenerate into points or lines, and level lines degenerate into separate 
points. 
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1 
E The level lines of the fields a) U = xy, b) U = A c) U = x? +4? = p, d) U = ~ are represented 
T p 
in Fig. 13.4. 





Figure 13.4 


13.1.3 Vector Fields 


13.1.3.1 Vector Field or Vector Point Function 
If a vector V is assigned to every point P of a subset of space, then it is denoted by 

V =V(P) (13.12a) 
and calls (13.12a) a vector field. 


W Examples of vector fields are the velocity field of a fluid in motion, a field of force, and a magnetic 
or electric intensity field. 





A vector field V — V(P) can be regarded as a vector function 


V = V(r), (13.12b) 
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where r is the position vector of the point P with a given pole 0. If all values of r as well as V lieina 
plane, then the field is called a plane vector field (see 3.5.2, p. 190). 


13.1.3.2 Important Cases of Vector Fields 


1. Central Vector Field 
Ina central vector field all vectors V lie on straight lines passing through a fixed point called the center 
(Fig. 13.5a). 
Locating the pole at the center, then the field is defined by the formula 

V — f(z) r, (13.132) 
where all the vectors have the same direction as the radius vector r. It often has some advantage to 
define the field by the formula 


v= eut (r = [z]), (13.190) 


=): = T. i 3 : 5 - 
where |o(r)| is the length of the vector V and — is a unit vector into the direction of r. 
r 





= 
c 








Figure 13.5 


2. Spherical Vector Field 

A spherical vector field is a special case of a central vector field, where the length of the vector V depends 
only on the distance |r| (Fig. 13.5b). 

W Examples are the Newton and the Coulomb force field of a point-like mass or of a point-like electric 
charge: 





Ca ΟἹ 
E =a. 
T r 


(c const) . (13.14) 


The special case of a plane spherical vector field is called a circular field. 
3. Cylindrical Vector Field 


a) All vectors V lie on straight lines intersecting a certain line (called the axis) and perpendicular to 
it, and 

b) all vectors V at the points lying at the same distance from the axis have equal length, and they are 
directed either toward the axis or away from it (Fig. 13.5c). 

Locating the pole on the axis parallel to the unit vector C, then the field has the form 


=k 


v= et^. (13.152) 
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where r* is the projection of r on a plane perpendicular to the axis: 
T =X (TXO). (13.15b) 
By intersecting this field with planes perpendicular to the axis, one always gets equal circular fields. 


13.1.3.3 Coordinate Representation of Vector Fields 


1. Vector Field in Cartesian Coordinates 

The vector field (13.12a) can be defined by scalar fields V, (r), Vo(r), and Va(r) which are the coordinate 
functions of V, i.e., the coefficients of its decomposition into any three non-coplanar base vectors €, 
65, and ea: 


V — V6 4 Vie 4- Vies. (13.164) 
With the coordinate unit vectors ij. K as base vectors and expressing the coefficients Vj, V2, V3 in 
Cartesian coordinates one gets 

v= Voltt, DIES Vr. y; 2] 1- γαι. z)k. (13.16b) 
So, the vector field can be defined with the help of three scalar functions of three scalar variables. 


2. Vector Field in Cylindrical and Spherical Coordinates 


In cylindrical and spherical coordinates, the coordinate unit vectors 


&, &, & (2k) and &(- 5, δ», &, (13.172) 


are tangents to the coordinate lines at each point (Fig. 13.6, 13.7). In this order they always form a 
right-handed system. The coefficients are expressed as functions of the corresponding coordinates: 


V = Valp, p, 2)Ep + Volp, p, 2)Ep + Va(p, p, 2)e 5 (13.17b) 
V = V, (r, 9, p)ē + Volr, , φγδρ ἠ- νχ{», ϑ, )δν. (13.170) 


At transition from one point to the other, the coordinate unit vectors change their directions, but 
remain mutually perpendicular. 











Figure 13.6 Figure 13.7 Figure 13.8 


13.1.3.4 Transformation of Coordinate Systems 
See also Table 13.1. 
1. Cartesian Coordinates in Terms of Cylindrical Coordinates 
Vz = V, cos 9 — Vo sin 9p, V, = V,siny + V, cos y, V, = Vz. (13.18) 
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2. Cylindrical Coordinates in Terms of Cartesian Coordinates 

V, = Vz cos p + Vy sin p, V, = —Vz sin ọ + Vy cos 9, V, = Vz. (13.19) 
3. Cartesian Coordinates in Terms of Spherical Coordinates 

V, = V, sind cosy — V, sing + Vy cos o cos 2, 

V; = V, sin vsin y + V cos o -- Vs sin cos 0, 

V; = V, cos V — V; sin 9. (13.20 
4. Spherical Coordinates in Terms of Cartesian Coordinates 

V, = V; sin cos o 4- V, sin üsin o + V; cos ð, 

Va — V; cos cos o 4- V, cos ) sin q — V; sin, (13.21 

Vo -— —V.sing- V,cosq. 
5. Expression of a Spherical Vector Field in Cartesian Coordinates 

V= o(u a? +y? + 22)(ai + yj zk). (13.22 
6. Expression of a Cylindrical Vector Field in Cartesian Coordinates 

V = p(y x£? + y?) (xi 4- yj). (13:23 





In the case of a spherical vector field, spherical coordinates are most convenient for investigations, 
i.e., the form V — V(r)e,; and for investigations in cylindrical fields, cylindrical coordinates are mos 


convenient, i.e., the form V — V(w)é,. In the case of a plane field (Fig. 13.8) 


V =V,(2,y)i + Vy(x, y)j = Volp, ϱ)6, + Volp, P)Ep, 


holds and for a circular field 


V = (2? + ?)(ai + yi) = e(o)6- 


(13.24 





(13.25 


Table 13.1 Relations between the components of a vector in Cartesian, cylindrical, and spherical 


coordinates 





Cartesian coordinates 


Cylindrical coord. 


Spherical coordinates 





V = Vip + VySy + Vie 


Vie, + V;& 4- V;e; 


V.e, 4- Voes 4- Vie, 




















Vy =V,cosp—V,sing | = V, sin Ù cos y + Va cos V cos p 
— Vọsin 

Vy = V, sin y + Vp coso | = V, sin ù sin 9 + Vg cos ù sin p 
+ Vọ cos p 

Vz =V, — V. cos Ü — Vo sin Ü 

V, cos y + Vy sin =V, = V, sin V + Vo cos V 

— Vz sin y + V, cos p = Vo =V, 

Vz =V; = V, cos Ù — Vo sin Ü 

V, sind cosy + Vysindsiny + V; cos | = Vpsin ð + V; cosð | = V, 

Vz cos Ù cos p + Vy cos V sin p — V; sing | = V, cos ù — Vzsinù | = Vo 

— Vz sin Y + Vy cos y = Vp =V; 
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13.1.3.5 Vector Lines 

A curve C is called a line of a vector or a vector line of the vector field 
V(r) (Fig. 13.9) if the vector V(F) is a tangent vector of the curve 
at every point P. There is a vector line passing through every point 
of the field. Vector lines do not intersect each other, except, maybe, 


at points where the function V is not defined, or where it is the zero 
vector. The differential equations of the vector lines of a vector field Figure 13.9 


V given in Cartesian coordinates are 


dr d dz lv d 
a) in general: V = Da =z (13.26a) b) for a plane field: V = W (13.26b) 
'To solve these differential equations see 9.1.1.2, p. 542 or 9.2.1.1, p. 571. 
W A: The vector lines of a central field are rays starting at the center of the vector field. 


Bl B: The vector lines of the vector field V = € x T are circles lying in planes perpendicular to the 
vector C. Their centers are on the axis parallel to €. 


13.2 Differential Operators of Space 


13.2.1 Directional and Space Derivatives 


13.2.1.1 Directional Derivative of a Scalar Field 


The directional derivative of a scalar field U = U(r) at a point P with position vector r in the direction 
€ (Fig. 13.10) is defined as the limit of the quotient 


BU ee i) (13.27) 
oc 20 E 


If the derivative of the field U = U(r) at a point r in the direction of the unit vector €? of c is denoted 


by EUR then the relation between the derivative of the function with respect to the vector c and with 
c 
respect to its unit vector c? at the same point is 
OU _, OU 
— = |c|—. (13.28) 
Oc Oc 





The derivative with respect to the unit vector represents the speed of increase of the function U in 


Oc? 
the direction of the vector c? at the point r. If ii is the normal unit vector to the level surface passing 
ΜΡ T E ; . ου 
through the point r, and n is pointing in the direction of increasing U, then On has the greatest value 
Hn 
among all the derivatives at the point with respect to the unit vectors in different directions. Between 
the directional derivatives with respect to 1 and with respect to any direction €° holds the relation 
OU οὐ o oU Ü 
= = => cos(€",n) = ~cosp=€"-gradU (see (13.34), p. 710). 13.29 
ago T ga CE HH) = Be cosp = C+ gradU (see (13.34), p. 710) (13.29) 


Hereafter, directional derivatives always mean the directional derivative with respect to a unit vector. 


13.2.1.2 Directional Derivative of a Vector Field 
The directional derivative of a vector field is defined analogously to the directional derivative of a scalar 


field. The directional derivative of the vector field V = Να) at a point P with position vector r 
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Figure 13.10 Figure 13.11 
(Fig. 13.11) with respect to the vector a is defined as the limit of the quotient 
NE I ανετα 
ο cig. EE VIE (13.30) 
θα ε-»θ ε 


If the derivative of the vector field V — Να) at a point T in the direction of the unit vector a? of & is 


av 
denoted by = , then 


0a’ 
2 
—— 13.31 
= ales (13.31) 
In duc coordinates, i.e., for v= V6; 4- Vj8j - V;6;, & — a;€, -- a8, 4 a;8, , holds: 
(&- grad)V — (&- grad V,)ez -- (&- grad V,)e; -- (& - grad V;)e; , (13.32a) 


in general coordinates: 


(&- grad)V = : 5 rot (V x &) 4- grad (&- V) + ádiv V — Vdiv& — à x rot V — V x rota. 
(13.32b) 
13.2.1.3 Volume Derivative 


Volume derivatives of a scalar field U = U(r) or a vector field V at a point F are quantities of three 
forms, which are obtained as follows: 
1. Surrounding the point r of the scalar field or of the vector field by a closed surface X. This surface can 
be represented in parametric form F = F(u, v) = z(u, v)ēs +y(u, v)€y + z(u, v)ē , so the corresponding 
vectorial surface element is 
= Or δε 
dS = — x —dudv. 13.33a 

ðu v ( ) 
2. Evaluating the surface integral over the closed surface ©. Here, the following three types of integrals 
can be considered: 


f U d$, f V. d$, p V x d&. (13.33b) 
(5) (Ὁ) (Ὁ) 
3. pes the limits (if iu exist) 


lim = f Ud$, lm. vf. d$, lm vý V x d8. (13.33c) 


Here V e the volume of i region of space s contains the point with the position vector 
inside, and which is bounded by the considered closed surface X . 
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The limits (13.33c) are called volume derivatives. The gradient of a scalar field and the divergence and 
the rotation of a vector field can be derived from them in the given order. In the following paragraphs, 
these notions will be discussed in details (even defining them again.) 


13.2.2 Gradient ofa Scalar Field 

The gradient of a scalar field can be defined in different ways. 

13.2.2.1 Definition of the Gradient 

The gradient of a function U is a vector grad U, which can be assigned to every point P with the vector 
T of a scalar field U = U (F), having the following properties: 

1. The direction of grad U is always perpendicular to the direction of the level surface U = const, 
passing through the considered points P, 

2. grad U points always into the direction in which the function U is increasing, 


OU , : : ολα xis : 
3. |grad U| = om i.c., the magnitude of grad U is equal to the directional derivative of U in the normal 
n 

direction. 
If the gradient is defined in another way, e.g., as a volume derivative or by the differential operator, 
then the previous defining properties became consequences of the definition. 
13.2.2.2 Gradient and Directional Derivative 
The directional derivative of the scalar field U with respect to the unit vector €° is equal to the projec- 
tion of grad U onto the direction of the unit vector c?: 

ου 

Oc? 
i.e., the directional derivative can be calculated as the dot product of the gradient and the unit vector 
pointing into the required direction. 


— c? . grad U, (13.34) 


Remark: The directional derivative at certain points in certain directions may also exist if the gradient, 
does not exist there. 


13.2.2.3 Gradient and Volume Derivative 


The gradient U of the scalar field U = U (F) at a point rf can be defined as its volume derivative. If the 
following limit exists, then it is called the gradient of U at r: 


ϕ U dš 


20 (3) 
srad U = lim ————. 
grad U lim V 
Here V is the volume of the region of space containing the point belonging to r inside and bounded by 
the closed surface X. (If the independent variable is not a three-dimensional vector, then the gradient 
is defined by the differential operator.) 


13.2.2.4 Further Properties of the Gradient 


1. The absolute value of the gradient is greater if the level lines or level surfaces drawn as mentioned 
in 13.1.2.4, 2., p. 703, are more dense. 


(13.35) 


2. The gradient is the zero vector (grad U = 0) if U has a maximum or minimum at the considered 
point. The level lines or surfaces degenerate to a point there. 


13.2.2.5 Gradient of the Scalar Field in Different Coordinates 


1. Gradient in Cartesian Coordinates 
OU(x,y,z)z; , OU(m,y,z)z  OU(v,y, z)r 
I — - k. 
δα : ὃν J+ Oz 





grad U 





(13.36) 
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2. Gradient in Cylindrical Coordinates (x = pcos », y=p sin y, z = z) 
































grad U = grad, Ue, + grad, Ue, + grad, Ue, with (13.37a) 

grad,U = As grad,U — LA grad,U — = (13.37b) 
3. Gradient in Spherical Coordinates (x = rsin cos o, y = rsin sin o, z = 
r cos 0) 

grad U = grad, Ue, + grady UEy + grad, Ue, with (13.38a) 

ου 10U 1 δῦ 

rad, U=, gradjU — -—,  grad,U = E 

βτας Br? gradyU rop 5 ad U ἜΤ (13.987) 
4. Gradient in General Orthogonal Coordinates (€, 7, C) 
For F(E, 1, ¢) = 2(£, 0, QË + yé, n, OJ + 2(E, Ok: 

grad U = grad, Ue, + grad, Ue, -- grad; U&, — where (13.394) 

1 ôU 1 OU 1 OU 
grad U = jeg| oe grad,U — [ος] δη” grad,U = or ac (13.39b) 
oE δη ὃς 

13.2.2.6 Rules of Calculations 
Assuming in the followings that € and c are constant, the following equalities hold: 

gradc — 0, grad (U + U2) = grad U; + grad U2, grad (cU) = cgrad U. (13.40 

dp 

grad (U,U2) = Uygrad Uz 4- UogradU4, grady(U) = we U. (13.41 

grad (V . V2) = (νι . grad )V» + (V2 - grad )JVi + Vi x rot Vo + Vo x rot Vi. (13.42 

grad (T - €) = c. (13.43 
1. Differential of a Scalar Field as the Total Differential of the Function U 

dU — gradU - di- ds dy ds (13.44 
2. Derivative of a Function U along a Space Curve r(t) 

d di d. dz 

qu _ OU da oU y QU dz (13.45 











WC wd od Oz dE 


3. Gradient of a Central Field 


grad U(r) v'e) (spherical field), (13.46a) gradr = - (field of unit vectors). (13.46b) 
T 
13.2.3 Vector Gradient 
The relation (13.32a) inspires the notation 
ὃν = 2 PL g = 
— =4- grad (V,é, + V,é, + V6.) = à- grad V (13.47a) 


δα 
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where grad V is called the vector gradient. It follows from the matrix notation of (13.47a) that the 
vector gradient, as a tensor, can be represented by a matrix: 






































OV, OV, OV, OV, oV; OV, 

Ox Oy Oz Ox Oy Oz 

> ὃν, ὃν, ὃν, | | *» > OV, OV, OV, 
à-grad)jV = | —4% —— —* (13.471 rad V — | — —. —* |. (134Tc 
(a: grad) Ox Oy Oz (x) αν BM Ox Oy Oz ας 

OV, OV, OV, OV, OV, OV; 

dx ὃν Oz Ox Oy Oz 


These types of tensors have a very important role in engineering sciences, e.g., for the description of 
tension and elasticity (see 4.3.2, 4., p. 282. 


13.2.4 Divergence of Vector Fields 
13.2.4.1 Definition of Divergence 
To a vector field Viz) a scalar field can be assigned which is called its divergence. The divergence is 
defined as a space derivative of the vector field at a point T: 
V.dS 
G 


div V — lim ( 
v0 


(13.48) 
If the vector field V is considered as a stream field, then the divergence can be considered as the fluid 
output or source, because it gives the amount of fluid given in a unit of volume during a unit of time 
flowing by the considered point of the vector field V. In the case div V > 0 the point is called a source, 
in the case div V < 0 it is called a sink. 

13.2.4.2 Divergence in Different Coordinates 

1. Divergence in Cartesian Coordinates 


26 ôV ôV% ƏV, 
aN ae oie ay 





(13.49a) with. V(r,y, z) 2 V 4- Vj + Vek. (13.49b) 


The scalar field div V can be represented as the dot product of the nabla operator V and the vector V 
as 

divV — V. V (13.49c) 
and it is translation and rotation invariant, i.e., scalar invariant (see 4.3.3.2, p. 283). 
2. Divergence in Cylindrical Coordinates 
. e 10(pV)., 10V, ὃν, . > . - i 
div V — 2 p + την ta (13.504) with —V(p, p, z) — V;&, 4- Vpēp + V.&,. (13.50b) 
3. Divergence in Spherical Coordinates 
1 à(r?V,) 1 O(sinOV;) 1 OV, 





liv V = 13.51a 
τ τ Or ' ναἰαὐ [o ' rsinó Ov (3-51a) 
with V(r, 0, p) = Vee» + Voby + Veep. (13.51b) 


4. Divergence in General Orthogonal Coordinates 


.5o 18 |ϑε|195 ὃ (1ϑε||ϑε ð 
ον 5 Lac (los δὲ DE δε v) | τ 


or 
ὃς 






































or 
2a 
a D \ (13.52a) 
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with F(£,n, C) = 2(£, n, Qi y(n, OF + 26.0, Ok, (13.52b) 
OF OF OF ὅτι οτι |Or - 
D= (555) = δε] [s ‘lac (13.52c) and. V(£,r, C) 2 Ve& + Vp, + VeEe. (13.52d) 




















13.2.4.3 Rules for Evaluation of the Divergence 


divé— 0, div(V, 4 V2) = div V, + div V2, div (cV) — cdiv V. (13.53) 
div (UV) — U div V 4- V - grad U (socias div(rc) — 2 : (13.54) 
$ 
div (V, x V3) 2 Va - rot V1 — V4 - rot Va. (13.55) 
13.2.4.4 Divergence of a Central Field 
div? =3, divg(r)¥ = 3y(r) + ro (r). (13.56) 


13.2.5 Rotation of Vector Fields 
13.2.5.1 Definitions of the Rotation 


1. Definition p 2 t 
The rotation or curl of a vector field V at the point r is a vector denoted by rot V, curl V or with the 
nabla operator V x V, and defined as the negative space derivative of the vector field: 
$ V x dS fp dS x V 
e — lim e 

V0 V 








rot V lim (13.57) 
V0 

2. Definition z 

The vector field of the rotation of the vector field V (r) can be defined in the following way: 


a) Putting a small surface sheet S (Fig. 13.12) 

through the point r and describing this surface sheet 
ΗΝ by a vector S whose direction is the direction of the 
Vdf surface normal ii and its absolute value is equal to the 
area of this surface region. The boundary of this sur- 
face is denoted by C. 


K 





b) Evaluating the integral f V. dr along the closed 
(C) 

boundary curve C of the surface (the sense of the curve 

is positive looking to the surface from the direction of 

the surface normal (see Fig. 13.12). 





1 m 
c) Determining the limit (if it exists) lim 5 f V. dY, 
=p v 
(C) 


I while the position of the surface sheet remains un- 
Figure 13.12 changed. 


d) Changing the position of the surface sheet in order to get a maximum value of the limit. The surface 
area in this position is Smax and the corresponding boundary curve is Cmax- 
e) Determining the vector rot r at the point 7, whose absolute value is equal to the maximum value 
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found above and its direction coincides with the direction of the surface normal of the corresponding 
surface. Then one gets: 


V.dr 
[τοι v| B ,.im LN (13.584) 


max —0 Oma 

The projection of rot V onto the surface normal ii of a surface with area S , ie. the component of the 
vector rot V in an arbitrary direction n = Tis 
f V.dr 

X oos Sq xRegm (0) 

l- rot V = rot; V = lim ——z— . (13.58b) 

S30 S 

The vector lines of the field rot V are called the curl lines of the vector field V. 


13.2.5.2 Rotation in Different Coordinates 


1. Rotation in Cartesian Coordinates 


> ὃν, OV, 7[OV, OV, 2[(0V, OV, 
tV =i z y Fi T 2 ΕΚ y zi 
τ i ( ðy Oz ) J ( Oz Ox ) ( Or | Oy ) 





(13.594) 


οσο ον --ι 
SUE|ec-a4 
SS esum 


The vector field rot V can be represented as the cross product of the nabla operator V and the vector 


V: 








rot V — V x V. (13.59b) 
EN 2. Rotation in Cylindrical Coordinates 
rot V = rot Vē, + rot „Vē, + rot -Vē with (13.60a) 
= 10V, OV, > OV, OV, 4, 1f 0 OV, 
t,V-- AS ote V = 2 ot; V = — --- Se εν 13.601 
TOt p ve az? Tte ὃς δρ᾽ ro TEGA 2 (13.60b) 


3. Rotation in Spherical Coordinates 


rot V = rot, Vē, + rot Vē + rot nu with (13.61a) 


rot, V = a (nov) — τ.) ; 








rsinv | 0) 
= 1 0V. 10, 
rotg V = "OW ras e» (13.61b) 
- 1/0 oV, 
Tot V = s (Zev) - 50 } : 


4. Rotation in General Orthogonal Coordinates 


rot V = rot ; V& -- rot, V&, -- rot c V&; with (13.62a) 
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= 1 dr|[ 0 195 a (lor 
teV = —|—]|— (Jon) -= Je 
"er cg ος (ση [ος νο) = (oe w) 
= 1|ϑ9τ|[9 f|ür 9 {195 
sv EE? (5. - 2 (65.3) m 
- 11959 (ος 9 195 
r7 = p fa læ (lal) = a Aæ |: 
z lEn Otun OTt zE n OR .|9r| |Br| |a 
ο ο ο D= LEE (13.620) 
13.2.5.3 Rules for Evaluating the Rotation 
rot(V4-- V3) — rot V, --rot Va, | rot (cV) — crot V. (13.63) 
rot (UV) = U rot V 4- gradU x V. (13.64) 
rot (V4 x V3) — (Va - grad V3 — (Vi - grad Va 4- V1 div Va — V3 div Vi. (13.65) 


13.2.5.4 Rotation of a Potential Field 
This also follows from the Stokes theorem (see 13.3.3.2, p. 725) that the rotation of a potential field is 
identically zero: 

rot V — rot (grad U) — 0. (13.66) 


This also follows from (13.59a) for V= grad U, if the assumptions of the Schwarz interchanging theo- 
rem are fulfilled (see 6.2.2.2, 1., p. 448). 


E Forf = ri + yj + zk with r = |F] = Vz? + y? + 2 holds: rot F = 6 and rot (y(r)#) = 6, where 
(r) is a differentiable function of r. 


13.2.6 Nabla Operator, Laplace Operator 
13.2.6.1 Nabla Operator 


The symbolic vector V is called the nabla operator. Its use simplifies the representation of and calcu- 
lations with space differential operators. In Cartesian coordinates holds 
ὃν ὃν ὃν 
V= —i+—j+—k. 13.67 
Ox ay? Oz ( ) 
The components of the nabla operator are considered as partial differential operators, i.e., the symbol 
Jz means partial differentiation with respect to x, where the other variables are considered as con- 
x 
stants. 
The formulas for spatial differential operators in Cartesian coordinates can be obtained by formal mul- 
tiplication of this vector operator by the scalar U or by the vector V. For instance, in the case of the 
operators gradient, vector gradient, divergence, and rotation: 


grad U = VU (gradient of U (see 13.2.2, p. 710)), (13.68a) 
grad Ý = V V (vector gradient of V (see 13.2.3, p. 711)), (13.68b) 
div V — V. V (divergence of V (see 13.2.4, p. 712)), (13.68c) 
rot V — V x V (rotation or curl of V — (sce 13.2.5, p. 713)). (13.68d) 
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13.2.6.2 Rules for Calculations with the Nabla Operator 


1. If V stands in front of a linear combination 57 a; X; with constants a; and with point functions Xj, 
then, independently of whether they are scalar or vector functions, we have the formula: 


2. If V is applied to a product of scalar or vector functions, then it has to be applied to each of these 
functions after each other and the results are to be added. There is a |, above the symbol of the function 
submitted to the operation 


4 4 4 
V(XYZ)=V(XYZ)+V(XYZ)+V(XY Z), ie. (13.70) 
V(XYZ) = (VX)YZ + X(VY)Z) + XY (VZ). 


Then the products have to be transformed according to vector algebra so as the operator V is applied 
to only one factor with the sign |. Having performed the computation one omits that sign. 


: : l ER e : 
W A: div(UV) - V(UV) » V(U V) - V(U V) 2 V- VU -UV - V — V - gradU -- U div V. 

Y" - Lo ad : ο 
WB B: grad (ViV5) — V(V1V5) — V( V4 V3) 4- V(Vi Vo). Because Ῥ(ᾶ6) -- (αρ)ό-- ἄχ(οχ ο 
follows: grad (ΝΙΝ) = (V3 V)V4 + Vs x (V x Vi) + (V1V)V5 + Vi x (V x V2) 














= (Vagrad Vi + Vo x rot νι + (Vigrad )Vs + Vi x rot Vo. 
13.2.6.3 Vector Gradient 
The vector gradient grad V is represented by the nabla operator as 

grad V = VV. (13.71a) 
The expression occurring in the vector gradient (& - V)V (sce (13.32b), p. 709) has the form: 

2(&- V)V — rot (V x a) + grad (aV) + adiv V — Vdiva— ax rot V—Vx rota. (13.71b) 
In particular one gets for r — αἲ 4- yj 4 zk: 

(a-V)F=a. (13.710) 
13.2.6.4 Nabla Operator Applied Twice 
For every field V: 

V(V x V) — divrot V = 0, (13.72) V x (VU) = rot grad U = Õ, (13.73) 


V(VU) = div grad U = AU. (13.74) 


13.2.6.5 Laplace Operator 


1. Definition 
The dot product of the nabla operator with itself is called the Laplace operator: 

A=V.V=V. (13.75) 
The Laplace operator is not a vector. It prescribes the summation of the second partial derivatives. It 
can be applied to scalar functions as well as to vector functions. The application to a vector function, 
componentwise, results in a vector. 
The Laplace operator is an invariant, i.e., it does not change during translation and/or rotation of the 
coordinate system. 
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2. Formulas for the Laplace Operator in Different Coordinates 


Here the Laplace operator is applied to the scalar point function U(r). Then the result is a scalar. The 


application of it for vector functions V(r) results in a vector AV with components AV,, AV,, ΔΥ.. 


1. Laplace Operator in Cartesian Coordinates 
LOU PU PU 

Or? Oy O2 
2. Laplace Operator in Cylindrical Coordinates 
198 δυ 19 OU 
pap (a) pog as 


3. Laplace Operator in Spherical Coordinates 


AU (x, y, z) 


AU(p, 9, 2) = 


; 18 f3) 1 Af, QU 1 OU 
AU(r, 9, p) r? Or ( =) " 72 sin 0 OU (sn 5 a) MN r? sin? 9 0g? © 


4. Laplace Operator in General Orthogonal Coordinates 





158 D OU ὃ D OU ὃ D OU 























ANE) =D | a6 | Tore ae | "δη | Τουρ an | * aC | yore ac || Un 
δε δη ὃς 

: z M : or| |Oz| | ar 

PE, n, C) = alén, Ci + yl n, OJ + z(£.m CK, — (13.79b) D= ael lanl al 




















3. Special Relations between the Nabla Operator and Laplace Operator 


V(V - V) — graddiv V, 
V x (V x V) — rotrot V, 
V(V.V)-Vx(VxV)- AV, where 








Ox? + Oy? Oz? 


> OV, OV, 9WN. 
tl >> +55 4+54 k. 
Ox? Oy? Oz? 





2 2y a2 
AV = (V- VW = avis avis avi- (75 ov. πη 


(9v,  δῦν, δν, 

l ( r? T Oy? + op 
13.2.7 Review of Spatial Differential Operations 
13.2.7.1 Rules of Calculation for Spatial Differential Operators 


U, Ui, U5 and F are scalar functions; c is a constant; V, V4, V9 are vector functions: 


grad (U; + U2) = grad U; + grad Ug. (13.84) grad (cU) — cgrad U. 


ο. 


grad (U,U5) = Ui grad U54-U» grad U4.(13.86) grad F(U) = F'(U) grad U. 


div (Vi + V2) — div Vi c div Vo. (13.88) div (cV) — cdiv V. 


(13.76) 


(13.77) 


(13.78) 


(13.792) 


(13.79c) 


(13.80) 
(13.81) 
(13.82) 


(13.83) 


(13.85) 
(13.87) 


(13.89) 
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div (UV) — V - gradU -- U div V. 


rot (cV) — crot V. 


div rot V = 0. 


div grad U = AU. 


(13.90) rot (Vi + V2) = rot Vi +rot V2. (13.91) 
(13.92) rot (UV) = Urot V—V x gradU. (13.93) 
(13.94) rot gradU = 0 (zero vector). (13.95) 
(13.96) rotrot V — grad div V — AV. (13.97) 


div (V4 x V3) — Va-rot V; —V rot V3.(13.98) 


13.2.7.2 Expressions of Vector Analysis in Cartesian, Cylindrical, and 


Spherical Coordinates (see Table 13.2) 


Table 13.2 Expressions of vector analysis in Cartesian, cylindrical, and spherical coordinates 

























































































Cartesian coordinates | Cylindrical coordinates | Spherical coordinates 
ds = d¥ | €,dx + éydy + €,dz é,dp + €,pdyp + €,dz Edr + éyrdü -- &,r sin üdip 
radU εδύ La U pg 2U g 3U ε 100 „g U g 2U Le, 19U z 1 OU 
E "Oy | " Oy * Oz ? gp Ῥρϑῳ 8 "ar |r av ?rsinÜ p 
= | ôV» ƏV; OV, 1 ὃ 18V, OV. 1 ὃ 1 8 
livV 5 y z 7 p # 9 " ο sin d 
κα δα Y Oy T Oz nape) m p Op Oz r? rV Ver rsinÜ ga sin) 
1 W, 
rsinÜ Op 
> ὃν, OV, 18V, OV, 1 ὃ ; OV; 
ο: é, fo” gale a é,—— | (V, sin) — 
m * ( Oy Oz ) se E Op Oz ) err sind lo Mase) «| 
geom e qd nk s αν se ing 
Y\ dz Ox °\ Oz Op r|sinvd 0p Or’ ? 
OV, OV afl a 10V, LPa oV, 
i e = a) = anor) ρ 23 gor [gett | 
OU PU QU 10 οὐ 1 90 10 {αν 
AU aa 1 p t— f 
Ox? Oy 02 pOp NN Op p og? τὰ ὃτ Or 
+ PSU d 2 singt 
Oz r? sin ð 00 Ov 
1 OU 
r? sin? 9 0? 
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13.2.7.3 Fundamental Relations and Results (see Table 13.3) 


Table 13.3 Fundamental relations for spatial differential operators 








Operator Symbol | Relation | Argument | Result Meaning 

Gradient grad U VU scalar vector maximal increase 
Vector gradient | grad V [VV vector tensor second order 

Divergence div V ν.ν vector scalar source, sink 
Rotation rot V VxV vector vector curl 

Laplace operator | AU (V-V)U | scalar scalar potential field source 
Laplace operator | AV (V- V)V vector vector 























13.3 Integrationin Vector Fields 

Integration in vector fields is usually performed in Cartesian, cylindrical or in spherical coordinate 
systems. Usually one integrates along curves, surfaces, or volumes. The line, surface, and volume 
elements needed for these calculations are collected in Table 13.4. 





Table 13.4 Line, surface, and volume elements in Cartesian, cylindrical, and spherical coordinates 





Cartesian coordinates Cylindrical coordinates Spherical coordinates 





dr | €,dx  é,dy + €,dz &,dp -- &;pdq 4- e,dz é,dr + éyrd + &r sin dip 





dS é,dydz + 6,dxdz + €,dardy | €,pdpdz + €,dpdz + €.pdpdy | &,r? sin Vdddy 
+€yr sin Vdrdy 























+é,rdrdddp 

dv* | drdydz pdpdydz 1? sin üdrdüd 

& ὃν Χχ 6, Ep = &p x €, 6, = Ey X Ey 

Ey = €z X ër é, = &, x &, dj —8, x6, 

Ez = Er X &y E= E, X Bp 6, = &, x Ey 
---[ τή κο jee-( is 


The indices ¿į and j take the place of x, y, z or p, p, Z or r, 9, qp. 





The volume is denoted here by v to avoid confusion with the absolute value of the vector 
function |V| = V. 














13.3.1 Line Integral and Potential in Vector Fields 
13.3.1.1 Line Integral in Vector Fields 


1. Definition The scalar-valued curvilinear integral or line integral of a vector function γα) along 
a rectificable curve AB (Fig. 13.13) is the scalar value 
Pe / V(z) - dz. (13.992) 


AB 
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Figure 13.13 


2. Evaluation of this Integral in Five Steps 

a) Dividing the path AB (Fig. 13.13) by division points 

Ai(ri) A2(r9),..., An i(rn i1) (A — Ao, B = Απ) 

into n small arcs which are approximated by the vectors 

Τι = τι = AT; 4. 

b) Choosing arbitrarily the points P; with position vec- 

tors & lying inside or at the boundary of each small arc. 

c) Calculating the dot product of the value of the func- 

tion V(&) at these chosen points with the corresponding 
Tj 1. 

d) Taking the sum of all the n products. 

e) Calculating the limit of the sums got in this way 


32 V(&) - Afi 4 for |Af; 4| — 0, while n — oo obvi- 
i=l 
ously. 


If this limit exists independently of the choice of the points A; and P;, then it is called the line integral 


[ν-ᾱ-- im Y VÈ) Af. 
E i-l 


AB 


IAf; -11-0 
no 


(13.99b) 


A sufficient condition for the existence of the line integral (13.99a,b) is that the vector function V (F) 


A 


Figure 13.14 


5 
and the curve AB are continuous and the curve has a tangent varying continuously. 


A vector function V(r) is continuous if its components, the three scalar functions, 


are continuous. 


13.3.1.2 Interpretation of the Line Integral in Mechanics 
If V(®) is a field of force, i.e., V(r) — F(r), then the line integral (13.99a) 


represents the work done by F while a particle rn moves along the path AB 


(Fig. 13.13,13.14). 


13.3.1.3 Properties of the Line Integral 


/ VE- d= / V. a+ f V(#)- dé — (Fig. 13.14). (13.100) 
ABC AB BC 

[ v9 de=- f YE) a (13.101) 
AB BÀ 

/ [Va + Ww] - a= J VE) -dr + / We): dz. (13.102) 
AB AB AB 

/ cV(z)- dg — c / V(E): df ^ (cconst). (13.103) 
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13.3.1.4 Line Integral in Cartesian Coordinates 
In Cartesian coordinates the following formula holds: 
"M )- dř = J (V, dz -- V, dy + Vz dz). (13.104) 


AB 


13.3.1.5 Integral Along a Closed Curve in a Vector Field 
A line integral is called a contour integral if the path of integration is a closed curve. If the scalar value 
of the integral is denoted by P and the closed curve is denoted by C, then the following notation is used: 
= j να). ας. (19.105) 
(C) 


13.3.1.6 Conservative Field or Potential Field 


1. Definition 

If the value P of the line integral (13.99a) in a vector field depends only on the initial point A and the 
endpoint B, and is independent of the path between them, then this field is called a conservative field 
or a potential field. 

The value of the contour integral in a conservative field is always equal to zero: 


/νῶ . di — 0. (13.106) 
(C) 

A conservative field is always irrotational: 
rot V — 06, (13.107) 


and conversely, this equality is a sufficient condition for a vector field to be conservative. Of course, it 
is to be supposed that the partial derivatives of the field function V are continuous with respect to the 


corresponding coordinates, and the domain of V is simply connected. This condition, also called the 
integrability condition (see 8.3.4.2, p. 521), has the following form in Cartesian coordinates 


OV, OV, OV, OV, ὃν. OV, 











; = : : 13.108 
ὃν ὃς Oz ὃν Ox Oz ( ) 
2. Potential of a Conservative Field, 
or its potential function or briefly its potential is the scalar function 

συ -- /νῶ -αξ. (13.1092) 


In a conservative field it is calculated with a fixed initial point A(ro) and a variable endpoint B(r) as 
the line integral 


m /νῶ: d£. (13.109b) 


Remark: In physics, the potential U*(#) of a function V (F) at the point F is often considered with the 
opposite sign: 


υπ) -- -[ve : di — —U(r). (13.110) 
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3. Relations between Gradient, Line Integral, and Potential 
If the relation V (T) = grad U (F) holds, then U (F) is the potential of the field V (F), and conversely, V (F) 
is a conservative or potential field. In physics often the negative sign is used corresponding to (13.110). 
4. Calculation of the Potential in a Conservative Field 2 
If the function V (r) is given in Cartesian coordinates V — V;i-J- V,j -- V.k, then for the total differen- 
tial of its potential function U 

dU = V; dx + Vy dy + V; dz (13.111a) 
holds. Here, the coefficients Vz, Vy, Vz must fulfill the integrability 
condition (13.108). The determination of U follows from the equa- 
tion system 


OU OU OU 
ey, oe ee. 13.1111 
Ox Oy V ^ Oz ( >) 





In practice, the calculation of the potential can be done by perform- 
ing the integration along three straight line segments parallel to the 
Figure 13.15 coordinate axes and connected to each other (Fig. 13.15): 


r E T y z 
U= [V dř = U (x, yo; zo) «f Vi (zx, yo; Zo) de+ f Vy (x, y, zo) dy «f V.(a, y, z) dz . (13.112) 
9 Jzo Jyo Jz 
To 


13.3.2 Surface Integrals 
13.3.2.1 Vector ofa Plane Sheet 


The vector representation of the surface integral of general type (see 8.3.4.2, p. 537) requires to assign 
a vector S to a plane surface region S, which is perpendicular to this region and its absolute value is 
equal to the area of S. Fig. 13.16a shows the case of a plane sheet. The positive direction in S is 
given by defining the positive sense along a closed curve C according to the right-hand law (also called 
right-screw rule): Looking from the initial point of the vector into the direction of its final point, then 
the positive sense is the clockwise direction. By this choice of orientation of the boundary curve one 
fixes the exterior side of this surface region, i.e., the side on which the vector lies. This definition works 
in the case of any surface region bounded by a closed curve (Fig. 13.16b,c). 











5 
C exterior side | interior side | | 
7 pe — 
a) b) 4 C c) >e 


Figure 13.16 


13.3.2.2 Evaluation of the Surface Integral 


The evaluation of a surface integral in scalar or vector fields is independent of whether the surface S is 
bounded by a closed curve or is itself a closed surface. The evaluation is performed in five steps: 

a) Dividing the surface region S on the exterior side defined by the orientation of the boundary curve 
(Fig. 13.17) into n arbitrary elementary surfaces AS; so that each of these surface elements can be 
approximated by a plane surface element. Assigning the vector AS; to every surface element AS; as 
given in (13.33a). In the case of a closed surface, the positive direction is defined so that the exterior 


13.3 Integration in Vector Fields 723 





side is where AS; should start. 
b) Choosing an arbitrary point P; with the position vector rj inside or on the boundary of each surface 
element. 


c) Producing the products U(#}) AS; in the case of a scalar field and the product V (5i) - AS; or V(#) x 
AS; in the case of a vector field. 

d) Taking the sum of all these products. 

e) Evaluating the limit while the diameters of AS; tend to zero, i.e., |AS;| — 0 for n — oo. So, the 


surface elements tend to zero in the sense given in 8.4.1, 1., p. 524, for double integrals. 
If this limit exists independently of the partition and of the choice of the points rj, then one calls it the 


surface integral of V on the given surface. 


yz 























Figure 13.17 Figure 13.18 


13.3.2.3 Surface Integrals and Flow of Fields 
1. Vector Flow of a Scalar Field 





P= Πα ΣΟ) ΔΘ, - / U(#) dS. (13.113) 
ΙΔ8:[-»0 ici 
n—oo (8) 
2. Scalar Flow of a Vector Field 
Q= lim YV(z): AS; - [ dS. (13.114) 
|Δ8:|-»0 1-1 5 
n—oo ( ) 
3. Vector Flow of a Vector Field 
R= lim S)V(#) x AS; = [vo x d$. (13.115) 
|AS;|20 i-i (3) 
n—oo 


13.3.2.4 Surface Integral in Cartesian Coordinates as 
Surface Integrals of Second Type 


[ud-= J [Uayaei+ μμ ο f [ vocavi (13.116) 
(S) 


ὧν) (5.2) (Ses) 
[ 9 48- [ [Vedyde+ NATI [ [ Vede dy. (13.117) 
(S) (Syz) (Sza) (Sey) 
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Jd d5 = "nr Vj - Vk) dy dz 1- [[v- Vi dzdz + f[[oi-v. 5) de dy.(13.118) 


(Syz) (S2) (Szy) 


The existence theorems for these integrals can be given similarly to those in 8.5.2, 4., p. 537. 

In the formulas above, each of the integrals is taken over the projection S on the corresponding coor- 
dinate plane (Fig. 13.18), where one of the variables x, y or z should be expressed by the others from 
the equation of S. 


Remark: Integrals over a closed surface are denoted by 
jud = gu as, jv ds = $v- dS, jy x dS = gv x d$. (13.119) 
(8) (S) S) (8) (S) 
W A: Calculate the integral P- / σσ dS, where the surface is the planeregion r4-y4-z — 1 bounded 
(S) 
: the coordinate planes. The upward side is the positive side: 


Life —y—z)yzdydzi+ [fa —a —2)xzdzdxj+ [fa — x — y)xy dx dy k; 


(Sze) = 
1-2 
TH f 1 — y — z)yz dy dz = [ Í (1 — y — z)yzdydz — zs We get the two further integrals 
(Suz) 
1 
analogously. The result is: P= 120 —( + j + k). 
W B: Calculate the integral Q — 1 . d$ — Hi v dydz 4 L ydz dx + um dy over the 
(Syz) (S22) (Szy) 
1-2 Τ 
same plane region as in A: J / x dy dz -f ' (1—y—2z)dydz — 5 Both other integrals are 
dl Jo Jo 
ld * 
calculated similarly. The result is: Q — — 6*6 5 + as 
E C: Calculate the integral Ñ = 1 Fx dS = [oi i+ yj + 2k) x (dydzi+ dzdxj + dz dy K), where 
(S) (5) 


the surface region is the same as in A: Performing the computations gives R = 0. 


13.3.3 Integral Theorems 


13.3.3.1 Integral Theorem and Integral Formula of Gauss 


1. Integral Theorem of Gauss or the Divergence Theorem 


The integral theorem of Gauss gives the relation between a volume integral of the divergence of V over 
a volume v, and a surface integral over the surface S surrounding this volume. The orientation of the 
surface (see 8.5.2.1, p. 535) is defined so that the exterior side is the positive one. The vector function 
V should be continuous, their first partial derivatives should exist and be continuous. The integral 
theorem of Gauss reads as follows: 


f Vd = JII div V dv, (13.1202) 
(S) (v) 
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i.e., the scalar flow of the field V through a closed surface S is equal to the integral of divergence of V 
over the volume v bounded by S. In Cartesian coordinates one gets: 


po. dy dz + Vy dz dz + V, dz dy) = Jl (Gee 2 m E) dx dy dz. (13.120b) 








2. Integral Formula of Gauss 

In the planar case, the integral theorem of Gauss restricted to the x, y plane becomes the integral for- 
mula of Gauss. It represents the correspondence between a line integral and the corresponding surface 
integral. The integral formula of Gauss reads as follows: 


JI je T,y) EE ded) T [P(z, y) dz + Q(z, y) dy]. (13.121) 


B ae a plane region which is bounded by C. P and Q are continuous functions with continuous 
first partial derivatives. 
3. Sector Formula 


The sector formula is an important special case of the Gauss integral formula to calculate the area of 
plane regions. For Q = x, P = —y it follows that 


F= [[ = flea — y dz]. (13.122) 
(B) (C) 


13.3.3.2 Integral Theorem of Stokes 
The integral theorem of Stokes gives the relation between a surface integral over an oriented surface 


region S, in which the vector field V is defined, and the integral along the closed boundary curve C of 
the surface S. The sense of the curve C is chosen so that the sense of traverse forms a right-screw with 
the surface normal (see 13.3.2.1, p. 722). The vector function V should be continuous and it should 
have continuous first partial derivatives. The integral theorem of Stokes reads as follows: 


n rot V - d8 = ϕ V - dz, (13.1232) 
S ο 


i.e., the vector flow of the rotation through a surface S bounded by the closed curve C is equal to the 


contour integral of the vector field V along the curve C. 
In Cartesian coordinates 


OV, OV, (Ve ὃν; [Vy We) y 
LG τ) Rupe (5 a) dade (5: x) de d| 


= $ (Va de + Vy dy + Vz d2) (13.123b) 
(C) 


holds. In the planar case, the integral theorem of Stokes, just as that of Gauss, becomes into the integral 
formula (13.121) of Gauss. 


13.3.3.3 Integral Theorems of Green 


The Green integral theorems give relations between volume and surface integrals. They are the applica- 





tions of the Gauss theorem for the function V — U, grad U2, where U; and U2 are scalar field functions 
and v is the volume surrounded by the surface S. The following theorems hold: 


i: JIR U, AUS -- grad Us - grad U;) dv = hU grad 03 ds, (13.124) 
(v) (S) 
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2. f[[ oov — U3AU;) dv = pu: grad Us — Us grad U;) - dS. (13.125) 
(v) (S) 


In particular for U, = 1, Uz = U 


3. nii AU dv — fperaau E (13.126) 
(v) (8) 


holds. In Cartesian coordinates the third Green theorem has the following form (compare (13.120b)): 








U OU OU ου ου au 

JII (GS t y? t τε) du j e dy dz 4 δι dz da 4 5: da: iy) ] (13.127) 

W A: Calculating the line integral I = j (a?y? da 4- dy - z dz) with a circle C as the intersection curve 
(€) 


of the cylinder z? + y? = a? and the plane z = 0. With the Sp theorem (13.123a) one gets: 


gê 
1 -- IW dř = Qj we d$ — — rene i jr cos? ysin !gdrdp- m with 


y=0r=0 
-- = aoe. dS = k dz dy 2d “Ἢ circle S*: a? 4- y? « a?. 


W B: Determine the flux J = f V - dS in the drift space V = ri + yit 2k through the surface S 
(8) 
of the sphere a? + y? + 2? = a?. The theorem of Gauss yields: 


d dš = eee 2? y ajenas] / | rsinvardyag = ate 


y=0 v=0 r=0 


a δ. Heat oe iw ins i in time of the heat Q of a space region v containing 


no heat source is given by Ž — a / IE coo dv (specific heat-capacity c, density o, temperature T), 


i the corresponding time- denn change of the heat flow through the surface S of v is given 


με. » = {pr grad T - dS (thermal conductivity A). Applying the theorem of Gauss for the surface 
(5) 


integral (13.120a) one gets from / / li ler ea - div (A grad T)| du — 0 the heat conduction equation 


OT T 
at — div (A grad T), which has the form F — a?AT in the case of a homogeneous solid (c, o, À 


constants). 


13.4 Evaluation of Fields 
13.4.1 PureSource Fields 


A field V, is called a pure source field or an irrotational source field when its rotation is equal to zero 
everywhere. If the divergence is q(r), then 


divV,—q(F), ^ rot V4 2 0 (13.128) 
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holds. In this case, the field has a potential U, which is defined at every point P by the Poisson differ- 
ential equation (see 13.5.2, p. 729) 


Vi=eradU, ^ divgradU — AU = q(ř), (13.1292) 


where F is the position vector of P. (In physics most often V, — —gradU is used.) The evaluation of 


U comes from 
pr Jj 
mx div V (£*) dv(£ div V(r*) dv(z*). (13.129b) 
|r — r*| 


The integration is taken over the whole of space (Fig. 13.19). The divergence of V must be differen- 
tiable and be decreasing sufficiently rapidly for large distances. 


γον m; OF qı 


m, Or qz 


Coulomb or 
gravitational field 


Vis (symbolically) 
T* dv(T) ... οκ CL T 











Figure 13.19 Figure 13.20 


13.4.2 Pure Rotation Field or Zero-Divergence Field 
A pure rotation (or curl) field or a solenoidal field is a vector field V2 whose divergence is equal to zero 
everywhere; this field is free of sources. With w(r) as the rotation density 

divV2=0, — rot V2 = W(z) (13.130a) 
hold. The rotation density w(r) cannot be arbitrary; it must satisfy the equation div w = 0. With the 
approach 


Va(F) -rotA(f) ^ divA —0, ie, rotrotÀ — w (13.130b) 
follows according to (13.97) 
graddivA - AA — W, ie, AA —-—w. (13.130c) 


So, A(#) formally satisfies the Poisson differential equation (see (13.135a), p. 729) just as the potential 


U of an irrotational field Vi and that is why it is called a vector potential. For every point P, then 
= - EE Ἱ WT") yo 
V,-rotAÁ ^ holdswith A-— Í / fi IY o”). (13.130d) 
Am |r — z*| 
The meaning of r is the same as in (13.129b); the integration is taken over the whole of space. 


13.4.3 Vector Fields with Point-Like Sources 
13.4.3.1 Coulomb Field of a Point-Like Charge 


The Coulomb field is an example of an irrotational field, which is also solenoidal, except at the location 
of the point charge q, the point source (Fig. 13.20). For the Coulomb force 
= l qq qm τ r a 
Fc = é, = q — eq , e- 13.131a 
4περ τ2 4περ 1273 1273 4περ ( ) 
holds. This force affects attractively for electric charges q1, qo with different signs and repulsively for 
charges with equal signs. &o is the electric constant (see Table 21.2, p. 1053), e is the intensity or 
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source strength of the source. The electric field strength and the electrostatic potential, generated in 
the space around the charge qı and affecting to the charge qz are given as 





Éo = e e c —-grdU, U-—- (13.131b) 
T 


U denotes the electrostatic potential of the field. The scalar flow in accordance with the theorem of 
Gauss (see (13.120a), p. 724) is equal to 47e or 0, depending on whether the surface S encloses the 
point source or not: 


a { 4πε, if S encloses the point source, 


j E-dS= 0, otherwise. 
(8) 


(13.131c) 


Because of the irrotationality of the electrostatic field 
rot Éc — 0. (13.131d) 
13.4.3.2 Gravitational Field of a Point Mass 


The field of gravity of a point mass or the Newton field is a second example of an irrotational and at the 
same time solenoidal field, except at the point of the center of mass. For the Newton mass attraction 


mm (13.132) 


Εν =y τα er 





holds, where y is the gravitational constant (see Table 21.2, p. 1053). Every relation valid for the 
Coulomb field is valid analogously also for the Newton field. 


13.4.4 Superposition of Fields 
13.4.4.1 Discrete Source Distribution 


Analogously to superposition of fields in physics, vector fields superpose each other. The superposition 
lawis: If the vector fields V, have the potentials U,, then the vector field 

V — XV, has the potential U — XU,. (13.1332) 
For n discrete point sources with source strength e, (v — 1,2,...,n), whose fields are superposed, the 
resulting field can be determined by the algebraic sum of the potentials U,: 
€y 


V(F) — —grad Mu, with U, = 


v=1 


(13.133b) 


F- r| 
Here, the vector r is again the position vector of the point under consideration, r, are the position 
vectors of the sources. 

If there is an irrotational field V4 and a zero-divergence field V3 together and they are everywhere 
continuous, then 


V=Vi+ V2. =-— ia |e I dv(r*) — rot nii os z αυ(α e. (13.133c) 


If the vector field is extended to infinity, then the decomposition of me is unique if |V (z)| decreases 
sufficient rapidly for r = |f| + oo. The integration is taken over the whole of space. 








13.4.4.2 Continuous Source Distribution 


If the sources are distributed continuously along lines, surfaces, or in domains of space, then, instead of 
the finite source strength e,, there are infinitesimals corresponding to the density of the source distri- 
butions, and instead of the sums, we have integrals over the domain. In the case of a continuous space 


distribution of source strength, the divergence is q(r) — div V. 
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Similar statements are valid for the potential of a field defined by rotation. In the case of a continuous 
space rotation distribution, the “ rotation density” is defined by W(£) = rot V. 
13.4.4.3 Conclusion 


A vector field is determined uniquely by its sources and rotations in space if all these sources and rota- 
tions located inside a finite space. 


13.5 Differential Equations of Vector Field Theory 
13.5.1 Laplace Differential Equation 


The problem to determine the potential U of a vector field Vi = grad U containing no sources, leads 
to the equation according to (13.128) with q(T) = 0 
div V, — divgradU — AU — 0, (13.1342) 
i.e., to the Laplace differential equation. In Cartesian coordinates holds: 
OU OU OU 
AU =-=; t; Ἔ ας Ξξ 0. 13.134b 
Or? Əy O2 ( ) 
Every function satisfying this differential equation and which is continuous and possesses continuous 
first and second order partial derivatives is called a Laplace or harmonic function (see also 14.1.2.2,2., 
p. 732). 
There are to distinguish three basic types of boundary value problems: 
1. Boundary value problem (for an interior domain) or Dirichlet problem: A function U(x, y, z) is 
determined, which is harmonic inside a given space or plane domain and takes the given values at the 
boundary of this domain. 


2. Boundary value problem (for an interior domain) or Neumann problem: A function U (x,y,z) is 
: ον. νο ] : 20, OU : 
determined, which is harmonic inside a given domain and whose normal derivative n takes the given 
n 
values at the boundary of this domain. 
3. Boundary value problem (for an interior domain): A function U (z, y, z) is determined, which is 
ου 
harmonic inside a given domain and the expression aU + 8 On (a, const, a?+ 6? 40) takes the 
n 


given values at the boundary of this domain. 


13.5.2 Poisson Differential Equation 


The problem to determine the potential U of a vector field V, = grad U with given divergence, leads 
to the equation according to (13.128) with q(r) 4 0 
div Vi = divgradU = AU = q(#) £0, (13.135a) 


i.e., to the Poisson differential equation. Since in Cartesian coordinates: 


OU QU PU 
AUT LLL, 13.135] 
Ox? Oy 02 ( 5b) 
the Laplace differential equation (13.134b) is a special case of the Poisson differential equation (13.135b). 
The solution is the Newton a (for point masses) or the Coulomb potential (for point charges) 


q(r*) dv(r* . 
Ue sa [[[ 252 DA (13.135c) 


The integration is taken over the whole of space. U(r) tends to zero sufficiently rapidly for increasing 
|r| values. 
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One can discuss the same three boundary value problems for the Poisson differential equation as for the 
solution of the Laplace differential equation in 13.5.1. The first and the third boundary value problems 
can be solved uniquely; for the second one there are to prescribe further special conditions (see [9.5]). 


14 Function Theory 


14.1 Functions of Complex Variables 


14.1.1 Continuity, Differentiability 
14.1.1.1 Definition of a Complex Function 


Analogously to real functions, complex values can be assigned to complex values, i.e., to the value 
z = x + iy one can assign a complex number w = u + iv, where u = u(x,y) and v = v(a,y) are 
real functions of two real variables. This relation is denoted by w = f(z). The function w = f(z) isa 
mapping from the complex z plane to the complex w plane. 

The notions of limit, continuity, and derivative of a complex function w = f(z) can be defined analo- 
gously to real functions of real variables. 


14.1.1.2 Limit of a Complex Function 

The limit of a function f(z) is equal to the complex number wo if for z approaching zo the value of the 
function f(z) approaches wo: 

wo = lim F(z). (14. 1a) 
In other words: For any positive ¢ there is a (real) 6 > 0 such that for every z satisfying (14.1b), except 
maybe zo itself, the inequality (14.1c) holds: 


[zo — z| « ô, (14.1b) [wo — f(z)] « €. (14.1c) 
The geometrical meaning is as follows: Any point z in the circle with center zo and radius 4, except 
maybe the center zo itself, is mapped into a point w = f(z) inside a circle with center wo and radius 





g in the w plane where f has its range, as shown in Fig. 14.1. The circles with radii 6 and £ are also 
called the neighborhoods U; (wo) and Us(zo). 


y v 


0 zplane x 0 w plane u 


a) b) 


Figure 14.1 
14.1.1.3 Continuous Complex Functions 


A function w = f(z) is continuous at zọ if it has a limit there, and a substitution value, and these two 
are equal, i.e., if for an arbitrarily small given neighborhood U;(wg) of the point wo — f(zo) in the w 
plane there exists a neighborhood U;(zo) of zo in the z plane such that w = f(z) € U-(wo) for every 
z € Us(zo). As represented in Fig. 14.1, U-(wo) is, e.g., a circle with radius € around the point wo. 
The continuity of f is expressed by the usual notation 


ο ο (14.2) 


14.1.1.4 Differentiability of a Complex Function 
A function w — f(z) is differentiable at z if the difference quotient 
Aw _ f@+Az)— fl) 


Αχ” A. —— (14.3) 
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has a limit for A z — 0, independently of how A z approaches zero. This limit is denoted by f’(z) and 
it is called the derivative of f(z). 

E The function f(z) = Rez = x is not differentiable at any point z = zo, since approaching zo parallel 
to the x-axis the limit of the difference quotient is one, and approaching parallel to the y-axis this value 
is zero. 


14.1.2 Analytic Functions 
14.1.2.1 Definition of Analytic Functions 


A function f(z) is called analytic, regular or holomorphic on a domain G, if it is differentiable at every 
point of G. The boundary points of G, where f’(z) does not exist, are singular points of f(z). 
The function f(z) = u(x, y) 4-iv(z, y) is differentiable in G if u and v have continuous partial derivatives 
in G with respect to x and y and they also satisfy the Cauchy-Riemann differential equations: 


ðu Qv Ou Ov 


E eae Iu LIE 14.4 
Ox Oy’ ὃν Ox ne 
The real and imaginary parts of an analytic function satisfy the Laplace differential equation: 
Ou Pu Ov Fv 
A u(r,y) — an + ay = 0, (14.5a) Av(a,y) = oj + aye =0. (14.5b) 


The derivatives of the elementary functions of a complex variable can be calculated with the help of 
the same formulas as the derivative of the corresponding real functions. 


MA: f(z) =23, f(z) = 327; MB: f(z) =sinz, f’(z) =cosz. 
14.1.2.2 Examples of Analytic Functions 


1. Elementary Functions 

The elementary algebraic and transcendental functions are analytic in the whole z plane except at 
some isolated singular points. If a function is analytic on a domain, i.e., it is differentiable, then it is 
differentiable arbitrarily many times. 

Bl A: The function w — z? with u — a? — y?, v — 2xy is everywhere analytic. 

W B: The function w = u 4- iv, defined by the equations u — 2x 4- y, v = x + 2y, is not analytic at 
any point. 

Bl C: The function f(z) — z? with f'(z) — 32? is analytic. 

B D: The function f(z) — sin z with f'(z) — cos z is analytic. 

2. Determination of the Functions u and v 

If both the functions u and v satisfy the Laplace differential equation, then they are harmonic functions 
(see 13.5.1, p. 729). If one of these harmonic functions is known, e.g., u, then the second one, as 
the conjugate harmonic function v, can be determined up to an additive constant with the Cauchy— 
Riemann differential equations: 


ο fou, z ] de _ Ou ὃ [δι I 
v- [55 dye ea) with m (5-5: ο (14.6) 


Analogously u can be determined if v is known. 
14.1.2.3 Properties of Analytic Functions 
1. Absolute Value or Modulus of an Analytic Function 
The absolute value (modulus) of an analytic function is: 
lw] = |F(2)| = y lu(z, y)? + Io, y)]? = p(z, y). (14.7) 


The surface 














w| = y(x, y) is called its relief , i.e., |w| is the third coordinate above every point z — x-4-iy. 
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Ε A: Theabsolute value ofthe function sin z = sin x cosh y+i cos z sinh yis | sin z| = ysin? £ + sinh? y. 
The relief is shown in Fig. 14.2a. 

E B: The relief of the function w — e! is shown in Fig. 14.2b. 

For the reliefs of several analytic functions see [14.10]. 
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Figure 14.2 
2. Roots 


Since the absolute value of a function is positive or zero, the relief is always above the z plane, except 
the points where | f(z)| = 0 holds, so f(z) = 0. The z values, where f(z) = 0 holds, are called the roots 
of the function f(z). 


3. Boundedness 





A function f(z) is bounded on a certain domain G, if there exists a positive number N such that | f(z)| < 
N for all z in G. In the opposite case, if no such number N exists, then the function is called unbounded 


in G. 


4. Theorem about the Maximum Value 





If w = f(z) is an analytic function on a closed domain, then the maximum of its absolute value is 
attained on the boundary of the domain. 


5. Theorem about the Constant (Theorem of Liouville) 


If w = f(z) is analytic in the whole z plane and also bounded, then this function is a constant: f(z) = 
const. 


14.1.2.4 Singular Points 

If a function w = f(z) is analytic in a neighborhood of z = a, i.e., in a small circle with center a, except 
a itself, then f has a singularity at a. There exist three types of singularities: 

1. f(z) is bounded in the neighborhood. Then there exists w = lim f(z), and setting f(a) = w the 
function becomes analytic also at a. In this case, f has a removable singularity at a. 

2. If lim |f (z)| 7 oo, then f has a pole at a. About poles of different orders see 14.3.5.1, p. 753. 


3. If f has neither a removable singularity nor a pole at a, then f has an essential singularity at a. In 
this case, for any complex w there exists a sequence zn — a such that f (zn) > w. 





1 
W A: The function w — has a pole at a. 
z—a 





734 14. Function Theory 





E B: The function w = e!/ has an essential singularity at 0 (Fig. 14.2b). 


14.1.3 Conformal Mapping 
14.1.3.1 Notion and Properties of Conformal Mappings 
1. Definition 


A mapping from the z plane to the w plane is called a conformal mapping if it is analytic and injective. 
In this case, 

w= f(z) =u+iv, f'(z) #0. (14.8) 
The conformal mapping has the following properties: 
The transformation dw = f'(z) dz of the line element dz = ( 2 is the composition of a dilatation by 
o = |f'(z)| and of a rotation by a — arg f'(z). This means that infinitesimal circles are transformed 
into almost circles, triangles into (almost) similar triangles (Fig. 14.3). The curves keep their angles 
of intersection, so an orthogonal family of curves is transformed into an orthogonal family (Fig. 14.4). 








» 

x ol 
b) 

Figure 14.3 Figure 14.4 





Remark: Conformal mappings can be found in physics, electrotechnics, hydro- and aerodynamics and 
in other areas of mathematics. 

2. The Cauchy—Riemann Equations 

The mapping between dz and dw is given by the affine differential transformation 


δι 


1 Qu Ov Ov 
du = — dz + — dy, dvu = — dz + — di 14.9a 
= ag Oy 9 ΠΕΣ ur y (14.98) 
and in matrix form 
ἄω-λας ον A= ce 2 (14.9b) 
Ug Uy 


According to the Cauchy-Riemann differential equations, A has the form of a rotation-stretching ma- 
trix (see 3.5.2.2, 2., p. 192) with o as the stretching factor: 


Au is 2 ος poe) (14.10a) 
Up Ug sina cosa 

Uz = Vy = 0 COS Q , (14.10b) o=|f'(2)| = yu? + u? = yu v2, (14.10c) 

—Uy = vr = 0 SNQ, (14.10d) a = arg f'(z) = arg (us + ivr). (14.10e) 


3. Orthogonal Systems 
The coordinate lines x = const and y = const of the z plane are transformed by a conformal mapping 
into two orthogonal families of curves. In general, a bunch of orthogonal curvilinear coordinate systems 


14.1 Functions of Complex Variables 735 





can be generated by analytic functions; and conversely, for every conformal mapping there exist an 
orthogonal net of curves which is transformed into an orthogonal coordinate system. 
W A: In the case of u = 2x + y, v = x + 2y (Fig. 14.5), orthogonality does not hold. 
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Figure 14.5 
E B: In the case w — z? the orthogonality is retained, except at the point z — 0 because here w’ = 0. 
The coordinate lines are transformed into two confocal families of parabolas (Fig. 14.6), the first 
quadrant of the z plane into the upper half of the w plane. 


y 





b) 





Figure 14.6 
14.1.3.2 Simplest Conformal Mappings 


In this paragraph, some transformations are discussed with their most important properties, and there 
are given the graphs of isometric nets in the z plane, i.e., the nets which are transformed into an or- 
thogonal Cartesian net in the w plane. The boundaries of the z regions mapped into the upper half 
of the w plane are denoted by shading. Black regions are mapped by the conformal mapping onto the 
square in the w plane with vertices (0,0), (0,1), (1,0), and (1, 1) (Fig. 14.7). 
1. Linear Function 
For the conformal mapping given in the form of a linear function 

w=az+b (a,bcomplex, const;a # 0), (14.11a) 
the transformation can be done in three steps: 


a) Rotation of the z plane by the angle a = arga according to: — u4 — ez. 
b) Stretching of the w plane by the factor |a|: w = |aļw. (14.11b) 
c) Parallel translation of the wz plane by b : w = w+ b. 


Altogether, every figure of the z plane is transformed into a similar one of the w plane. The points 





zı = œ and 2ο = I for a £ 1 are transformed into themselves, and they are called fired points. 
—a 
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Fig. 14.8 shows the orthogonal net which is transformed into the orthogonal Cartesian net. 


γα 








SKS 
SSS 
SS 





(11) 








0 u 
Figure 14.7 Figure 14.8 Figure 14.9 


2. Inversion 
The conformal mapping 


1 


w= 
2 


(14.12) 
represents an inversion with respect to the unit circle and a reflection in the real axis, namely, a point 
z — re? of the z plane with the absolute value r and with the argument ¢ is transformed into a point 
w= lee of the w plane with the absolute value 1/r and with the argument —¢ (see Fig. 14.10). 
Circles are transformed into circles, where lines are considered as limiting cases of circles (radius — oc). 
Points of the interior of the unit circle become exterior points and conversely (see Fig. 14.11). The 
point z — 0 is transformed into w — oo. The points z — —1 and z = 1 are fixed points of this conformal 
mapping. The orthogonal net of the transformation (14.12) is shown in Fig. 14.9. 

Remark: In general a geometric transformation is called inversion with respect to a circle with radius 
r, when a point P> with radius rz inside the circle is transformed into a point P, on the elongation of 


— MM 
the same radius vector OP, outside of the circle with radius OP; — ri — r?/ra. Points of the interior 
of the circle become exterior points and conversely. 


y 













a 
e 











Figure 14.10 Figure 14.11 


3. Linear Fractional Function 
For the conformal mapping given in the form of a linear fractional function 
az +b 


KE cz+d 


(a, b, c, d complex, const; bc — ad # 0; c £ 0) (14.13a) 
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the transformation can be performed in three steps: 


a) Linear function: wy, = cz + d. 





b) Inversion: w = u (14.13b) 
: . à | bc— ad 
c) Linear function: w =- + — wz. 
Ὁ ο 


Circles are transformed again into circles (circular transformation), where straight lines are considered 
as limiting cases of circles with radius r — oo. Fixed points of this conformal mapping are the both 
points satisfying the quadratic equation 
az+b 

cz+d° 

If the points 21 and 22 are inverses of each other with respect to a circle Kı of the z plane, then their 
images u4 and wz in the w plane are also inversions of each other with respect to the image circle Ka 
of Kj. 

The orthogonal net which has the orthogonal Cartesian net as its image is represented in Fig. 14.12. 


2 Ξ 





(14.14) 








y 
x 
Figure 14.12 Figure 14.13 Figure 14.14 

4. Quadratic Function 
The conformal mapping described by a quadratic function 

w=2 (14.15a) 
has the form in polar coordinates and as a function of x and y: 

w= pre’, (14.15b) w =u +iv = r? — y? + diay. (14.15c) 


It is obvious from the polar coordinate representation that the upper half of the z plane is mapped onto 
the whole w plane, i.e., the whole image of the z plane will cover twice the whole w plane. 

The representation in Cartesian coordinates shows that the coordinate lines of the w plane, u = const 
and v = const, come from the orthogonal families of hyperbolas z? — y? = u and 2xy = v of the z plane 
(Fig. 14.13). 

Fixed points of this mapping are z = 0 and z = 1. This mapping is not conformal at z = 0. 

5. Square Root 


The conformal mapping given in the form as a square root of z, 


w=V2, (14.16) 
transforms the whole z plane whether onto the upper half of the w plane or onto the lower half of it, 
i.e., the function is double-valued, i.e., to every value of z (z 4 0) belong two values of w (see 1.5.3.6, 
p. 38). The coordinate lines of the w plane come from two orthogonal families of confocal parabolas 
with the focus at the origin of the z plane and with the positive or with the negative real half-axis as 
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their axis (Fig. 14.14). 

Fixed points of the mapping are z = 0 and z = 1. The mapping is not conformal at z = 0. 
6. Sum of Linear and Fractional Linear Functions 

The conformal mapping given by the function 


k 1 
ip (z + J (k areal constant, k > 0) (14.17a) 
z 


can be transformed by the polar coordinate representation of z = p e? and by separating the real and 
imaginary parts according to (14.8): 


k 1 k 1 
=, (» + *) cosp, U= 5 (» — *) sin y. (14.17b 
p 2 p 
Circles with p — po — const of the z plane (Fig. 14.15a) are transformed into confocal ellipses 
u? v k 1 k 1 
++ 5=1 with a== — b—-lpo— — 14.17 
di + pg with a=> (n^ ~) ; 5 |Po . ( ο 











in the w plane (Fig. 14.15b). The foci are the points +k of the real axis. For the unit circle with 
p = po = lone gets the degenerated ellipse of the w plane, the twice overrunning segment (—k, +k 
of the real axis. Both the interior and the exterior of the unit circle are transformed onto the entire w 
plane with the cut (—k, +k), so its inverse function is double-valued: 


wt Vee 


Ra s (14.17d 
The lines o — qo of the z plane (Fig. 14.15c) become confocal hyperbolas 

u v ] ; 

— ——5,—1 with o-kcosqo, B=ksing (14.17e 

o2 p? 





with foci tk (Fig. 14.15d). The hyperbolas corresponding to the coordinate half-axis of the z plane 





T 3 . ] : ' ; 
(e =0, 5.7, στ) are degenerate in the axis u — 0 (v axis) and in the intervals (—oo, —k) and (k, oo 


of the real axis running there and back. 


y ν γ 
=o 
©)): 
x u x 
a) b) c) 


Figure 14.15 








7. Logarithm 


The conformal mapping given in the form of the logarithm function 














w=Lnz (14.18a) 
has the form for z given in polar coordinates: 

u=Inp, v= +2kr (k= Oye ae? cae). (14.18b) 
One can see from this representation that the coordinate lines u = const and v = const come from 


concentric circles around the origin of the z plane and from rays starting at the origin of the z plane 
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Figure 14.15 Figure 14.16 


(Fig. 14.16). The isometric net is a polar net. 

The logarithm function Ln z is infinitely many valued (see (14.74c), p. 758). 

Restricting the investigation to the principal value In z of Lnz (—r < v < +r), then the whole z plane 
is transformed into a stripe of the w plane bounded by the lines v = +7, where v = 7 belongs to the 
stripe. 





8. Exponential Function 
The conformal mapping given in the form of an exponential function (see also 14.5.2, 1., p. 758) 


w = e? (14.19a) 
has the form in polar coordinates: 

w = pe”. (14.19b) 
From z = x + iy follows: 

ρξ ε and p=y. (14.19c) 


If y changes from —7 to +r, and x changes from —oo to --oo, then p takes all values from 0 to oo and 
ib from —7 to 7. A 27 wide stripe of the z plane, parallel to the x-axis, will be transformed into the 
entire w plane (Fig. 14.17). 





Figure 14.17 


9. The Schwarz-Christoffel Formula 
By the Schwarz-Christoffel formula 


w 


dt 
po / (£ — wi (E — σης 





+O (14.20a) 


the interior of a polygon of the z plane can be mapped onto the upper half of the w plane. The polygon 
has n exterior angles o47, o27,..., o47 (Fig. 14.18a,b). The points of the real axis in the w plane 
assigned to the vertices of the polygon are denoted by w; (i — 1,..., n), and the integration variable is 
denoted by t. The oriented boundary of the polygon is transformed into the oriented real axis of the w 
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plane by this mapping. For large values of |¢|, the integrand behaves as 1/t? and is regular at infinity. 
Since the sum of all the exterior angles of an n-gon is equal to 27, consequently 


Σταν --32. (14.200) 





Figure 14.18 


'The complex constants C1 and C5 yield a rotation, a stretching and a translation; they do not depend 
on the form of the polygon, only on the size and the position of the polygon in the z plane. 

"Three arbitrary points w;, w», ws of the w plane can be assigned to three points 21, 2ο, 23 of the polygon 
in the z plane. Assigning a point at infinity in the w plane, i.e., w; = +00 to a vertex of the polygon 
in the z plane, e.g., to z = 2, then the factor (t — w,)™ is omitted. If the polygon is degenerated, 
e.g., a vertex is at infinity, then the corresponding exterior angle is 7, so ax = 1, i.e., the polygon is a 
half-stripe. 





Í B d } A ΄΄ P 
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Figure 14.19 


W A: Mapping of a certain region of the z plane (shaded region in Fig. 
14.19a): Considering 5; a, — 2 one chooses three points as the table shows 
(Fig. 14.19a,b). The formula of the mapping is: 







A 
ω B 
z—-0, Í — = 20; (vw — arctan Vw) = im (vw — arctan vw) i Ὁ 


t4 1)t12 
A2. | 
: 0 (-1 + pe^) ipe” dp 
To determinate C4 one substitutes t = pe? — 1, giving id = C; lim / A ese 
P> Jr pe 
1 
ie, Gy i. 
π 


For the constant C follows Cy = 0, considering that the mapping assigns *z = 0 > w = 0”. 




















HB: Mapping of a rectangle. Let the vertices of the rectangle be z214 = ΕΚ, 223 = +K +ik’. The 
points z, and zg should be transformed into the points wı = 1 and w = 1/k ( 0 « k « 1) of the real 
axis, 24 and 23 are reflections of z, and 22 with respect to the imaginary axis. a ding to the Schwarz 
reflection principle (see 14.1.3.3, p. 741) they must correspond to the points w4 = —1 and w3 = —1/k 
(Fig. 14.20a,b). So, the mapping formula for a rectangle (a1 — a9 — a3 — a4 — 1/2) of the position 
w It w dt 
sketched above is: z = C1 [ : σι [ . The 
J wt ene ang) [ay (2 
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Figure 14.20 


point z = 0 has the image w — 0 and the image of z — iK is w = oo. With Cı = 1/k follows 
. 5 dt [ dv 

© Jo Jae) ο v1- Ks) 
F(q, k) is the elliptic integral of the first kind (see 8.1.4.3, p. 490). 

For the constant C5 one gets C5 — 0, considering that the mapping assigns "^ z — 0 — w — 0". 


14.1.3.3 Schwarz Reflection Principle 


1. Statement 

Suppose f(z) is an analytic complex function in a domain G, and the boundary of G contains a line 
segment gı. If the function is continuous on g; and it maps the line g; into a line gj, then the points 
lying symmetric with respect to gı are transformed into points lying symmetric with respect to gj 
(Fig. 14.21). 


F(y,k) (substituting t = sin, w = sin ọ). 








source source 


oundar 





ayy 
SM sink 





pt 
Figure 14.21 Figure 14.22 Figure 14.23 


2. Application 

'The application of this principle makes it easier to perform calculations and the representations of 
plane regions with straight line boundaries: If the line boundary is a stream line (isolating boundary 
in Fig. 14.22), then the sources are reflected as sources, the sinks as sinks and curls as curls with 
the opposite sense of rotation. If the line boundary is a potential line (heavy conducting boundary in 
Fig. 14.23), then the sources are reflected as sinks, the sinks as sources and curls as curls with the 
same sense of rotation. 


14.1.3.4 Complex Potential 
1. Notion of the Complex Potential 
A ficld V — V (a, y) is considered in the a, y plane for the zero-divergence and the irrotational case 


with continuous and differentiable components V,(, y) and V,(x, y) of the vector V. 
OV, | OV, 


a) Zero-divergence field with div V — 0, ie, E" + 7 0: That is the integrability condition 
T y 
for the differential equation expressed with the field or stream function V (x, y) 
y V 
di — —V,dr--V,dy —0, (1421a) andthen V,— = 3 Vy- a : (14.21b) 
y z 
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For two points P, , P; of the field V the difference W(P) — W(P,) is a measure of the flux through a 
curve connecting the points P, and P», in the case when the whole curve is in the field. 
: : > >. OV, OV, : ; - το 

b) Irrotational field with rot V = 0, i.e., Je — "v — 0: That is the integrability condition for the 
z y 

differential equation with the potential function P(x, y) 

op . OD 

—, V; = —. 

Ox Oy 

If the field is free of sources and rotations, then the functions ® and V satisfy the Cauchy-Riemann 

differential equations (see 14.1.2.1, p. 732) and both satisfy the Laplace differential equation (A® = 

0, Av — 0). Combining the functions and V into the analytic function 


W = f(z) = (x,y) +i (x,y), (14.23) 


then this function is called the complex potential of the field V. 


d® = V, dx + V} dy — 0, (14.22a) and then V, = (14.225) 


Then —é(z, y) is the potential of the vector field V in the sense of the usual notation in physics and 
electrotechnics (see 13.3.1.6, 2., p. 721). The level lines of V and 9 form an orthogonal net. For the 


derivation of the complex potential and the field vector V the following equations hold: 


dW dd 00 , aw 


= = f(z) = V, + iV,. 4. 
dz Ox συ Tec thy F(Z) = Va +iVy (14.24) 





2. Complex Potential of a Homogeneous Field 
The function 
W =az, (14.25) 


with real a is the complex potential of a field whose potential lines are parallel to the y-axis and whose 
direction lines are parallel to the z-axis (Fig. 14.24). A complex a results in a rotation of the field 
(Fig. 14.25). 
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3. Complex Potential of Source and Sink 
The complex potential of a field with a strength of source s > 0 at the point z = zp has the equation: 


s 
W= s ln(z — zo) - (14.26) 


A sink with the same intensity has the equation: 
s 
W = —— ln(z — zo). (14.27) 
2π 


The direction lines run away radially from z — 2o, while the potential lines are concentric circles around 
the point 29 (Fig. 14.26). 
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4. Complex Potential of a Source-Sink System 
One gets the complex potential for a source at the point z; and for a sink at the point 25, both having 
the same intensity, by superposition: 


5 z= zi 





(14.28) 


2T z—29 
The potential lines = const form Apollonius circles with respect to z1 and 29; the direction lines 
V = const are circles through 2; and 22 (Fig. 14.27). 








Figure 14.26 Figure 14.27 


5. Complex Potential of a Dipole 
The complex potential of a dipole with dipole moment M > 0 at the point zọ, whose axis encloses an 
angle o with the real axis (Fig. 14.28), has the equation: 

Μος 


W2———. 
2n(z — zo) 


(14.29) 
6. Complex Potential of a Curl 
If the intensity of the curl is |I| with real I" and its center is at the point zo, then its equation is: 
I 
W = — ln(z — zo). (14.30) 
27i 
In comparison with Fig. 14.26, the roles of the direction lines and the potential are interchanged. For 
complex /' (14.30) gives the potential of a source of curl, whose direction and potential lines form two 
families of spirals orthogonal to each other (Fig. 14.29). 
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14.1.3.5 Superposition Principle 


1. Superposition of Complex Potentials 

A system composed of several sources, sinks, and curls is an additive superposition of single fields, i.e., 
one gets its function by adding their complex potential and stream functions. Mathematically this is 
possible because of the linearity of the Laplace differential equations A® = 0 and AW = 0. 

2. Composition of Vector Fields 

1. Integration Beside addition from complex potentials new fields can be constructed also by inte- 
gration with the application of weight functions. 

H A curl-covering is given on a line segment l with density function o(s). The derivative of the complex 
potential is given by an integral of Cauchy-type (see 14.2.3, p. 748) 


dW 1 o(s)ds | 1 [ος I 
dz 3πὶ 1 z — C(s) πας ᾱ-ς a, (a31) 


where ¢(s) is the complex parameter representation of the curve / with arc length parameter s. 

2. Maxwell Diagonal Method If one wants to compose the superposition of two fields with the 
potentials 94 and z, then one can draw the potential line figures |[®,]] and [[®2|] so that the value of 
the potential changes by the same amount h between two neighboring lines in both systems. Then the 
lines are directed so that the higher 9 values are on the left-hand side. The lines lying in the diagonal 
direction to the net elements formed by [[®;]] and |[®2]] give the potential lines of the field [[]], whose 
potential is  — Φι + Dy or P = 9, — Dg. One gets the figure of {[; + ©] if the oriented sides of the net 
elements are added as vectors (Fig. 14.30a), and one gets the figure of [[®; — 82]] by subtracting them 
(Fig. 14.30b). The value of the composed potential changes by h at transition from one potential line 
to the next one. 

E Vector lines and potential lines of a source and a sink with an intensity quotient |e;|/|e2| = 3/2 (see 
Fig. 14.31a,b). 
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14.1.3.6 Arbitrary Mappings of the Complex Plane 
A function 

w= f(z=a+iy) = u(x, y) +iv(z,y) (14.32a) 
is defined if the two functions u = u(x, y) and v = v(x, y) with real variables are defined and known. 


The function f(z) must not be analytic, as it was required i in conformal mappings. The function w 
maps the z plane into the w plane, i.e. it assigns to every point z, a corresponding point wy. 


a) Transformation of the Coordinate Lines 





y =c — u =u(z,c), v=v(2,c), x is a parameter; 

T= —u=ulc,y), v=v(c1,y), y is a parameter. (14.32b) 
b) Transformation of Geometric Figures Geometric figures as curves or regions of the z plane are 
usually transformed into curves or regions of the w plane: 


c=x(t), y=y(t) —  u-u(z(t),u(t), v —v(x(t),u(t)), tis a parameter. (14.32c) 

E For u = 2x + y, v = x 4- 2y, the lines y — c are transformed into u — 2r + c, v = x 4- 2c, hence into 
u 

the lines v = 3 + 3 The lines z — c, are transformed into the lines v — 2u — 3c, (Fig. 14.5). The 
shaded region in Fig. 14.5a is transformed into the shaded region in Fig. 14.5b. 
c) Riemann Surface Getting the same value w for several different z of the mapping w = f(z), 
then the image of the function consists of several planes “lying on each other”. Cutting these planes 
and connecting them along a curve gives a many-sheeted surface, the so-called many-sheeted Riemann 
surface (see [14.13]). This correspondence can be considered also in an inverse relation, in the case of 
multiple-valued functions as, e.g., the functions Yz, Ln z, Arcsin z, Arctan z. 


Bow = 2: While z = re? overruns the entire z plane, i.e., 

0Ο: φ < 2m, the values of w = ge” = re”, cover twice 
the w plane. One can imagine to place two w planes one on 
each other, and cut and connect them along the negative 
real axis according to Fig. 14.32. This surface is called 
the Riemann surface of the function w = 2?. The zero 
point is called a branch point. The image of the function e* 


Figure 14.32 (see 14.69) is a Riemann surface of infinitely many sheets. 





(In many cases the planes are cut along the positive real axis. It depends on whether the principal value 
of the complex number is defined for the interval (—7, +7] or for the interval [0, 2π).) 


14.2 Integration in the Complex Plane 


14.2.1 Definite and Indefinite Integral 
14.2.1.1 Definition of the Integral in the Complex Plane 


y 1. Definite Complex Integral 

Suppose f(z) is continuous in a domain G, and the curve C 
is rectifiable, it connects the points A and B, and the whole 
curve is in this domain. The curve C is decomposed between 
the points A and B by arbitrary division points z; into n sub- 
arcs (Fig.14.33). 

A point ¢; is chosen on every arc segment and forms the sum 








T. f(G)Az with Azi—z-—z.a. (14.33a) 


Figure 14.33 
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If the limit 
Jim Σ f(G) Azi (14.33b) 


exists for Az; — 0 and n — oo independently of the choice of the points z; and G;, then this limit is 
called the definite complex integral 


I= 1 f(z)dz — (C) / f(z)dz (14.33c) 
~ A 
AB 


along the curve C between the points A and B. The value of the integral usually depends on the path 
of the integral. 

2. Indefinite Complex Integral 

If the definite integral is independent of the path of the integral (see 14.2.2, p. 747), then holds: 


F(z) = EO dz+C with F'(z)- f(z). (14.34) 
Here C is the integration constant which is complex, in general. The function F(z) is called an indefinite 
complex integral. 
The indefinite integrals of the elementary functions of a complex variable can be calculated with the 
same formulas as the integrals of the corresponding elementary function of a real variable. 


mA: [sinzdz=—cosz+C. mB: fetdz=e +C. 


3. Relation between Definite and Indefinite Complex Integrals 
If the function f(z) is analytic (see 14.1.2.1, p. 732) and has an indefinite integral, then the relation 
between its definite and indefinite integral is 


B 
1 8) ἀξ -- Jro dz = F (zp) — F (z4). (14.35) 
~ A 
AB 


14.2.1.2 Properties and Evaluation of Complex Integrals 


1. Comparison with the curvilinear integral of the second type 

The definite complex integral has the same properties as the curvilinear integral of the second type (see 
8.3.2, p. 517): 

a) Reversing the direction of the path of integration the integral changes its sign. 

b) Decomposing the path of integration into several parts the value of the total integral is the sum of 
the integrals on the parts. 

2. Estimation of the Value of the Integral 

If the absolute value of the function f(z) does not exceed a positive number M at the points z of the 


path of integration AB which has the length s, then: 
[ f@az|<Ms if Af) s Μ. (14.36) 
AB 
3. Evaluation of the Complex Integral in Parametric Representation 


If the path of integration AB (or the curve C) is given in the form 
x= x(t), y = y(t) (14.37) 
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where x and y are differentiable functions of t and the t values for the initial and endpoint are t4 and 
tp, then the definite complex integral can be calculated with two real integrals. Separating the real and 
the imaginary parts of the integrand the integral is 


(ο) {Κο dz — f (om —vdy) +i f(vde+ u dy) 
A A A 
= 2 [u(£)z (t) — v(£)y (£)] dt +i J [v(£)a (£) + u(t)y!(t)] dt (14.38a) 
with f(z) =u(s,y)+iv(z,y), 2z=axt+iy. (14.38b) 


B 

The notation (C) | f(z) dz means that the definite integral is calculated along the curve C between 
A 

the points A and B. The notation / f(z) dz is also often used, and has the same meaning. 


AB 
π!- 1 (z — zo)" dz (n € Z). Let the curve C be a circle around the point zo with radius ro: 
(C) 


x = £o + rocost, y = yo + rosint with 0 < t < 2r. For every point z of the curve C: z = x +i y = 
Zo + ro(cost -- isint), dz — ro(— sint + icost) dt. After substituting these values and transforming 


2n 
according to the de Moivre formula follows: I = r" / (cos nt + isin nt)(— sin t + icost) dt 
0 


0 for nz —1, 


27 
= uf [i cos(n + 1)t — sin(n + 1)t] dt = { aba]. 


4. Independence of the Path of Integration 

Suppose a function f(z) of a complex variable is defined in a simply connected domain. The integral 
(14.33c) of the function can be independent of the path of integration, which connects the fixed points 
A(z) and B(zg). A sufficient and necessary condition is that the function f(z) is analytic in this 
domain, i.e., u and v satisfy the Cauchy-Riemann differential equations (14.4, p. 732). Then also the 
equality (14.35) is valid. If a domain is bounded by a simple closed curve, then the domain is simply 
connected. 

5. Complex Integral along a Closed Curve 

Suppose f(z) is analytic in a simply connected domain G. Integrating the function f(z) along a closed 
curve C which is the boundary of this domain, the value of the integral according to the Cauchy integral 
theorem is equal to zero (see 14.2.2, p. 747): 


(ΟΦ d: - o. (14.39) 


If f(z) has singular points in this domain, then the integral is calculated by using the residue theorem 
(see 14.3.5.5, p. 754), or by the formula (14.38a). 


; 1 : . : t 
E The function f(z) = ——, with a singular point at z = a has an integral value for the closed curve 
z—a 


C directed counterclockwise around a (Fig.14.34) (04 — 2ni Res/ (z)|, 24 — 27i. 
z2—-a 

14.2.2 Cauchy Integral Theorem 

14.2.2.1 Cauchy Integral Theorem for Simply Connected Domains 


If a function f(z) is analytic in a simply connected domain G, then two equivalent statements hold: 
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a) The integral is equal to zero along any closed curve C inside of G: 
(0) f f) 2 = 0. (14.40) 


B 
b) The value of the integral / f(z) dz is independent of the curve connecting the points A and B and 
JA 


running inside of G, i.c., it depends only on A and B. 
This is the Cauchy integral theorem. 


14.2.2.2 Cauchy Integral Theorem for Multiply Connected Domains 

If C, C1, C2, .. ., C, are simple closed curves such that the curve C encloses all the C, (v = 1,2,...,n), 
none of the curves C, encloses or intersects another one, and furthermore the function f(z) is analytic 
in a domain G which contains the curves and the region between C and C,, i.e., at least the region 
shaded in Fig.14.35, then 


joe j f(z)dz+ j f()dz-... j f(z) dz, (14.41) 
(C) (C1) (C2) (Cn) 


if the curves C, Ci, ..., Cy are oriented in the same direction, e.g., counterclockwise. 
This theorem is useful for the calculation of an integral along a closed curve C,, if it also encloses singular 
points of the function f(z) (see 14.3.5.5, p. 754). 





Figure 14.34 Figure 14.35 Figure 14.36 


dz, where C is a curve enclosing the origin and the point z = —1 


) 


(Fig. 14.36). Applying the Cauchy integral theorem, the integral along C is equal to the sum of the 
integrals along C4 and C5, where Cj is a circle around the origin with radius r4 — 1/2 and C3 is a circle 


m 
Hi Calculate the integral f DEI. 
i 2(24-1 





around the point z = —1 with radius ro = 1/2. The integrand can be decomposed into partial fractions. 
z—1 2d 2dz d dz 
Then follows: f dz — j E | f f a f ~ = 0 + απὶ-- 9πὶ -- 0 -- 9πὶ. 
J z(z+1) z+1 z+1 J z J z 
(C) (C1) (C2) (C1) (C2) 


(Compare the integrals with the example in 14.2.1.2, 3., p. 747.) 


14.2.3 Cauchy Integral Formulas 
14.2.3.1 Analytic Function on the Interior of a Domain 


If f(z) is analytic on a simple closed curve C and on the simply connected domain inside it, then the 
following representation is valid for every interior point z of this domain (Fig. 14.37): 


(9) 1. 4 10) 


^ 2mi -z 
(0) e 





ας (Cauchy integral formula), (14.42) 
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where Ç traces the curve C counterclockwise. With this formula, the values of an analytic function in 
the interior of a domain are expressed by the values of the function on the boundary of this domain. 
The existence and the integral representation of the n-th derivative of the function analytic on the 
domain G follows from (14.42): 


(n)(,) — nl f FO 7 

f(z) = oni f (Cz ας. (14.43) 
(€) 

Consequently, if a complex function is differentiable, i.e., it is analytic, then it is differentiable arbi- 

trarily many times. In contrast to this, in the real case differentiability does not include repeated 

differentiability. 

The equations (14.42) and (14.43) are called the Cauchy integral formulas. 

14.2.3.2 Analytic Function on the Exterior of a Domain 

If a function f(z) is analytic on the entire part of the plane outside of a closed curve of integration 

C, then the values and the derivatives of the function f(z) at a point z of this domain can be given 

with the same Cauchy formulas (14.42), (14.43), but the orientation of the curve C is now clockwise 

(Fig. 14.38). 

Also certain real integrals can be calculated with the help of the Cauchy integral formulas (see 14.4, 

p. 754). 

y 





la X 
Figure 14.37 Figure 14.38 


14.3 Power Series Expansion of Analytic Functions 


14.3.1 Convergence of Series with Complex Terms 


14.3.1.1 Convergence of a Number Sequence with Complex Terms 

An infinite sequence of complex numbers 24, z2,...,2n,.-- has a limit z (z = lim z,) if for every 
noo 

arbitrarily given positive € there exists an mo such that the inequality |z — z,| < £ holds for every 

n > no, i.e., from a certain no the points representing the numbers 2p, 2n41,... are inside of a circle 

with radius € and center at z. 


W If the expression { (/a) means the root with the smallest non-negative argument, then the limit 
jim, { /a} = 1 is valid for arbitrarily complex a 4 0 (Fig. 14.39). 


14.3.1.2 Convergence of an Infinite Series with Complex Terms 
A series a1 d- a3 d- :: - - Q4, 4-: -: with complex terms a; converges to the number s, the sum of the series, 
if the sequence of the partial sums s, with 

Sj — 044-03 +an (n=1,2,...) (14.44) 
converges. Connecting the points corresponding to the numbers s, = a, + d2 +---+ dy in the z plane 
by a broken line, then convergence means that the end of the broken line approaches the point s. 

- 5 8 sS - 5. p . 

., πμ τω. (16. 14.40). 
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Figure 14.39 Figure 14.40 


A series is called absolutely convergent (sce W.B), if the series of absolute values of the terms |ai| + 
|42| + |as| +--+ is also convergent. The series is called conditionally convergent (see Ml A) if the series 
is convergent but not absolutely convergent. If the terms of a series are functions f;(z), like 

Fale) + falz) Ht fal) t, (14.45) 


then its sum is a function defined for the values z for which the series of the function values is convergent. 





14.3.1.3 Power Series with Complex Terms 


1. Convergence 
A power series with complex coefficients has the form 
P(z— zo) = ao + a(z — zo) + a2(z — 20)? - + an(z — zo)” ++, (14.46a) 


where Zp is a fixed point in the complex plane and the coefficients a, are complex constants (which can 
also have real values). For z9 = 0 the power series has the form 





Ρ(2) -- αρ + az + ag2* +--+ + an2"4+-°°. (14.46b) 
If the power series P(z — zo) is convergent for a value z4, then it is absolutely and uniformly convergent 
for every z in the interior of the circle with radius r — |z1 — zo| and center at zo. 
2. Circle of Convergence 
The limit between the domain of convergence and the domain of divergence of a complex power series 
is a uniquely defined circle. One determines its radius just as in the real case, if the limits 


1 y 
r= —— or r — lim 
lim {lanl icd 
πι ορ 





(14.47) 








Qn+1 


exist. If the series is divergent everywhere except at z = zo, then r = 0; if it is convergent everywhere, 
then r = oo. The behavior of the power series on the boundary circle of the domain of convergence 
should be investigated point by point. 


oo z^ 
W The power series P(z) — $X — with radius of convergence r — 1 is divergent for z — 1 (harmonic 
n 


n=1 
series) and convergent for z = —1 (according to the Leibniz criteria for alternating series (see 7.2.3.3,1., 
p. 463)). This power series is convergent for all further points of the unit circle |z| = 1 except the point 
s=; 





3. Derivative of Power Series in the Circle of Convergence 

Every power series represents an analytic function f(z) inside of the circle of convergence. One gets the 
derivative by a term-by-term differentiation. The derivative series has the same radius of convergence 
as the original one. 
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4. Integral of Power Series in the Circle of Convergence 
z 
The power series expansion of the integral / f (C) dG can be obtained by a term-by-term integration 
20 


of the power series of f(z). The radius of convergence remains the same. 


14.3.2 Taylor Series 


Every function f(z) analytic in a domain G can be expanded uniquely into a power series of the form 
οο 
f(z) 2 anlz - zo)" (Taylor series), (14.48a) 
n=0 
for any zo in G, where the circle of convergence is the greatest circle around zo which belongs entirely 
to the domain G (Fig. 14.41). The coefficients a, are complex numbers in general; for them one gets: 
ΟΡ 
dn = P9) (14.48b) 
n! 
The Taylor series can be written in the form 
F(z F" (zo f (zo 
£02) = so) + Ea a) ο yr pO 
1! 2! n! 
Every power series is the Taylor expansion of its sum function in the interior of its circle of convergence. 


(z—z9) 4. (14.48c) 


W Examples of Taylor expansions are the series representations of the functions e^, sin z, cos z, sinh z, 
and cosh z in 14.5.2, p. 758. 





Figure 14.41 Figure 14.42 Figure 14.43 


14.3.3 Principle of Analytic Continuation 


Let consider the case when the circles of convergence Ko around zo and Kı around 2; of the two power 
series 


fo(z) = 5 an(z — zo)” and fi(z) = Y b.(z — 21)” (14.494) 
n=0 n=0 


have a certain common domain (Fig. 14.42) and in this domain they are equal: 


folz) = file). (14.49b) 
Then both power series are the Taylor expansions of the same analytic function f(z), belonging to the 
points zo and z;. The function fi(z) is called the analytic continuation into K of the function fo(z) 
defined only in Ko. 
oo 
Wl The geometric series fo(z) — Y; z" with the circle of convergence Ko (ro — 1) around zo — 0 and 
n=0 





1 Sfz-i\” 
filz) = i-i Σ G =) with the circle of convergence Kı (rı = v2) around z1 = i have the same 
—i —i 


n=0 
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analytic function f(z) = 1/(1 — z) as their sum in their own circle of convergence, consequently also 
on the common part of them (doubly shaded region in Fig. 14.42) for z Z 1. So, fi(z) is the analytic 
continuation of fo(z) from Ko into Ky (and conversely). 


14.3.4 Laurent Expansion 


Every function f(z), which is analytic in the interior of a circular ring between two concentric circles 
with center zo and radii r; and rs, can be expanded into a generalized power series, into the so-called 
Laurent series: 








. oo Ln Lu. Q—k N QA—k4+1 es 
f(z) = P 20) + (= z)" ; (z — gg) + 
+ σσ + ag + ai(z — 2%) + a2(z — 20)” +e + alz zo)" Te (14.502) 
-- 20 


The coefficients a, are usually complex and they are uniquely defined by the formula 














1 FO . 

On = n j (εκατ), (14.500) 
(€) 

where C denotes an arbitrary closed curve which is in the circular ring r4 « |z| « ro, and the circle 

with radius r is inside of it, and its orientation is counterclockwise (Fig. 14.43). If the domain G of 

the function f(z) is larger than the circular ring, then the domain of convergence of the Laurent series 

is the largest circular ring with center 2 lying entirely in G. 


W The Laurent series expansion of the function f(z) — is determined around 2 = 0 in 


oe S 
(z — 1)(z — 2) 


< 2 where f(z) is analytic. First the function f(z) is decomposed into partial 








the circular ring 1 < 


1 1 
fractions: f(z) — pan ee Since |1/z| < land |z/2| < 1 holds in the considered domain, the two 
z-2 z= 


terms of this decomposition can be written as the sums of geometric series being absolutely convergent 
in the entire circular ring 1 < |z| < 2. One gets: 


2 





1 1 1 eed doeet 
τση I ΠΕ 27 2t 
z(1—- 241. n=1 n=0 

2 9 —— GC 
|z| 21 |2| «2 

x e for n = —1,—2,..., 

s: απζ with a4,-—4 -1 I 
AG gayi for n —0,1,2,.... 


14.3.5 Isolated Singular Points and the Residue Theorem 
14.3.5.1 Isolated Singular Points 


If a function f(z) is analytic in the neighborhood of a point zo but not at the point zo itself, then zo is 
called an isolated singular point of the function f(z). If f(z) can be expanded into a Laurent series in 
the neighborhood of zo 
oo 
fi) = YS an(z- 2)", (14.51) 
n=—oo 

then the isolated singular point can be classified by the behavior of the Laurent series: 
1. If the Laurent series does not contain any term with a negative power of (z — 2), i.e., an = 0 for 
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n < 0 holds, then the Laurent series is a Taylor series with coefficients given by the Cauchy integral 
formula 


1 e 
an = z f (6-20) f()dc- 
Ti. 
(K) 
In this case, the function f(z) itself is either analytic at the point zo and f(z9) — ao or zo is a removable 
singularity. 


f(a) 


n! 


(n — 0,1,2,... .). (14.52) 


2. If the Laurent series contains a finite number of terms with negative powers of (z — 29), i.e., aq, 4 0, 
an = O forn < m « 0, then zo is called a pole, a pole of order m, or a pole of multiplicity m. Multiplying 
by (z — zo), and not by any lower power, then f(z) is transformed into a function which is analytic at 
zo and in its neighborhood. 


1 1 
WB /(:)-— J ( + =) has a pole of order one at z = 0. 


3. If the Laurent series contains an infinite number of terms with negative powers of (z — 29), then zo 
is an essential singularity of the function f(z). 
Approaching a pole, | f(z)| tends to oo. Approaching an essential singularity, f(z) gets arbitrarily close 
to any complex number c. 

oo 


E The function f(z) = e", whose Laurent series is f(z) = X> 
z=0. E 
14.3.5.2 Meromorphic Functions 


If an otherwise holomorphic function has only isolated singularities which are poles, then it is called 
meromorphic. A meromorphic function can always be represented as the quotient of two analytic func- 
tions. 


ci has an essential singularity at 
n'z 


W Examples of functions meromorphic on the whole plane are the rational functions which have a finite 


z COS Z 
and cotz — 








. sin 
number of poles, and also transcendental functions such as tan z — 
cos sin z 


14.3.5.3 Elliptic Functions 


Elliptic functions are double periodic functions whose singularities are poles, i.e., they are meromorphic 
functions with two independent periods (see 14.6, p. 762). If the two periods are w; and we, which are 
in a non-real relation, then 














f(z + mw + nw) = f(z) (m,n = 0,41, +2,...;Im (=) # 0). (14.53) 
We 
The range of f(z) is already attained in a primitive period parallelogram with the points 0, w1, w1 + 
W2, W2. 


14.3.5.4 Residue 


With zo as an isolated singularity of the function f(z) the coefficient a. ; of the power (z — zo)- | in the 
Laurent expansion of f(z) valid in the neighborhood of zo is called the residue of the function f(z) at 
the point zo. According to (14.50b) 


1 
a1 = Res f(z)hua — z— f FO dt (14.54a) 
(K) 
holds. The residue belonging to a pole of order m can be calculated by the formula 


απ] 
aci c Res (boa, = Be Ga ideni 


(FG) — 20)". (14.54b) 
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If the function f(z) can be represented as a quotient f(z) = y(z)/(z), where the functions y(z) and 
p(z) are analytic at the point z = zo and z is a simple root of the function v(z), i.e., (zo) = 0 and 
¥' (zo) # 0 holds, then the point z = zp is a pole of order one of the function f(z). It follows from 
(14.54b) that 








5) (zo) 

Res = ; 14.54c 
[5]... 50 
If z is a root of multiplicity m of the function ψ(α), i.e., (zo) — w'(zo) — -« — v-9»(z) — 


0, Y™ (z0) 4 0 holds, then the point z = zo is a pole of order m of f(z). 
14.3.5.5 Residue Theorem 


With the help of residues one can calculate the integral of a function along a closed curve enclosing 
isolated singular points (Fig. 14.44). 

If the function f(z) is single valued and analytic in a simply connected domain G except at a finite 
number of points zo, 21, 22, ... , Zn, and the domain is bounded by the closed curve C, then the value of 
the integral of the function along this closed curve in a counterclockwise direction is the product of 27i 
and the sum of the residues in all these singular points: 


ϕ Fe) d = mi Y Res f(e) [oca (14.55) 
(K) z 





E The function f(z) = e7/(2? + 1) has poles of order one at 21,2 = +i. The corresponding residues 
have the sum sin 1. If K is a circle around the origin with radius r > 1, then 


e? fe g^ dé ei ao 
j agi te = mi (SE | 2) -m(s x) — 2misin 1. 
(K) 
y 


Udi 


Figure 14.44 Figure 14.45 
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14.4 Evaluation of Real Integrals by Complex Integrals 
14.4.1 Application of Cauchy Integral Formulas 


The value of certain real integrals can be calculated with the help of the Cauchy integral formula. 


W The function f(z) — e^, which is analytic in the whole z plane, can be represented with the Cauchy 
integral formula (14.42), where the path of integration C' is a circle with center z and radius r. The 
equation of the circle is ¢ = z + re’?. From (14.43) follows 


! JC ! —2& qure) I p2 
ος -- Τε f Ξ dc E AE. irci? dy LOW “pet cos ytir sin p—ing dy so that 
2πὶ . ; (¢ — z)n QriJy=o rrttelv(n+) 2ar” Jo : 





27r" 


n! 


jT COS (p 


e [sin(r sin e — ng)] dg. 





(σ 
2π λος 2π 2π 
= / οἳ C95 eti(rsin p—ny) dy = 1 e" 995? [cos(r sing =! ny)| dy + if 
0 0 0 
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on Qar” 
Comparing real and imaginary part gives / €" 8? cos(r sin y — ny) dp = —_— ; Since the imaginary 
0 n! 


part is equal to zero. 


14.4.2 Application ofthe Residue Theorem 


Several definite integrals of real functions with one variable can be calculated with the help of the 
residue theorem. If f(z) is a function which is analytic in the whole upper half of the complex plane 
including the real axis except the singular points 2, 22, ... , Zn, above the real axis (Fig. 14.45), and if 
one of the roots of the equation f(1/z) = 0 has multiplicity m > 2 (see 1.6.3.1, 1., p. 43), then 


+00 2 
/ f(x) dz — 2ri Y, Res f ()],.—... (14.56) 
Mm ici 
Bl Calculation of the int if de επ ti ΠΕ ! E. ui 
alculation of the integra. ——: The equatio =) = ——— = ———— = Ohas 
u. 10n 1€ 1ntegr n ü 3E ση at quation i ( 1 y (a? m T)? li 
1+5 
32 
a root of order six at x — 0. The function w = UF AF has a single singular point z = i in the upper 


half-plane, which is a pole of order 3, since the equation (1 4- 22)? = 0 has two triple roots at i and —i. 
The residue is according to (14.54b): 











1 1 d? [(z—ip do (z—iW d? ο. AB 
ραπ ο, Ed wats) = gat) = e+ it 
1 j +00 
follows that Res ασ] — 6(z-i) L2 ap i and with (14.56): i. f(x) dx = 
3 3 
2πὶ (-ži) =" For further applications of residue theory see, e.g., [14.12]. 


14.4.3 Application ofthe Jordan Lemma 
14.4.3.1 Jordan Lemma 


In many cases, real improper integrals with an infinite interval of integration can be calculated by 
complex integrals along a closed curve. To avoid the always recur- 
rent estimations, the Jordan lemma is used about improper inte- 
grals of the form 


/ F(A i% dz, (14.57a) 
(Cr) 
where Cg is the half-circle arc with center at the origin and with the 
radius R in the upper half of the z plane (Fig. 14.46). The Jordan 
Figure 14.46 lemma distinguishes the following cases: 





a) a > 0: If f(z) tends to zero uniformly in the upper half-plane and also on the real axis for |z| — oo 
and if a > 0 holds, then for R — oo is valid: 


1 f(z)e** dz ^ 0. (14.57b) 
(Cr) 
b) @ = 0: If the expression z f(z) tends to zero uniformly for |z| — oo, then the above statement 
(14.57) is also valid in the case a = 0. 
c) a < 0: Ifthe half-circle Cz is now below the real axis, then the statement corresponding to (14.57b) 
is also valid for a < 0. 
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d) The statement to (14.57b) is also valid if only an arc segment is considered instead of the complete 
half-circle. 

e) When C5 is a half-circle or an arc segment in the left half-plane with a 7 0, or in the right one with 
a <0, then the statement corresponding to (14.57b) is valid for the integral in the form 


I Fee” dz. (14.570) 
(CR) 
14.4.3.2 Examples of the Jordan Lemma 


1. Evaluation of the Integral / M dz (00,220). (14.58a) 


The RE complex integral is assigned to the above real E a 


R d R 
ο sin esi . Tmar LCOS ar 
JE Pee 7 da zi E rug dr fx uud (14.58b) 


even en dnm Yoga 
= 0 (odd integrand) 





; : : ze . 
The very last of these integrals is part of the complex integral f στα 4z. The curve C contains the 
J 2+a 


Cp half-circle defined above and the part of the real axis between the values Re and R (R > |al). 
The complex integrand has the only singular point in the upper half-plane z = ai. From the residue 











αεἰα zelo? zelaz m 
theorem follows: J = f 3 dz = Ēri lim 5(¢— ai)| = 27i lim, - — mie ^", hence 
J z2 +a? zai | 22 + a? zai Z + ai 
(C) 
EL R raer 
I / —— —5 dz 4 i ————5 dx — sie ?^, From lim / and from the Jordan lemma follows 
22 a? xv? + a2 Roo 
(Cr) 
oo 
zsinax «s 
[oS ae =e% (a>0, a2 0). (14.586) 
0 


Several further integrals can be evaluated in a similar way (see Table 21.8, p. 1098). 

2. Sine Integral (see also 8.2.5,1., (8.95), p. 513) 

The integral I Sm? dris called the sine integral or the integral sine 
T 


(see also 8.2.5, 1., p. 513). Analogously to the previous example, 
pig 
here the complex integral / — f f deis investigated with the curve 
Cg ^ 
Cpr according to Fig. 14.47. The integrand of the complex integral 
Figure 14.47 has a pole of first order at z = 0, so 





2π 


piè 
dx + pee dy + 1 f dz — 2mi This 
z 
Ed 


εἶδ 


I — 2nilim |Ξ 
230] z 


sin x 





R 
| — 2mi, hence J = af 


limit is evaluated as R —> o0, r — 0, where the second integral tends to 1 uniformly for r — 0 with 
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respect to q, i.c., the limiting process r — 0 can be done behind the integral sign. With the Jordan 
lemma follows: 
co , oo . 
sina sina 
2ἱ / dr --7i- 2751, hence I 
Joc f 
0 0 








dx =~. (14.59) 
x 2 

3. Step Function 

Discontinuous real functions can be represented as complex integrals. The so-called step function (see 
also 15.2.1.3, p. 774) is an example: 


1 eitz 1 for t>0, 
P()-— / © dz = 21/2 for t=0, (14.60) 
2mb 2 0 for t<0. 


The symbol —~- denotes a path of integration along the real axis (| R | oo) going round the origin 
(Fig. 14.47). 

If t denotes time, then the function (t) — cF(t — to) represents a quantity which jumps at time t = to 
from 0 through the value c/2 to the value c. It is called a step function or also a Heaviside function. It 
is used in the electrotechnics to describe suddenly occurring voltage or current jumps. 

4. Rectangular Pulse 

A further example of the application of complex integrals and the Jordan lemma is the representation 
of the rectangular pulse (see also 15.2.1.3, p. 774): 


0 for t<aandt>b 


1 JA(b—t)z 1 ci(a—t)z , 
ψίθ-π | S e-z | al) &racteb (14.61) 


g 2πὶ 8 1/2 for t 2 a and t — b. 








——— ——— 


5. Fresnel Integrals 


X To derive the Fresnel integral 
ΠΙ VA" | 1 
II / sin(z^) dr — I cos(z^) dz — 3 π/2 (14.62) 
0 0 
the integral J = / e 7 dz has to be investigated on the closed path 
0 I Rx x 
of integration shown in Fig. 14.48. According to the Cauchy inte- 
Figure 14.48 gral 


R 
theorem holds: J = Jy + [yy + yy = 0 with Ty = / e” dr, dp in fe 7 F'(cos2p+isin 2p) Hedy, 
im 0 . 1 R R i 
1ΠΙ Ξ- 63 Í e" dr = 5V2(1 +i) v sinr? dr -f cos r? «| Estimation of Ip: Since |i] = |e"| — 
R 0 0 
7/A a 7/2 
1 (7 real) holds, one gets: |Π1| € Rf eB? 0829 qu 2 e hee qo -- zl e Rose quy c 
0 4 Ja 


2 
g P ane 7/2 ano, — R? cos ak _R2 cosa, eg FE cosa ( ) 

dp zr ?dp +— - Perform- 
2 i. 2 sina. ἘΠῚ 2“ 2Rsina 0<a< 2 erform: 


1 
ing the limiting process jm I yields the values of the integrals J] and [yy]: jim = ον, jm. Πιτ 


0. The given formulas (14.62) can be obtained by separating the real and imaginary parts. 
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14.5 Algebraic and Elementary Transcendental Functions 


14.5.1 Algebraic Functions 


1. Definition 
A function which is the result of finitely many algebraic operations performed with z and maybe also 
with finitely many constants, is called an algebraic function. In general, a complex algebraic function 
w(z) can be πε in an implicit way as a polynomial, just as its real analogue 

a32" w™ + agw +- H apr rw = 0. (14.63) 
It happens s w cannot be expressed explicitly. 
2. Examples of Algebraic Functions 





ἃ ] 1 
Linear function: w = az +b. (14.64) Inverse function: w= -. (14.65) 
z 
Quadratic function: w = z?. (14.66) Square root function: w — vz? — a?. (14.67) 
: : ; z+i 
Fractional linear function: w = -. (14.68) 
z—i 


14.5.2 Elementary Transcendental Functions 

The complex transcendental functions have definitions corresponding to the transcendental real func- 
tions, just as in the case of the algebraic functions. For a detailed discussion of them see, e.g., [21.1] or 
[21.12]. 


1. Natural Exponential Function 
2 4B 


2 
alt G+ ot ate ες (14.69 


The series is absolutely convergent in the whole z plane. 
a) Pure imaginary exponent iy: This is valid according to the Euler relation (see 1.5.2.4, p. 36): 


e = cosy +isiny with e™ = —1. (14.70 
b) General case z = x + iy: 














e? — gt — et el¥ = e* (cosy + isiny), (14.71a 
Re(e*) =e*cosy, Im(e*)=e*siny, |eé|-—e*, arg(e*) =y. (14.71b 
The function e? is periodic, its period is 277i: e? — ce^?*r! (E —0, 31.4 2,...). (14.71c 
In particular: e° = e?*™! = 1, eĉktDri — 1, (14.71d 


c) Exponential form of a complex number (see 1.5.2.4, p. 36): 
a -- ib — pe. (14.72 
d) Euler relation for complex numbers: 


c* = cos z + isin z, (14.73a) e™ = cos z — isin z. (14.73b 


2. Natural Logarithm 


w=Lnz, if z=e”. (14.74a 
Since z = ρε», one can write: 
Lnz=Inp+i(p+2k7) and (14.74b 





Re(Lnz)=Inp, Im(Lnz) = y+ 2k (k =0,+1,+2,...). (14.74c 
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Since Ln z is a multiple-valued function (see 2.8.2, p. 86), we usually give only the principal value of the 
logarithm In z: 


Inz=np+ig (=r < p < +r). (14.74d) 
The function Ln z is defined for every complex number, except for z = 0. 
3. General Exponential Function 

g ceme, (14.75a) 
a? (a Æ 0) is a multiple-valued function (see 2.8.2, p. 86) with principal value 

a* — e*lne, (14.75b) 
4. Trigonometric Functions and Hyperbolic Functions 








ae 3 gb liu 

sing = — 5; to a ta (14.76a) 
e + e`” z2 24 

cos z = 3 ο aps (14.76b) 

: e" —e* gv ae 

sinhz= — z; =+ tar (14.77a) 
e” Le 22 24 

eshz———;——ltgtqgt' - (14.77b) 


All four series are convergent on the entire z plane and they are all periodic. The period of the functions 
(14.76a,b) is 27, the period of the functions (14.77a,b) is 27 1. 
The relations between these functions for any real or complex z are: 
siniz = isinh z, (14.78a) cosiz = cosh z, (14.78b) 
sinhiz = isin z, (14.79a) coshiz — cos z. (14.79b) 


The transformation formulas of the real trigonometric and hyperbolic functions (see 2.7.2, p. 81, and 
2.9.3, p. 91) are also valid for the complex functions. The values of the functions sin z, cos z, sinh z, and 
cosh z for the argument z = z +i y can be calculated with the help of the formulas sin(a +b), cos(a +b), 
sinh(a + b), and cosh(a + b) or by using the Euler relation (see 1.5.2.4, p. 36). 














W cos(x +iy) = cosx cosiy -— sin xsiniy = cos x cosh y — isin x sinh y. (14.80) 
Therefore: 
Re (cos z) = cos Re (z) cosh Im (2), (14.81a) 
Im (cos z) = — sin Re (z) sinh Im (z). (14.81b) 
The functions tan z, cot z, tanh z, and coth z are defined by the following formulas: 
tanz= 2A , cotz = = e j (14.82a) tanhz — m , cothz= coh . . (14.82b) 
cos z sin z cosh z sinh z 


5. Inverse Trigonometric Functions and Inverse Hyperbolic Functions 


These functions are many-valued functions, and one can express them with the help of the logarithm 
function: 


Arcsin z = —i Ln (iz + v1 — z2),  (14.83a) Arsinh z = Ln (z + V22 + 1), (14.83b) 
Arccos z = —i Ln (z + Vz2?— 1),  (14.84a) Arcosh z = Ln (z + V22? — 1), (14.84b) 


i 1. 1-2 
: —, (14.85a) Artanh z = gla 1 + - 3 








1 
Arctanz — z (14.85b) 
i 
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1 i 1 1 z+1 
Arccot z = —— Ln Eo (14.86a) Arcoth z = -Ln + : (14.86b) 
2 12-- 1 2 2-1 


The principal values of the inverse trigonometric and the inverse hyperbolic functions can be expressed 
by the same formulas using the principal value of the logarithm In z: 




















arcsin z = —i In(iz + V1 — 22), (14.87a) arsinh z = In(z + v 22 + 1), (14.87b) 
arccos z = —i In(z + Vz? — 1), (14.88a) arcosh z = In(z + Vz? — 1), (14.88b) 
1, l-ciz d. E 
arctàng  — In = ] (14.894) artanh z = —In f : (14.89b) 
2. 1-12 2. β 
1 iz+1 1 +1 
arccotz — —— ln: + À (14.90a) arcothz = ln 2 s (14.90b) 
2i 12-1 2 z-—1 


6. Realand Imaginary Part of the Trigonometric and Hyperbolic Functions 
(See Table 14.1) 


Table 14.1 Real and imaginary parts of the trigonometric and hyperbolic functions 


























Function 
Real part Re (w) Imaginary part Im (w) 
sin x cosh y + cos x sinh y 
cos x cosh y + sin x sinh y 
sin 2x af sinh 2y 

cos 2x + cosh 2y ~ cos 2a + cosh 2y 

sinh(x X iy) sinh x cos y + cosh x sin y 

cosh(a + iy) cosh x cos y +sinh x sin y 

. sinh 2x sin2y 

tanh(x + iy) ——————— +—_____ 
cosh 2a + cos 2y cosh 2x + cos 2y 








7. Absolute Values and Arguments of the Trigonometric and Hyperbolic Func- 
tions (See Table 14.2) 


Table 14.2 Absolute values and arguments of the trigonometric and hyperbolic functions 


Function 
w = f(x + iy) 
sin(x + iy) ysin? x + sinh? y +arctan(cot x tanh y 
cos(x + iy) \/ cos? a + sinh? y + arctan(tan x tanh y 
sinh(x + iy) V sinb? x + sin? y + arctan(coth x tan y 
i ( 


cosh(x + iy) 











Absolute value |w| Argument arg w 























oy RS ay A 








+ arctan(tanh x tan y 








14.5.3 Description of Curves in Complex Form 
A complex function of one real variable t can be represented in parameter form: 

z= x(t) +iy(t) = f(t). (14.91) 
As t changes, the points z draw a curve z(t). Now, the equations and the corresponding graphical 
representations of the line, circle, hyperbola, ellipse, and logarithmic spiral are represented. 
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1. Straight Line 
a) Line through a point (21, 9) (ois the angle with the z-axis, Fig. 14.492): 
2 itte, 
b) Line through two points 21, z2 (Fig. 14.49b): 
z = z1 + t(z2 — 21). 
2. Circle 
a) Circle with radius r, center at the point z = 0 (Fig. 14.50a): 


x 





z—re* (|z| ^r). 
b) Circle with radius r, center at the point zo (Fig. 14.50b): 





z— m4 ret (|z — zo| ^ r). 
y 
0 
b) 
Figure 14.49 Figure 14.50 
3. Ellipse 
g^ ay? 
a) Ellipse, Normal Form d T p — ] (Fig. 14.51a): 





a+b a—b 


z=acost + ibsint (14.94a) or z — ce + de~* (14.94b) with c — x d= 5 


i.e., c and d are arbitrary real numbers. 


(14.922) 


(14.92b) 


(14.934) 


(14.935) 


(14.940) 


b) Ellipse, General Form (Fig. 14.51b): The center is at z;, the axes are rotated by an angle. 


z= za + ce” + de”. 


(14.95) 


Here c and d are arbitrary complex numbers, they determine the length of the axis of the ellipse and 


the angle of rotation. 


y y 
πμ 
0 x 
a) b) 
Figure 14.51 Figure 14.52 


πλ γῇ 
4. Hyperbola, Normal Form a — a = 1 (Fig. 14.52): 
a 
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z =acosht + ibsinht (14.96a) 
or 
z = ce +e”, (14.97) y 
where c and € are conjugate complex numbers: 
c- £22, ασ. (14.98) t 
5. Logarithmic Spiral (Fig. 14.53): 
z=ae™, (14.99) 
where a and b are arbitrary complex numbers. Figure 14.53 


14.6 Elliptic Functions 
14.6.1 Relation to Elliptic Integrals 


Integrals in the form (8.22) with integrands R(x, \/P(x)) can not be integrated in closed form if P(x) is 
a polynomial of degree three or four, except in some special cases, but they are calculated numerically as 
elliptic integrals (see 8.1.4.3, p. 490). The inverse functions of elliptic integrals are the elliptic functions. 
They are similar to the trigonometric functions and they can be considered as their generalization. As 
an illustration the special case 
u 

/α- θά τα (|u| € 1). (14.100) 

0 
is considered. 


a) There is a relation between the trigonometric function u = sin x and the principal value of its inverse 
function 
π 
2 
b) the integral (14.100) is equal to arcsin u. The sine function can be considered as the inverse function 
of the integral (14.100). Analogies are valid for the elliptic integrals. 


u = sing & x = arcsinu for <r>, —l<u<l, (14.101) 


NIA 


E The period of a mathematical pendulum, with mass m, hanging on a non-elastic weightless thread 
of length / (Fig. 14.54), can be calculated by a second-order non-linear differential equation. This 
equation follows from the balance of the forces acting on the mass of the pendulum: 





d g. a ἃ d | /ao\? gd 
dg * 198? — 0 with 0(0) — do, (0) — 0 or di (2) = 27 qos 0). (14.1022) 


The relation between the length l and the amplitude s from the normal position is s = lù, so $ — IJ and 
§ = [9 hold. The force acting on the mass is F = mg, where g is the acceleration due to gravity (see 
Table 21.2, p. 1053), and it is decomposed into a normal component Fy and a tangential component 
Fr with respect to its path (Fig. 14.54). The normal component Fy — mg cos is balanced by the 
thread stress. Since it is perpendicular to the direction of motion, it has no effect to the equation of 
motion. The tangential component Fr yields the acceleration of the motion. Fr = mä = ml = 
—mgsinv. The tangential component always points in the direction of the normal position. 

By separation of variables follows: 


pa T LL 
° g J0 \/2(cos © — cos vo) 


(14.102b) 
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Here, to denotes the time for which the pendulum is in the deepest position for the first time, i.e., 
where (tg) — 0 holds. O denotes the integration variable. After some transformations and with the 


h 
substitutions sin — = ksiny, k = sin — follows 


tHe Er M afr (k, p) (14.1020) 
g 4/1 — k? sin? 


Here F(k, ~) is an elliptic integral of the first kind (see (8.25a), p. 490). The angle of deflection 0 — V(t) 
is a periodic function of period 27' with 


l 1 
T= E F (x, 5) - [ie (141024) 
g 2 g 
where K represents a complete elliptic integral of the first kind (Table 21.9, p. 1103). T denotes the 
: ' . . "m .. αὐ 
period of the pendulum, i.e., the time between two consecutive extreme positions for which um 0. If 


the amplitude is small, i.e., sin V ~ v, then T = 27\/1/g holds. 





cnx 


Figure 14.54 Figure 14.55 Figure 14.56 


14.6.2 Jacobian Functions 


1. Definition m 
It follows for 0 < k < 1 from the representation (8.24a) and (8.25a) (see 8.1.4.3, p. 490) for the elliptic 
integral of the first kind F (k, p) that 


dF 
dp 
i.e., F(k, q) is strictly monotone with respect to v, so the inverse function 


= (1—k? sin? y)~? > 0, (14.103) 


1 
 — am(k, u) = y(u) (14.1042) of u dum dy μ(φ) (14.1045) 


4/1 — Kk? sin? 


exists. It is called the amplitude function. The so-called Jacobian functions are defined as: 


snu = siny =sinam(k,u) (amplitude sine), (14.105a) 
cnu = cosy = cosam(k,u) (amplitude cosine), (14.105b) 
dnu = v1 — K?sn?u (amplitude delta). (14.105c) 


2. Meromorphic and Double Periodic Functions 
The Jacobian functions can be continued analytically in the z plane. The functions snz, cnz, and dnz 
are then meromorphic functions (see 14.3.5.2, p. 753), i.e., they have only poles as singularities. Besides, 
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they are double periodic: Each of these functions f(z) has exactly two periods w; and w2 with 


Fle +w) = f(z), flz+we) = f(z). (14.106) 
Here, wi and w» are two arbitrary complex numbers, whose ratio is not real. The general formula 
f(z + mw + nw) = f(z) (14.107) 


follows from (14.106), and here m and n are arbitrary integers. Meromorphic double periodic functions 
are called elliptic functions. The set 

{z0 + awi + agwo: 0 < a1, a2 < 1}, (14.108) 
with an arbitrary fixed zy € C, is called the period parallelogram of the elliptic function. If this function 
is bounded in the whole period parallelogram (Fig. 14.55), then it is a constant. 
W The Jacobian functions (14.105a) and (14.105b) are elliptic functions. The amplitude function 
(14.1042) is not an elliptic function. 
3. Properties of the Jacobian functions 
The properties of the Jacobian functions given in Table 14.3 can be got by the substitutions 


k?=1-k, K'=F (e; z) ,K=F Ὁ 7) (14.109) 








where m and n are arbitrary integers. 








Table 14.3 Periods, roots and poles of Jacobian functions 
Periods w1, w2 Roots Poles 
snz 4K, 2iK' 2mK + 2niK' 
cnz | 4K, 2(K +iK') (2m + 1)K + 2nik’ 2mK + (2n + 1)iK' 
dnz 2K,4iK' (2m +1)K 4- (2n - 11K" 

















The shape of snz, cnz, and dnz can be found in Fig. 14.56. The following relations are valid for the 
Jacobian functions except at the poles: 


1. snzcen?z 2 1, K?sn?z p dm?z — ], (14.110) 








(snu)(cnv)(dnv) 4- (snv)(cnu)(dnu) 








2. sn(u+v) 1 — issu (sndv) (14.111a) 
nu)(cnv) — (snu)(dnu)(snv)(dnv 
mie)" m T ed SEN 
dnu)(dnv) — K?(snu)(cnu)(snv)(cnv 
a c -— Forero] x e 
3. (snz) = (cnz)(dnz), (14.112a) (cnz) = —(snz)(dnz), (14.112b) 
(dnz)' 2 —K?(snz)(cnz). (14.112c) 


For further properties of the Jacobian functions and further elliptic functions see [14.8], [14.12]. 


14.6.3 Theta Functions 


The theta functions can be used to evaluate the Jacobian functions: 
co 


D(z, q) = 243 Y (—1)^q" 9 sin(2n -- 1)2, (14.1132) 


n=0 
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ϑο(α, α) -- 941 S5 a? cos(2n ^ 1)2, (14.113b) 


n=0 


J3(z,q) =1+2 5° q" cos2nz, (14.113c) 


n=1 


δισ, 2 1-- 2 Y (-1)"q" cos 2nz. (14.113d) 


n=1 





If |g| < 1 (q complex) holds, then the series (14.113a)-(14.113d) are convergent for every complex 
argument z. In the case of a constant q one uses the following brief notations: 


Ox(z) = Up (7z,q) (k= 1,2,3,4). (14.114) 


Then, the Jacobian functions have the representations: 











snz = 2K- Ya mu = n o (14.1152) cnz — ma (14.115b) 


pons f. 2s 
ϑ(0) οκ) 
BO a Sy" 
2K 
and K, K" are as in (14.109). 


dnz — 








Κ' v ? 
(14115c) with q= exp (^) , k= ( 0) (14.115d) 


14.6.4 Weierstrass Functions 


The functions 
p(z) = p(z,w, w2), (14.116a) C(z) — (z, w1, w2), (14.116b) 


a(z) = e(z,u1, 02), (14.116c) 


were introduced by Weierstrass, and here w and wg represent two arbitrary complex numbers whose 
quotient is not real. One substitutes 


Wmn = 2(mw + nwa), (14.117a) 
where m and n are arbitrary real numbers, and defines 
p(z, w1, w2) 2p Σ 1 [(z Z — Umn) 2 = wma] : (14.117b) 


The accent behind the sum sign denotes that the value pair m = n — 0 is omitted. The function 
p(z, w1, w2) has the following properties: 

1. It is an elliptic function with periods w, and wy. 

2. The series (14.117b) is convergent for every z Z wma. 

3. The function p(z, w1, w2) satisfies the differential equation 


9? — 4o? — gap —gs (14.118a) with gj — 60S ωρες. gs = in me. (14.118b) 


mn 


The quantities go and gg are called the invariants of p(z, w1, w2). 
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4. The function u = ¢(z,w1, w2) is the inverse function of the integral 


T it 
z= [-π----- (14.119) 
J VIB = gal — gs 
1[e(u) 2 gQ)]? 
5. olu +v) = -| 1] — — plv). 14.12 
plu +v) 1 | πο. plu) — plv) ( 0) 
The Weierstrass functions 
gle) = 27+ [e =n + + Winn? ; (14.121a) 








o(z) = zexp ([ [c(t) i] D “Η΄ (1 =) exp ( Z za) (14.121b) 


Wmn 


are not double periodic, so they are not elliptic functions. The following relations are valid: 


1. (z) = —@(2), C(z) = (na(z)), (14.122) 
2. ¢(-z) =—-C(z), e(-2) — —o(2), (14.123) 

G(z + 2ur) = (2) 1- 2ζ(ωι), Cl + Que) = C(z) + 2¢(w2), (14.124) 
4. C(u+v) =C(u) +6(0) + Cr (14.125) 


5. Every elliptic function is a rational function of the Weierstrass functions p(z) and ¢(z). 


15 Integral Transformations 


15.1 Notion of Integral Transformation 


15.1.1 General Definition of Integral Transformations 


AnIntegral transformation is a correspondence between two functions f(t) and F(p) in the form 
+00 
F()- f Kw.nroa. (15.19) 
—oo 


The function f(t) is called the original function, its domain is the original space. The function F(p) is 
called the transform, its domain is the image space. 
'The function K(p,t) is called the kernel of the transformation. In general, t is a real variable, and 
p = c + iw is a complex variable. 
The shorter notation can be used by introducing the symbol 7 for the integral transformation with 
kernel K (p, t): 

F(p) = T{ F} (15.1b) 


Then, (15.1b) is called 7 transformation. 


15.1.2 Special Integral Transformations 

Different kernels K (p, t) and different original spaces yield different integral transformations. The most 
widely known transformations are the Laplace transformation, the Laplace-Carson transformation, 
and the Fourier transformation. In this book an overview is given about the integral transformations 
of functions of one variable (sce also Table 15.1). More recently, some additional transformations 
have been introduced for use in pattern recognition and in characterizing signals, such as the Wavelet 
transformation, the Gabor transformation and the Walsh transformation (see 15.6, p. 800ff.). 


15.1.3 Inverse Transformations 


The inverse transformation of a transform into the original function has special importance in applica- 
tions. With the symbol 7 -! the inverse integral transformation of (15.1a) is 


f(t τρ}. (15.34) 
The operator T~! is called the inverse operator of T, so 
THT{ FOH = FO. (15.2b) 


The determination of the inverse transformation means the solution of the integral equation (15.1a), 
where the function F(p) is given and function f(t) is to be determined. If there is a solution, then it 
can be written in the form 

ft) - T^ (tF(). (15.2c) 
The explicit determination of inverse operators for different integral transformations, i.e., for different 
kernels K(p, t), belongs to the fundamental problems of the theory of integral transformations. The 
user can solve practical problems by using the given correspondences between transforms and original 
functions in the corresponding tables (Table 21.13, p. 1109, Table 21.14, p. 1114, and Table 21.15, 
p. 1128). 


15.1.4 Linearity of Integral Transformations 
If f (t) and f2(t) are transformable functions, then 

Tiki fit) + ke fol} = mT {A} + kaT (fa (0). (15.3) 
where ky and kz are arbitrary numbers. That is, an integral transformation represents a linear operation 
on the set T' of the 7 -transformable functions. 
© Springer-Verlag Berlin Heidelberg 2015 767 
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Table 15.1 Overview of integral transformations of functions of one variable 

Transformation | Kernel K (p, t) Symbol Remark 

Laplace 5 fort«0 7 fe E o . 

transformation e? for t0 £0 - fen pog 

; Επ{/()1(9} - Ε{9} 

Two-sided too ; aim 

Laplace ο θὰ L = / e ? f (t)dt 

janua i ne pe —oo nu 1(t) = 0 for t<0 
I 14ο {0 

Finite 0 fort«0 a 

Laplace e for 0<t<a |La{f(t)} = [ο (θα 

transformation 0 fort>a B 





The Carson transforma- 















































oo 
Laplace-Carson t for t<0 CIFA) b= | pe vt p tion can also be a two- 
transformation pe” for t > 0 sided and finite transfor- 
τ mation. 
Foo 
Fourier cát F. f —iwt 
: B ie e t)dt =0 + =0 
transformation t )} É f Je η z 
—oo 
One-sided 0 f τ 
or t«0 ži 
Fourier { e» for t0 Jit f(t)) = fe wt ftdt |p=ot+iw σ--θ 
transformation 0 
Finite 0  fort«0 
Fourier e for 0<t<a|Fiff(t}} = / e" f(tdt |p=otiw σ-0 
transformation 0 for t>a 
Fouri i 0 for t<0 T 
ourler cosine or . —" = 
S REA { Re [e^] for t» 0 F.A{f(t)} = [ro cosuwtdt |p — G6 «iw σθ 
0 
Fourier si rt<0 Ῥ 
ourier sine = PN ' - 
irandlormisin 1 Im [e*t] t>0 F{f(t)} = [η J(t)sinotdt |jp=ot+iw a=0 
0 
0 for t<0 T 
Mellin or $« / -1 
3 H= p tr t)dt 
transformation |. Pb for $20 MO J f(t) 
anke p=otiw w=0 
cake { 0 fort«0 HA f(t} - fue t)dt J,(ot) is the v-th or- 
af order E tJ,(ot) for t 0 τ der Bessel function of the 
HORUM ? first kind. 
ἜΝ E fort «0 oo 
Stieltjes T SUF} = / F(t) dt 
transformation z for t0 à J p +t 
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15.1.5 Integral Transformations for Functions of Several Variables 


Integral transformations for functions of several variables are also called multiple integral transforma- 
tions (see [15.13]). The best-known ones are the double Laplace transformation, i.e., the Laplace trans- 
formation for functions of two variables, the double Laplace-Carson transformation and the double 
Fourier transformation. The definition of the double Laplace transformation is 


F(p.a) - (fs) m f. [ es) any. (15.4) 


The symbol £ denotes the Laplace transformation for functions of one variable (see Table 15.1). 


15.1.6 Applications of Integral Transformations 
1. Fields of Applications 


Besides the great theoretical importance that integral transformations have in such basic fields of math- 
ematics as the theory of integral equations and the theory of linear operators, they have a large field of 
applications in the solutions of practical problems in physics and engineering. Methods with applica- 
tions of integral transformations are often called operator methods. They are suitable to solve ordinary 
and partial differential equations, integral equations and difference equations. 

2. Scheme of the Operator Method 

The general scheme to the use of an operator method with an integral transformation is represented 
in Fig. 15.1. One gets the solution of a problem not directly from the original defining equation; first 
an integral transformation is applied. The inverse transformation of the solution of the transformed 
equation gives the solution of the original problem. 









































Equation of Solution of 
Problem —» ——— : > Result 
the problem the equation 
Transformation Inverse transformation 
Solution by using | Transformed Solution of m 
the transformation equation æ the transformed 
equation 














Figure 15.1 





The application of the operator method to solve ordinary differential equations consists of the following 
three steps: 

1. Transition from a differential equation of an unknown function to an equation of its transform. 

2. Solution of the transformed equation in the image space. The transformed equation is usually no 
longer a differential equation, but an algebraic equation. 

3. Inverse transformation of the transform with help of 7 -! into the original space, i.e., determination 
of the solution of the original problem. 

The major difficulty of the operator method is usually not the solution of the transformed equation, 
but the transformation of the function and the inverse transformation. 


lH .. | 
SS f(t)=sin t 
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15.2 Laplace Transformation 


15.2.1 Properties of the Laplace Transformation 
15.2.1.1 Laplace Transformation, Original and Image Space 


1. Definition of the Laplace Transformation 
The Laplace transformation 
co 
LAFE} = f f) — F() (15.5) 
o 
assigns a function F(p) of a complex variable p to a function f(t) of a real variable t, if the given 
improper integral exists. f(t) is called the original function, F(p) is called the transform of f(t). In 
the further discussion it is assumed that the improper integral exists if the original function f(t) is 
piecewise smooth in its domain t > 0, in the so called original space, and for t — oo, |f(t)| € Ke*' 
holds with certain constants K > 0, œ > 0. The domain of the transform F(p) is called the image 
space. 
In the literature the Laplace transformation is often found also in the Wagner or Laplace-Carson form 


£w4f() 7» f e” fO dt = pF). (15.6) 
0 


2. Convergence 
The Laplace integral £{ f (t)} converges in the half-plane Rep > a (Fig. 15.2). The transform F(p) 
is an analytic function with the properties: 


1. πα F(p)=0. (15.7a) 
Rep-oo 
This property is a necessary condition for F(p) to be a transform. 
2. lim pF(p) = A, (15.7b) 
p 
(poc) 


if the original function f(t) has a finite limit lim f(t) = A. 


(t0) 





Figure 15.2 Figure 15.3 


3. Inverse Laplace Transformation 
One can retrieve the original function from the transform with the formula 
c-ioo 
-1 pcm he „PETI, —Jf(t) for t>0, É 
£ {FP = Í e” F(p) dp o for t « 0. (15:8) 


c—ioo 
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The path of integration of this complex integral is a line Re p = c parallel to the imaginary axis, where 
Rep = c > a. Ifthe function f(t) has a jump at t = 0, i.e., im, f(t) Z 0, then the integral has the 
> 


1 
value 5 f (4-0) there. 


15.2.1.2 Rules for the Evaluation of the Laplace Transformation 
The rules for evaluation are the mappings of operations in the original domain into operations in the 
transform space. 
Hereafter the original functions will be denoted by lowercase letters, the transforms are denoted by the 
corresponding capital letters. 
1. Addition or Linearity Law 
'The Laplace transform of a linear combination of functions is the same linear combination of the 
Laplace transforms, if they exist. With constants À1,..., A, that is 

L{Afi(t) + Aafa(t) - H Anfalt)} = AFi (p) + A2Folp) +-+ 0 MER). (15.9) 


2. Similarity Laws 
The Laplace transform of f(at) (a » 0,a real) is the Laplace transform of the original function divided 
by a and with the argument p/a: 


1 
L{f(at)} = =F (2) (a > 0, real). (15.10a) 
Analogously for the inverse transformation 
1 t 
£UF(ap)} =~ f (-) (a >0). (15.10b) 
a 


Fig. 15.3 shows the application of the similarity laws for a sine function. 
W Determination of the Laplace transform of f(t) = sin(wt). The transform of the sine function is 


L{sin(t)} = F(p) = 1/(p? + 1). Application of the similarity law gives £{sin(wt)} = * F(p/) B 
1 1 w 

coui Pa 

3. Translation Laws 


i: Shifting to the Right The Laplace transform of an original function shifted to the right by a 
(a > 0) ) is equal to the Laplace transform of the non-shifted original function multiplied by the factor 


E —a)} =e? F(p). (15.11a) 


2. Shifting to the Left The Laplace transform of an original function shifted to the left by a is 
equal to e*? multiplied by the difference of the transform of the non-shifted function and the integral 


JË F(E) e dt: 
Liflt+a)} =e” | F(p) — fe* Fo αἱ. (15.11) 

0 

Figs. 15.4 and 15.5 show the cosine function shifted to the right and a line shifted to the left. 

4. Frequency Shift Theorem 

The Laplace transform of an original function multiplied by e~ 

the argument p+ b (bis an arbitrary complex number): 


L{e" f(t)} = F(p +b). (15.12) 





èt is equal to the Laplace transform with 
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10 j f(t) f(t) f(t) 
0 κα η bam t 0 ἘΠ t 


Figure 15.4 Figure 15.5 








5. Differentiation in the Original Space 
If the derivatives f'(t), f"(t),. .., f? (t) exist for t > 0 and the highest derivative of f(t) has a trans- 
form, then the lower derivatives of f(t) and also f(t) have a transform, and: 


ELP (0) — p F(») — fC-0). 
L{f"(t)} = p’ F(p) — f(+0) p— f'(+0), 


£(f9(0) = ph Fp) — fG0) p — fo) p? - ο 
— fl) (+40) p— f- (+0) with 
JO(+0)= lim, FO). 


Equation (15.13) gives the following representation of the Laplace integral, which can be used for ap- 
proximating the Laplace integral: 


$ n (n—1) 
L{f(t)} = a + " εὐ ga pps n + SLIM}: (15.14) 





6. Differentiation in the Image Space 
£i f) - C1 F9(). (15.15) 


The n-th derivative of the transform is equal to the Laplace transform of the (—t)"-th multiple of the 
original function f(t): 


E£((-1) t^ f(0) 2 FP?(p (n=1,2,...). (15.16) 
7. Integration in the Original Space 


The transform of an integral of the original function is equal to 1/p” (n > 0) multiplied by the trans- 
form of the original function: 


e i] ans. f reis) G-p! cL fja- on 1) rear} = Fro) (15.17a) 
0 0 0 0 


In the special case of the ordinary simple integral 


£ πω. --ρῳ) (15.17b) 
0 p 


holds. In the original space, differentiation and integration act in converse ways if the initial values are 
zeros. 


8. Integration in the Image Space 


cus 2) = fon Jin jn F (pn) dpn = "m j 1 _fe-p \-l F(z) dz. (15.18) 


n-1i 
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This formula is valid only if f(t)/t” has a Laplace transform. For this purpose, f(x) must tend to zero 
fast enough as t — 0. The path of integration can be any ray starting at p, which forms an acute angle 
with the positive half of the real axis. 


9. Division Law 
In the special case of n — 1 of (15.18) 


FOL T mAg 
cf = i (15.19) 


t 
holds. For the existence of the integral (15.19), the limit lim . must also exist. 
— 


10. Differentiation and Integration with Respect to a Parameter 





Of (t, a) OF (p, a) 
Ae 


ag o2 
à Ξ--π--- (15.204) £ | EO ia} B Io a) da. (15.20b) 
a 
αι αι 
Sometimes one can calculate Laplace integrals from known integrals with the help of these formulas. 
11. Convolution 


1. Convolution in the Original Space The convolution of two functions fi(f) and fa(t) is the 
integral 


fi* h= / filt): falt- T)dr. (15.21) 


Equation (15.21) is also called the one-sided convolution in the interval (0,t). A two-sided convolution 
occurs for the Fourier transformation (convolution in the interval (—oo, oo) see 15.3.1.3, 9., p. 789). 
The convolution (15.21) has the properties 





a) Commutative law: fi * fo =fe* fi- (15.22a) 
b) Associative law: (fi * fo) * fg = fi * (fo * fs). (15.22b) 
c) Distributive law: (fi fo) * fg = fi * fat fo * fs. (15.22c) 
g(t) In the image domain, the usual multiplication 
corresponds to the convolution: 

: L{fi * fo} = Fi(p)- Fo(p). (15.23) 

t The convolution of two functions is shown in 

f(t) Fig. 15.6. One can apply the convolution theo- 


rem to determine the original function: 
a) Factoring the transform 








> 
e jo) t F(p) = Fi(p) - Fa(p). 

5 b) Determining the original functions f;(t) and 
fo(t) of the transforms F\(p) and Fs(p) (from a 
table). 

> 
t c) Determining the original function associated 


to F(p) by convolution of fı(t) and f2(t) in the 
Figure 15.6 original space (f(t) = fi(t) * fo(t)). 
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2. Convolution in the Image Space (Complex Convolution) 


1 αι Γἶου 
Jri Fı(z)- Fo(p — z) dz, 
EU) - f) — pon (15.24) 
ad Fi(p— z): Fa(z) dz. 


The integration is performed along a line parallel to the imaginary axis. In the first integral, x and p 
must be chosen so that z is in the half plane of convergence of Z( fi] and p — z is in the half plane of 
convergence of L{ f2}. The corresponding requirements must be valid for the second integral. 


15.2.1.3 Transforms of Special Functions 


1. Step Function 
The unit jump at t = to is called a step function (Fig. 15.7) (see also 14.4.3.2, 3., p. 757); it is also 
called the Heaviside unit step function: 


1 for t > to, 
u(t — ty) = t5 Brice (o>). (15.25) 
; p W Cosw to +p sinwt : 
BA: f(t) = u(t — to) sinut, F(p) ^ e te» E (Fig. 15.8). 
ω 





E B: f(t) = u(t — to) sinw (t — to), F(p) — e top (Fig. 15.9). 


f(t) 


fH u(t-t) sinot-t) 





Figure 15.7 Figure 15.8 Figure 15.9 


2. Rectangular Impulse 
A rectangular impulse of height 1 and width T (Fig. 15.10) is composed by the superposition of two 
step functions in the form 


0 for t < to, 
ur(t = to) = u(t = to) = u(t — 1g — T) —41 for to < t < to + T (15.26) 
0 for t>to+T; 


e toP(1 — e-TP 
£(ur(t to) = E 


3. Impulse Function (Dirac ô Function) 


(See also 12.9.5.4, p. 700.) The impulse function ô(t — to) can obviously be interpreted as a limit of 
the rectangular impulse of width T and height 1/T at the point t = to (Fig. 15.11): 





ó(t to) = Jim, jut to) — u(t — to — T)]. (15.28) 
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t bT t O| t tT t 
Figure 15.10 Figure 15.11 
For a continuous function h(t), 
j ο μα ito is outside (a0). (15.29) 
Relations such as 
ó(t — tg) — RUF NE L{5(t—to)} =e” — (to > 0) (15.30) 


dt 
are investigated generally in distribution theory (see 12.9.5.3, p. 699). 


4. Piecewise Differentiable Functions 


The transform of a piecewise differentiable function can be determined easily with the help of the 6 








function: If f(t) is piecewise differentiable and at the points t, (v = 1,2,...,n) it has jumps a,, then 
its first derivative can be represented in the form 
df (t 
i Fit) +aid(t — t1) + a2ô(t — t2) + -- -+ anô(t — tn) (15.31) 
dt 


where f{(t) is the usual derivative of f(t), where it is differentiable. 

If jumps occur first in the derivative, then similar formulas are valid. In this way, one can easily de- 
termine the transform of functions which correspond to curves composed of parabolic arcs of arbitrarily 
high degree, e.g., curves found empirically. In formal application of (15.13), the values f (+0), f’(+0),... 
should be replaced by zero in the case of a jump. 


ΒΑ: 
s(t) - [qut bird e tolo (Fig. 18.12); f'(t) — aus (1) + DOLE) — (ato +b) ( — to): LUPO} = 


0 otherwise 


(t 
a —top —top 1 -top | 2 
ae )+b-— (ato +b)e™?; L{f(t)} = 2 atime αρ : 


HB: 
t for 0 «€ t € to, 1 for 0<t< to, 
f(t) 2to —t for to <t < 2to, (Fig. 15.13); /'(t) 2 4 —1 for tp <t < 2to, (Fig. 15.14); 
0 for t > 2to, 0 for t > 2to, 
: (1— eter)? 
F'O = 8(0)-5(£—19)-8(t—19)--3(2t9);. £UP'()) - 1-26 em Cg) - πα 
Et/to for 0<t<to, 
; E for to « t € T — tg, ; : 
mc: f(t)= —E(t — T)/to for Pip L T (Fig. 15.15); 


0 otherwise, 
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0 ty ty 25 
Figure 15.12 Figure 15.13 Figure 15.14 
E/to for 0 « f « tg, 
0 for to « t « T — to, (t T) p 
y .. ? . 
FO) =) Eft for Toh cher (Fig. 15.16); 
0 otherwise, 





p(t) = 25) δι) Zt T44)4 Eo(t-T); £(f"() - = [1 — eP — eTo) 4 e-TP]; 





to to 0 to 

(i= e top y1- e (to) 
££) 2 2 MM. 

0 p 

{9 
t ThT t 
Figure 15.15 Figure 15.16 

ED: 


t—t for0<t<1, mp; . 1—2t forü ct « 1, 
fe = ‘6 otherwise, (Fig. 15.17); Pup [c otherwise, 


f"(t) 2 —2uy(t) + (t) + ôlt — 1); 
L{f"(t)} = -q -e?)-icre* L£(f(0)- 


(Fig. 15.18); 





l+te? 2(1—e?) 
p ES p) i 

f(t) f'(t) 

1 


1/2 1 


1/2 1 t 





Figure 15.17 Figure 15.18 


5. Periodic Functions 
The transform of a periodic function f *(t) with period T', which is a periodic continuation of a function 
f (t), can be obtained from the Laplace transform of f(t) multiplied by the periodization factor 


(1—e79)71, (15.32) 
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E A: The periodic continuation of f (t) from example B (see above) with period T' — 2to is f *(t) with 


uv (ety? 1 1— e-top 
EL (0) p `I- eop pl ey 


E B: The periodic continuation of f(t) from example C (see above) with period T'is f*(t) with 


E (1 — ο ορ) (1 — e-(T-to) 
αμ ο) - EU. 


15.2.1.4 Dirac 6 Function and Distributions 
In describing certain technical systems by linear differential equations, functions u(t) and 6(¢) often 
occur as perturbation or input functions, although the conditions required in 15.2.1.1, 1. p. 770, are 
not satisfied: u(t) is discontinuous, and 6(t) cannot be defined in the sense of classical analysis. 
Distribution theory offers a solution by introducing so-called generalized functions (distributions), so 
that with the known continuous real functions 6(¢) can also be examined, where the necessary differen- 
tiability is also guaranteed. Distributions can be represented in different ways. One of the best known 
representations is the continuous real linear form, introduced by L. Schwartz (see 12.9.5, p. 698). 
Fourier coefficients and Fourier series can be associated uniquely to periodic distributions, analogously 
to real functions (see 7.4, p. 474). 
1. Approximations of the 6 Function 
Analogously to (15.28), the impulse function 6(¢) can be approximated by a rectangular impulse of 
width ¢ and height 1/e (e > 0): 
_f1/e for |t| < ¢/2, Eas 
f(t,e) = lg i ELS (15.334) 


Further examples of the approximation of 6(t) are the error curve (see 2.6.3, p. 73) and Lorentz function 
(see 2.11.2, p. 95): 


i oca ΠῚ 
f(t,e) = NS (e » 0), (15.33b) 








f(te) = m (e » 0). (15.33c) 
These functions have the common properties: 

| f(t,¢) dt =1. (15.34a) 
2. P €) — f(t, £), i.e., they are even functions. (15.34b) 
3 Μη {ο mizo (15.340) 


2. Properties of the ó Function 
Important properties of the ô function are: 


rca 
1. / f(t)6(a — t) dt = f(x) (f is continuous, a > 0). (15.35) 
2. d(ax) = ο (a > 0). (15.36) 


n 


3. d(g(x)) = 





ó(r— vij) with g(v;) 2 0 and g'(z;) Z 0 (i — 1,2,...,n). (15.37) 
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Here all roots of g(a) are considered and they must be simple. 
4. n-th Derivative of the 6 Function: After n repeated partial integrations of 
rta 
f(a) = / f(t) 6(a — t) dt, (15.38a) 
χ-α 
a rule is obtained for the n-th derivative of the ó function: 
rca 


(1) f 9" (a eq Pope (15.38b) 


15.2.2 Inverse Transformation into the Original Space 


To perform an inverse transformation, there are the following possibilities: 

1. Using a table of correspondences, i.e., a table with the corresponding original functions and trans- 
forms (see Table 21.13, p. 1109). 

2. Reducing to known correspondences by using some properties of the transformation (see 15.2.2.2, 


p. 778, and 15.2.2.3, p. 779). 
3. Evaluating the inverse formula (see 15.2.2.4, p. 780). 


15.2.2.1 Inverse Transformation with the Help of Tables 


The use of a table is shown here by an example with Table 21.13, p. 1109. 
Further tables can be found, e.g., in [15.3]. 


1 =l 
OO MAO) c7; 


il 
LM QE P :) =e“ = f,(t). Applying the convolution theorem (15.23) yields: 


1 
pP +w? 





E F(p) li : —sinwt = fi(t), 








f(t) =£{Fi(p)- Fa(p)} 


- [ n0: n - nar - ['e7 du 


ω ce To? 





(¢—7) Sin WT 1 ( sin wt — w cos wt P 
t ] 


ω 


15.2.2.2 Partial Fraction Decomposition 


1. Principle 

In many applications, there are transforms in the form F(p) — H(p)/G(p), where G(p) is a polynomial 
of p. If theoriginal functions for H (p) and 1/G(p) are already known, then the required original function 
of F(p) can be got by applying the convolution theorem. 

2. Simple Real Roots of G(p) 

If the transform 1/G(p) has only simple poles p, (v = 1,2,...,n), then it has the following partial 
fraction ucc 


cg σσ TE p) (15.39) 


The corresponding original function is 


qalt) = L l- το Pt, (15.40) 
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3. The Heaviside Expansion Theorem 
If the numerator H (p) is also a polynomial of p with a lower degree than G(p), then we can obtain the 
original function of F(p) with the help of the Heaviside formula 


- ο τν 
IO =E Gp ] (15.41) 


4. Complex Roots 

Even in cases when the denominator has simple complex roots, the Heaviside expansion theorem can 
be used in the same way. The terms belonging to complex conjugate roots can be collected into one 
quadratic expression, whose inverse transformation can be found in tables also in the case of roots of 
higher multiplicity. 

J 
(p + c)? +w?) 
zeroes of G(p) pı = —c, pz = iw, pa = —iw are all simple. According to the Heaviside theorem one gets 
1 


1 ; : 
ft) = = se" πο - 3u(u 4i jp or by using partial fraction decomposition and 
ω 4 c 2w(w — ic w(w + ic 





B F(p) = „ie, H(p) = 1, G(p) = (p+ 0)(p? + w?), G'(p) — 3p? + 2pc + w?. The 





1 c—p 
«?-|ptc pcw? 
expressions for f(t) are identical. 


the table F(p) = 








1 c 
| , fo = ud e * 4 z Sino — cosut . These 


15.2.2.3 Series Expansion 


In order to obtain f(t) from F(p) one can try to expand F(p) into a series F(p) = Y Fa(p), whose 


n=0 
terms F;,(p) are transforms of known functions, i.e., F,(p) = L{fn(t)}. 
1. F(p) isan Absolutely Convergent Series 
If F(p) has an absolutely convergent series 





oo 
απ 
Fp) = o>, (15.42) 
n=0 p 
for |p| > R, where the values A, form an arbitrary increasing sequences of numbers 0 < Ag < Ay < 
exc Àn <+ x o0, then a termwise inverse transformation is possible: 
oo p» 
t)- [E 15.43 
FO = Yet (15.43) 


I denotes the gamma function (see 8.2.5, 6., p. 514). In particular, for A, = n + 1, i.e., for F(p) = 


œ a k œ a 
È zl the series f(t) = Y; ME 
p n=0 n! 





t" is obtained, which is convergent for every real and complex t. 


Furthermore, one can have an estimation in the form |f (t)| « C e** (C, c real constants). 


38 86 1 
1 1 1 z> 1 
E F(p) = — =-{14+3 = —+— . After a termwise transformation into the 
(p) Vit P al 5) ΣΙ Jp 








n=0 

oo = 1 pn oo (-1)" t 2n 
original space the result is f(t) = X | 2 D 5 ae (5) = Jo(t) (Bessel function of 

aon J Qn! zn) 
0 order). 
2. F(p)isa Meromorphic Function 
If F(p) is a meromorphic function, which can be represented as the quotient of two integer functions 
(of two functions having everywhere convergent power series expansions) which do not have common 
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roots, and so can be rewritten as the sum of an integer function and infinitely many partial fractions, 
then the equality 


ctiyn 


1 Τε 1 
-- I e? F(p) dp = X by” — — I e F(p) dp (15.44) 
2πι J A Θπὶ . 
c—iyn (Kn) 
is obtained. Here p, (v = 1,2,...,m) are the first-order poles of the function F'(p), b, are the corre- 


sponding residues (see 14.3.5.4, p. 753), y, are certain values and K, are certain curves, for example, 
half circles in the sense represented in Fig. 15.19. The solution f(t) has the form 


fü-Y e d Í e? F(p) dp > 0 (15.45) 
v=1 


(Kn) 


2ri 


as y — oo, what is often not easy to verify. 








Figure 15.19 Figure 15.20 


In certain cases, e.g., when the rational part of the meromorphic function F(p) is identically zero, the 
above result is a formal application of the Heaviside expansion theorem to meromorphic functions. 


15.2.2.4 Inverse Integral 


The inverse formula 
1 ct+iyn 
‘)= Jim z= | Fod 15.46 
f(t) = lim 7 | 3 (p) dp (15.46) 
c—iyn 
represents a complex integral of a function analytic in a certain domain. The usual methods of inte- 
gration for complex functions can be used, e.g., the residue calculation or certain changes of the path 
of integration according to the Cauchy integral theorem. 
p 


Β Τί») ---; lo e^ VP? is double valued because of yP. Therefore, we chose the following path of inte- 
p+w 


1 η 
gration (Fig. 15.20): xif 731467 9-[- tf t fo = 
(K) AB óD EP DA BE FC 


YRese?F(p) 5 eV "P? cos(.;t — o /uj2). According to the Jordan lemma (see 14.4.3, p. 755), the 








~ E 
integral part over AB and CD vanishes as yn —> oo. The integrand remains bounded on the circular 
A 


arc EF (radius £), and the length of the path of integration tends to zero for € — 0; so this term of the 
integral also vanishes. There are to investigate the integrals on the two horizontal segments B E and 
FC, where it is to consider the upper side (p — re*) and the lower side (p = re~?) of the negative real 
axis: 
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0 oo T . —oo oo T τ 
Paes e m -tr oia yT Jp ufi _ „—tr Pla VT Jy 
Jo F(p)e? dp — 1 € gap dr, f F(p)e” dp = [ e out dr. 


Finally one gets: 
W 9ο rsin ayr 
i-e? «P eos ( t— a-f otr dr. 
f(t) =e wt— als zh £ ατα r 


15.2.3 Solution of Differential Equations using Laplace 
Transformation 

It has been noticed already from the rules of calculation of the Laplace transformation (see 15.2.1.2, 

p. 771), that complicated operations, such as differentiation or integration in the original space, can be 

replaced by simple algebraic operations in the image space using the Laplace transform. Here, some 

additional conditions are considered, such as initial conditions in using the differentiation rule. These 

conditions are necessary for the solution of differential equations. 


15.2.3.1 Ordinary Linear Differential Equations with Constant Coefficients 


1. Principle 
The n-th order differential equation of the form 

y(t) + ray Y(t) +--+ ey lt) + oytt) = FE) (15.47a) 
with the initial values y(+0) = yo, y/(+0) — yj, ..., y 9 (4-0) — y? 
transformation into the equation 


can be transformed by Laplace 


n n k-1 
Yd cap*¥ (p) — Sa Spt ty? = F(p) (en = 1). (15.47b) 
k=0 k=1 v=0 


n 
Here G(p) = Y; cyp* — Vis the characteristic equation of the differential equation (see 4.6.2.1, p. 315). 
k=0 


2. First-Order Differential Equations 


The original and the transformed equations are: 














y (t) + coy(t) = f(t), y(+0) = yo, (15.48a) (p--co)Y(p)— yo — F(p), (15.458) 
where co = const. The solution for Y (p) results in 
F 4 
Y(p) = τ. (15.486) 
p+ co 
i a A 
Special case: For f(t) 2 Ae" with F(p) — » (A, const) : (15.49a) 
p-n 
^ Yo 
Y(») — + i 15.49b 
97 pra) sto 
A A 
DES e" E yo — pO 15.49 
(0 = et (n - 2). (15499 


3. Second-Order Differential Equations 


The original and transformed equations are: 
y (t)  2ay (t) + bylt) = f(t), —-y(40)=y0, y (0) — s. (15.50a) 


(p? -- 2ap 4- b) Y (p) — 2ayo — (pyo + yo) = F(p). (15.50b) 
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The solution for Y (p) results in 
F(p) + (2a + p) yo + Yh 








Y(p)= 15. 
(p) + Dap +b (15.50c 
Distinction of Cases: 

a) b<a@: G(p)=(p—a)(p—ay) (ar, a2 real; ay 4 a), (15.51a 
1 1 

Ð= L- — oat ρα») 15.5} 

ο η το... (15.510 

b) b=a*®: G(p)-(p—oy, (15.52a) q(t) =te™. (15.52b 

c) ba^: G(p) has complex roots, (15.53a 
1 1 

q(t) = L lon] m e * sin vb — a?t. (15.53b 

The solution y(t) can be obtained as the convolution of the original function of the numerator of Y (p 


and q(t). The application of the convolution can be avoided if a direct transformation of the right-hand 
side can be found. 


Bl The transformed equation for the differential equation y(t) + 2y'(t) + 10y(t) = 37 cos 3t + 9e* 





, p+2 37p 9 
th yo = 1 and yh = 0 is Y (p) = j . TI 
ωρα P2pr10 (P909 -2»410) (p-1p *2p-10) 
=p 19 p , 18 


| s 
PP+2p+10 (p +2p+10) (+9) (5949) (p+1) aa .. 
partial fraction decomposition of the second and third terms of the right-hand side but not separating 
the second-order terms into linear ones. The solution after termwise transformation is (see Table 
21.13, p. 1109) y(t) — (— cos 3t — 6sin 3t)e~* + cos 3t + 6sin 3t + e™. 

4. n-th Order Differential Equations 

The characteristic equation G(p) = 0 of this differential equation (see (15.47a)) has only simple roots 
04,023, ...,04, and none of them is equal to zero. Two cases are distinguished for the perturbation 
function f(t). 

1. If the perturbation function f(t) is the jump function u(t) which often occurs in practical problems, 
then the solution is: 


1 for t» 0, 
u(t) = {0 for t « 0, 





representation Y (p) — 





5.545 t pus 15.541 
(15.544) y(t) = ao πες "ma (15.54b) 
2. For a general perturbation function f(t), one gets the solution g(t) from (15.54b) in the form of the 
Duhamel formula which uses the convolution (see 15.2.1.2, 11., p. 773): 


=Å fut- nfd = Sly» f) (15.55) 


15.2.3.2 Ordinary Linear Differential Equations with Coefficients 
Depending on the Variable 

Differential equations whose coefficients are polynomials in t can also be solved by Laplace transfor- 

mation. Applying (15.16), in the image space yields a differential equation, whose order can be lower 

than the original one. 

If the coefficients are first-order polynomials, then the differential equation in the image space is a first- 

order differential equation and may be it can be solved more easily. 
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1 
EŒ Bessel differential equation of 0 order: E + B +tf = 0 (see (9.52a, p. 562) for n = 0). The 
transformation into the image space results in 
d, 2 dF (p) dF p 
F (0) | + pF( =0 or - F(p). 
ap? FG») - »/(0) - /(] - pF) - f) - 5-0 e o --u Fo) 
πεί p dp 
eparat f the variables and integrat elds log F(p) — — = —logyp? +1 +1 
Separation of the variables and integration yields log F(p) P+ og /p? +1 + logC, 


F(p) = ue 


the Bessel function of 0 order). 
15.2.3.3 Partial Differential Equations 


1. Generallntroduction 

The solution of a partial differential equation is a function of at least two variables: u = u(x,t). Since 
the Laplace transformation represents an integration with respect to only one variable, the other vari- 
able should be considered as a constant in the transformation: 


(C is the integration constant), f(t) = CJo(t) (see MM in 15.2.2.3,1., p. 779 with 


oo 


L{u(x,t)} = [279.5 dt = U (x, p). (15.56) 
0 
x also remains fixed in the transformation of derivatives: 


£ ex = pL{u(x,t)} — u(x, +0), 





Ot 
Pur, t 
£ [AS] = PPL{u(0,1)} — u(x, +0)p — ue(2, +0). 
The differentiation with respect to x is supposed to be interchangeable with the Laplace integral: 
Ou(x, t) ὃ : a 
al Ba \ — a, Plu 0) — 3: Un»). (15.58) 


Inthis way, an ordinary differential equation is obtained in the image space. Furthermore, the boundary 
and initial conditions are to be transformed into the image space. 


2. Solution of the One-Dimensional Heat Conduction Equation for a 





Homogeneous Medium 


1. Formulation of the Problem Suppose the one-dimensional heat conduction equation with van- 
ishing perturbation and for a homogeneous medium is given in the form 


ügs — à ^u; — Use — uy —0 (15.594) 
in the original space 0 < t < oo, 0 « x < l and with the initial and boundary conditions 
u(x, +0) = uog(r), u(+0,t) = ao(t), u(l —0,t) — ax(t). (15.59b) 


The time coordinate is replaced by y = at. (15.59a) is also a parabolic type equation, just as the 
three-dimensional heat conduction equation (see 9.2.3.3, p. 591). 
2. Laplace Transformation The transformed equation is 


d 
= pU — ug(z), (5.604) 
da? 
and the boundary conditions are 
U(+0,p) = Ao(p), | U(L—0,p) — Ai(p). (15.60b) 


The solution of the transformed equation for zero starting temperature uo(x) = 0 is 
U(x, p) = cie” YP 4- coe? VP, (15.60c) 
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It is a good idea to produce two particular solutions U; and U2 with the properties 


Ui(0,p) 21, Ur(I,p) = 0, (15.61a) Us(0,p) — 0, Us(l p) — 1, ie., 
ell—2) VP = e7 (2) VP : et vP — e-t VP 
Ui(z, p) — ~ope pP c (15.61c) U(x, p) = el VP — e IVP ` 


The required solution of the transformed equation has the form 


U(x, p) = Ao(p) U(x, p) + A (p) U2(x, p). 


(15.615) 


(15.614) 


(15.62) 


3. Inverse Transformation The inverse transformation is especially easy in the case of | — oo: 





" f ao( — T) x? 
U(r,p)- ao(p)e ^V?, — (15.63a) u(x,t) E "um P (ο) dr. 


15.3 Fourier Transformation 
15.3.1 Properties ofthe Fourier Transformation 
15.3.1.1 Fourier Integral 


1. Fourier Integral in Complex Representation 


(15.63b) 


The basis of the Fourier transformation is the Fourier integral, also called the integral formula of 
Fourier: If a non-periodic function f(t) satisfies the Dirichlet conditions (see 7.4.1.2, 3., p. 475) in 


an arbitrary finite interval, and furthermore the integral 
+0 1 ete 
r |f(t)|dt | (15.64a) is convergent, then -- 7 Je (177) F(7) dw dr 
“oo 2r —00 —00 


at every point where the function f(t) is continuous, and 


oo +00 
t+0 t—0 1 
η =— [à / f(7)cos v (t — 7) dr 
2 TO 00 
at the points of discontinuity. 


2. Equivalent Representations 
Other equivalent forms for the Fourier integral (15.64b) are: 


+00 +00 
1. fü) - z- / f EO) cos[w (t — 7)] de dr. 
mu. κ 

t) = flaw )coswt + b(w)sinwt|dw with the coefficients 

Ὁ 

1 +00 1 +00 

== [19 cos wt dt (15.65c) b(w) — — f O sinwtat. 

T. T. 


oo 


3. f(t) = f Ate w) cos [wt + (uw) ] du. 


0 


(15.64b) 


(15.64c) 


(15.65a) 


(15.65b) 


(15.65d) 
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4. f(t)= / Alw) sin [wt + p(w) ] dw. (15.67) 


The following relations are valid here: 








A(w)- VaXw)-- Uu), ^ (15.682) φ(ω) -- ψ(ω) 4- a (15.68b) 
"os i) = ata) 5.68c sin (uw E o) 5 

cos p(w) = Aw)’ (15.68c) sin p(w) = Ales? (15.68d) 
cos p(w) = nu (15.68e) sin y(w) = n (15.668 


15.3.1.2 Fourier Transformation and Inverse Transformation 


1. Definition of the Fourier Transformation 
The Fourier transformation is an integral transformation of the form (15.1a), which comes from the 
Fourier integral (15.64b) by substituting 
-ο0 
F(w) = / e" f(7) dr. (15.69) 
The following relation is valid between the real original function f(t) and the usually complex transform 
F(w): 
iP. 
/ elt Pw) dus. (15.70) 


t= 
ΠΩ 

—oo 
In the brief notation one uses F: 

+oo 
F(w) =F{ f()} = / e f(t) dt. (15.71) 
τος 

The original function f(t) is Fourier transformable if the integral (15.69), i-e., an improper integral with 
the parameter w, exists. If the Fourier integral does not exist as an ordinary improper integral, then it 
is considered as the Cauchy principal value (see 8.2.3.3, 1., p. 510). The transform F'(w) is also called 
the Fourier transform; it is bounded, continuous, and it tends to zero for |w| — oo: 


lim F(w) =0. (15.72) 


|w|—co 


The existence and boundedness of F (w) follow directly from the obvious inequality 


IES f Ode f irat. (15.73) 


The existence of the Fourier transform is a sufficient condition for the continuity of F (w) and for the 
properties F (w) —> 0 for |w| > oo. This statement is often used in the following form: If the function 
f (t) in (—oo, oo) is absolutely integrable, then its Fourier transform is a continuous function of v, and 
(15.72) holds. 

The following functions are not Fourier transformable: Constant functions, arbitrary periodic func- 
tions (e.g., sin wt, cos wt), power functions, polynomials, exponential functions (e.g., e® , hyperbolic 
functions). 
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2. Fourier Cosine and Fourier Sine Transformation 

In the Fourier transformation (15.71), the integrand can be decomposed into a sine and a cosine part. 
So, one gets the sine and the cosine Fourier transformation. 

1. Fourier Sine Transformation 


E) - X f()- / f(t) sin (wt) dt. (15.74a) 
0 
2. Fourier Cosine Transformation 
Few) = Fel £0} = f F(0) cos (wt) at (15.74b) 
0 


3. Conversion Formulas Between the Fourier sine (15.74a) and the Fourier cosine transformation 
(15.74b) on one hand, and the Fourier transformation (15.71) on the other hand, the following relations 
are valid: 


F(w) =F{ fO)} = Fel f() + f(-t) } — Fst FQ) — F(t) h}, (15.75a) 
F,(w) = TF f(lt\)signt},  (15.75b) Fw) = STU) ]. (15.56) 


For an even or for an odd function f(t) the following representations hold: 
f(t) even: F{ f(t) } — 27:4 F(t)}, ] 
F(t) odd: Ff f(t) } =—2iF sf FO) }- 


3. Exponential Fourier Transformation 
Differently from the definition of F'(w) in (15.71), the transform 


+00 
Flu) = FASO} = 5 f erdt (15.76) 


is called the exponential Fourier transformation, so that 
F(w) = 2F.(—w). (15.77) 


4. Tables of the Fourier Transformation 

Based on formulas (15.75a,b,c) one either does not need special tables for the corresponding Fourier 
sine and Fourier cosine transformations, or one uses tables for Fourier sine and Fourier cosine transfor- 
mations and calculates F (w) with the help of (15.75a,b,c). In Table 21.14.1 (see p. 1114) and Table 
21.14.2 (see p. 1120) the Fourier sine transforms F,(w), the Fourier cosine transforms F,(w) respec- 
tively, in Table 21.14.3 (sce p. 1125) for some functions the Fourier transform F(w) and in Table 
21.14.4 (see p. 1127) the exponential transform F.(w) are given. 


W The function of the unipolar rectangular impulse f(t) = 1 for |t| < to, f(t) = 0 for |t| » to (A.1) 

(Fig. 15.21) satisfies the assumptions of the existence of the Fourier integral (15.64a). According to 
1 stto 9 ] stto 

(15.65c,d) the coefficients are a(w) = =f cos wtdt = — sin w to and b(w) = =f sin wtdt = 0 
T J—to TW T J—to 


2 [9 si to cos w t 
(A.2) and so from (15.65b) follows f(t) = = " αμα κο. 
7 JO ω 


5. Spectral Interpretation of the Fourier Transformation 

Analogously to the Fourier series of a periodic function, the Fourier integral for a non-periodic function 
has a simple physical interpretation. A function f(t), for which the Fourier integral exists, can be rep- 
resented according to (15.66) and (15.67) as a sum of sinusoidal vibrations with continuously changing 
frequency w in the form 
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A(w) dw sin [wt + y(w)], (15.78a) A(w) dw cos [wt 4- φ(ω)]. (15.78b) 


The expression A(w) dw gives the amplitude of the wave components and y(w) and y(w) are the phases. 
The same interpretation holds for the complex formulation: The function f(t) is a sum (or integral) of 
summands depending on w of the form 


- gj du e*t, (15.79) 
2m 


1 
where the quantity οτε (ο) also determines the amplitude and the phase of all the parts. 
π 


This spectral interpretation of the Fourier integral and the Fourier transformation has a big advantage 
in applications in physics and engineering. The transform 


F(w) =|F(w)|e* or Fw) =|F(w)| i? (15.80a) 
is called the spectrum or frequency spectrum of the function f(t), the quantity 
|F(w)| 2 7 A(w) (15.80b) 


is the amplitude spectrum and p(w) and v (w) are the phase spectra of the function f(t). The relation 
between the spectrum F(w) and the coefficients (15.65c,d) is 


F(w) = n[a(w) — ib(w) ], (15.81) 
from which one gets the following statements: 
1. If f(t) is a real function, then the amplitude spectrum |F(w)| is an even function of w, and the phase 
spectrum is an odd function of w. 
2. If f (t) is a real and even function, then its spectrum F(uw) is real, and if f(t) is real and odd, then 
the spectrum F(w) is imaginary. 








|F(o) | 
2t; 
2 
oO 
> > 
t -3T -27 -π 0| π 2π 3n ot, 
Figure 15.21 Figure 15.22 


W Substituting the result (A.2) for the unipolar rectangular impulse function on p. 786 into (15.81), 
then one gets for the transform F(w) and for the amplitude spectrum |F(w)| (Fig. 15.22) 

sin wt sin wt 

0 F(u)| 2 2| ο 


Ρίω) -- Ξ{{Θ}- παίω) - 25-55 (Α3), 








(A.4). The points of contact of 








the amplitude spectrum |F'(w)| with the hyperbola — are at wtg = «(2n + D; 
w 





(n=0,1,2,...). 


15.3.1.3 Rules of Calculation with the Fourier Transformation 
As it has been already pointed out for the Laplace transformation, the rules of calculation with integral 
transformations mean the mappings of certain operations in the original space into operations in the 
image space. Supposing that both functions f(t) and g(t) are absolutely integrable in the interval 
(—oo, 00) and their Fourier transforms are 

F(w) = F{ f(t)} and G(w) = F{ g(t) } (15.82) 


then the following rules are valid. 
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1. Addition or Linearity Laws 
If a and £ are two coefficients from (—oo, oo), then: 

F{af(t) - Bg(t) ) 2 aF(w) -- BG(w). (15.83 
2. Similarity Law 
For real a Æ 0, 

F{ f(t/a) } = |a| F(aw). (15.84 
3. Shifting Theorem 
For real a ¥ 0 and real β, 


F{ flat + B)} = (1/Jo|) &*/*F (w/a) or (15.85a 
F{ f(t — to) } 9 e^ F(u). (15.85b 

If to is replaced by —to in (15.85b), then 
Ff f(t +to) } =e’ F(w). (15.85¢ 


4. Frequency-Shift Theorem 
For real a > 0 and 8 € (—co, 00), 
Fe? f(ot) Y — (1/0)F((w — 8B)/o) or (15.86a 
F{ elo f(t) } = F(w — wy). (15.86b 
5. Differentiation in the Image Space 
If the function t" f (t) is absolutely integrable in (—oo, 00), then the Fourier transform of the function 





f (t) has n continuous derivatives, which can be determined for k — 1,2,...,n as 
dF TP Bru x ΤΈΣ 
TOL f P pono] ao car f emat (15.870 
where 
dF 
lim p =0. (15.87b 


wotoo dwf 
With the above assumptions these relations imply that 


ref) e o ΤΩ) 


6. Differentiation in the Original Space 


(15.87¢ 





1. First Derivative Ifa function f(t) is continuous and absolutely integrable in (—oo, 00) and i 
tends to zero for t — coo, and the derivative f'(t) exists everywhere except, maybe, at certain points, 
and this derivative is absolutely integrable in (—oo, 00), then 

Fi f'(t)} =wF{ f(t}. (15.88 
2. n-th Derivative Ifthe requirements of the theorem for the first derivative are valid for all deriva- 
tives up to f@-), then 


FAFO} = F FE (15.88b 


These rules of differentiation will be used in the solution of differential equations (see 15.3.2, p. 791). 








7. Integration in the Image Space 


/ F(w) dw = i[G(a2) — G(o4)] with G(w) — F{g(t)} and g(t) = a. (15.89) 
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8. Integration in the Original Space and the Parseval Formula 
1. Integration Theorem If the assumption 


coo t 
/ J(t)dt 20 (15.904) is fulfilled, then F | j f(t) dt | = “πίω, (15.905) 
3 iw 


2. Parseval Formula If the function f(t) and its square are integrable in the interval (—oo, 00), 
then 
+00 


/ ωρα { LF(w)P dw. (15.91) 


9. Convolution 
The two-sided convolution 


AO * fal) = f Aht- T)ar (15.92) 


is considered in the interval (—oo, 00) and it exists under the assumptions that the functions f1(t) and 
f(t) are absolutely integrable in the interval (—oo, 00). If fi(¢) and f2(t) both vanish for t < 0, then 
one gets the one-sided convolution from (15.92) 


filt) * falt) = [ Athe - n dr for ¢20, (15.93) 


0 for t « 0. 


So, it is a special case of the two-sided convolution. While the Fourier transformation uses the two- 
sided convolution, the Laplace transformation uses the one-sided convolution. 


For the Fourier transformation of a two-sided convolution 


F{ filt)  ftt)) - FURQO Fi RE) (15.94) 
holds, if both integrals 


+00 tög 
/ |f dt and / | fa(t)|? dt (15.95) 
exist, i.e., the functions and their squares are integrable in the interval (—oo, 00). 
+0 
W Calculation of the two-sided convolution y(t) = f(t) * f(t) = I f(T)f(t— T)dr (A.1) for the 


function of the unipolar rectangular impulse function (A.1) in 15.3.1.2, 4., p. 786. 


t t+ti 
Since w(t) = ie f(t—T)dr — Í i f(T)dr (A.2) one gets for t < —2to and t > 2to, Y(t) = 0 and 
J—to Jt—to 
tH 
for-2t «t€ 0, ψίϑ- / * dr =t+2%ty. (A.3) 
J to 


t 
Analogously, for 0 < t < 2to: w(t) = [. dr = —t + 2tọ (A.4) holds. 
Jt-to 


Altogether, for this convolution (Fig. 15.23) 
t+ 2tg for —2tg < t < 0, 
w(t) = f(t) * f(t) = —t+2ty for O<t< 2%, (A.5) 
0 for |t| > 2to 
follows. For the Fourier transform F(w) of the unipolar rectangular impulse (A.1) (see p. 786 and 
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Fig. 15.21) V(w) = F{ V(t) } — F(f(t) * f(0) - [FG)P —4 


amplitude spectrum of the function f(t) |F(w)| = 2 


w(t) 






2u 0| 2 


Figure 15.23 


2 
sinw t 
LM (A.6) follows and for the 
ω 
sin w to | 


(A.7) holds. 





e(t) 





Figure 15.24 


10. Comparing the Fourier and Laplace Transformations 

There is a strong relation between the Fourier and Laplace transformation, since the Fourier trans- 
formation is a special case of the Laplace transformation with p = iw. Consequently, every Fourier 
transformable function is also Laplace transformable, while the reverse statement is not valid for every 
f(t). Table 15.2 contains comparisons of several properties of both integral transformations. 


Table 15.2 Comparison of the properties of the Fourier and the Laplace transformation 





Fourier transformation 


Laplace transformation 





Foo, 
Fw) =F{f)}= f ew fide 

—oo 
w is real, it has a physical meaning, e.g., 
frequency. 


F(p) = Lf f(t),p} = r e»t f(t) dt 


pis complex, p = r + ix. 





One shifting theorem. 


Two shifting theorems. 





interval: (—oo, +00) 

Solution of differential equations, problems de- 
scribed by two-sided domain, e.g., the wave 
equation. 


interval: [0, oo) 

Solution of differential equations, problems de- 
scribed by one-sided domain, e.g., the heat con- 
duction equation. 





Differentiation law contains no initial values. 


Differentiation law contains initial values. 





Convergence of the Fourier integral depends only 


on f(t). 


Convergence of the Laplace integral can be im- 
proved by the factor e™™. 





It satisfies the two-sided convolution law. 








It satisfies the one-sided convolution law. 








15.3.1.4 Transforms of Special Functions 
E A: Which image function belongs to the original function f(t) = e~*"!, Rea » 0 (A.1)? Con- 





+A 
sidering that |t| = —t for t < 0 and |t| = t for t > 0 with (15.71) one gets: Ϊ g totdi = 
J-A 
Α ο ο tA ο” ο 
Γ g ecu a [^ e eta e © =e - = = Ete e £ = (A.2). 
-A 0 iw—a |a iw+a jo iv—a lw +a 





Since 
2a 


a? + w? 





(A.3). 


B D: Which image function belongs to the original function f(t) = e7 





e^?^| 2 e-^F*? and Rea > 0, the limit of (A2) exists for A > oo, so that F(w) = F{ el} = 


αἱ Rea > 0? The function is 
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A, 
not Fourier transformable, since the limit of / e tat dt does not exist for A > oo. 
J-A 


W C: Determination of the Fourier transform of the bipolar rectangular impulse function (Fig. 15.24) 
1 for —2tg « t « 0, 
—1 for 0<t<2toọ, (C.1) 
0 for |t| >2to, 
where y(t) can be expressed by using equation (A.1) given for the unipolar rectangular impulse on 
p. 786. There is y(t) = f(t + to) — f(t — to) (C.2). With the Fourier transformation according to 
(15.85b, 15.85c) one gets B(w) = F{ p(t) } = #8 F(w)—e™tF(w), (C.3) from which, using (A.1), 
P M^ 
Hw) = (to ewt) 2 Sin wto — 4,982 w to 
w 
W D: Image function of a damped oscillation: The damped oscillation represented in Fig. 15.25a is 
0 for t « 0, 
e-“ cos wot for t > 0. 


p(t) = 





(C.4) follows. 


given by the function f(t) = { 


To simplify the calculations, the Fourier transformation is calculated with the complex function f* (t) = 
ο αωο)έ with f(t) = Re (f*(t)). The Fourier transformation gives 
eH ei(w—wo)t 1 


F{f*(t)} = [ e irte corio) qu [ εἰ ο ωτωολέ 1 -- = B 


—a + i(wo — w) fo a — iwọ — w) 


oo 





a + i(wo — w) 


a? + (w — wo)? 
Fif(t)- a uci (Fig. 15.25b). A damped oscillation in the time domain corresponds to 


a unique peak in the frequency domain. 


The result is the Lorentz or Breit-Wigner curve (see also 2.11.2, p. 95) 


α 


y(t) 





Figure 15.25 Figure 15.26 


15.3.2 Solution of Differential Equations using the Fourier 


Transformation 


Analogously to Laplace transformation, an important field of application of the Fourier transformation 
is the solution of differential equations, since these equations can be transformed by the integral trans- 
formation into a simple form. In the case of ordinary differential equations one gets algebraic equations, 
in the case of partial differential equations one gets ordinary differential equations. 
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15.3.2.1 Ordinary Linear Differential Equations 
The differential equation 

. 1 for [| < to, 

y! = j. L) = 2 ^ a 

y (t) +ay(t) = f(t) with f(t) { 0 for ltl > to, (15.96a) 

i.e., with the function f(t) of Fig. 15.21, is transformed by the Fourier transformation 

F{y(t)} 9 Y(w) (15.96b) 


into the algebraic equation 





SEA 2sin wt το » sin wto . 
iwY +aY =", (15.960) giving Y(w)= Paca (15.96d) 
The inverse transformation gives 
Too ; 
. = sin wto 1 et sin wto 
y(t) 2 F4Y(u)) 2 7112 =- f d 15.96e 
ye) Ur { xui πλ w(a+ iw) ( e) 
Rud 0 for —oo « t € —tg, 
1 
En — ealttto) r —tr <t< 
ee [1--ε ] for —ty <t < +o, (5.965 
1 


B [6555 = een] fo tg«t«oo. 
a 


Function (15.96f) is represented graphically in Fig. 15.26. 
15.3.2.2 Partial Differential Equations 


1. General Remarks 

The solution of a partial differential equation is a function of at least two variables: u = u(x,t). As the 
Fourier transformation is an integration with respect to only one variable, the other variable is consid- 
ered a constant during the transformation. Here the variable x is kept constant and the transformation 
is to be performed with respect to t: 


+00 
F{u(x,t)} = I e tu(a, t) dt = U(a,w). (15.97) 
—oo 
During the transformation of the derivatives the variable x is again kept constant: 
OC ux, t 
[nu 


'The differentiation with respect to x is supposed to be interchangeable with the Fourier integral: 


\ = (iw)"F{ u(x,t) } = (iw)"U (2, w). (15.98) 


Ox 


In this way an ordinary differential equation is obtained in the image space. Furthermore the boundary 
and initial conditions are to be transformed into the image space. 


Ou(a, t) ὃ . em 9 - 
7| — 4 } = asi u(x,t)) — az U 59). (15.99) 


2. Solution of the One-Dimensional Wave Equation for a Homogeneous Medium 


1. Formulation of the Problem The one-dimensional wave equation with vanishing perturbation 
term and for a homogeneous medium is: 


Ure — Ure = 0. (15.100a) 
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Like the three-dimensional wave equation (see 9.2.3.2,p. 590), the equation (15.100a) is a partial dif- 


ferential equation of hyperbolic type. The Cauchy problem is correctly defined by the following initia 





conditions 

u(x,0) = f(z) (-co<a<oo), u(x,0)= g(x) (0<t < oo). (15.100b 
2. Fourier Transformation The Fourier transformation is to be performed with respect to z where 
the time coordinate is kept constant: 

F(u(z,t)) — U(w,t). (15.101a 
One gets: 

2 

wU (w, t) — ΠΠ =0 with (15.101b 

F{ u(x,0)} = U(w,0) = F{ f(x) } = F(w), (15.101c 

F{uz(x,0)} = U'(w, 0) = F{ g(x) } = G(w). (15.101d 

wU +U" =0. (15.101e 

The result is an ordinary differential equation with respect to t with the parameter w of the transform. 
The general solution of this known differential equation with constant coefficients is 

U(w,t) = Ciel”? + Coe“*. (15.102a 
Determining the constants C; and C» from the initial values 

U(w,0) = C1 + C2 = F(w), U'(w,0) 2i» C1 —iv Co — G(w), (15.102b 
gives 

1 1 1 1 
Ci = =[F(w)+—Giw)], Co= =| Fw) - —Gw)]. (15.102c 
2 iw 2 iw 

The solution is therefore 

Ulu, t) = FF) + 2G) Je" + SL Fle) — Go) e. (15.1024 

2 2 iw 

3. Inverse Transformation Using the shifting theorem 

F{ flax +b)}=1/a- e*/*F(w/a), (15.103a 
for the inverse transformation of F(w), yields 

FUfe®'r(w)} = flat+t), Fle Ρ(ω)]-- [ία -- 0. (15.103b 


Applying the integration a 


FÍ | os r= pt) gives (15.103c 


FS Gq) = fe 1 G(u)e*t) dr — fo g(t +t)drT= o (15.103d 


after substituting T + t = z. Analogously to the previous integral 
rm—t 
ge '(- 560 y pun -- f sd: (15. 
follows. Finally, the solution in the original space is 
att 





1036) 


u(z,t) — af f(e +t) + ge-0 [οὐ )dz. (15.104) 
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15.4 Z-Transformation 


In natural sciences and also in engineering one often has to distinguish between continuous and discrete 
processes. While continuous processes can be described by differential equations, the discrete pro- 
cesses result mostly in difference equations. The solution of differential equations mostly uses Fourier 
and Laplace transformations, however, to solve difference equations other operator methods have been 
developed. The best known method is the z-transformation, which is closely related to the Laplace 
transformation. 


15.4.1 Properties of the Z-Transformation 
15.4.1.1 Discrete Functions 


Ifa function f(t) (0 < t < œœ) is known only at discrete values 
tn = nT (n = 0,1,2,...; T > 0 is a constant) of the argu- 
ment, then one writes f(nT) = fn and forms the sequence 
{fn}. Such a sequence is produced, e.g., in electrotechnics by 
“scanning” a function f(t) at discrete time periods tn. Its rep- 
resentation results in a step function (Fig. 15.27). 

















0 T2T3T ^ The sequence { f,} and the function f (nT) defined only at dis- 
crete points of the argument, which is called a discrete func- 
Figure 15.27 tion, are equivalent. 


15.4.1.2 Definition of the Z-Transformation 


1. Original Sequence and Transform 
The infinite series 


js ΣΑ (J (15.105) 


is assigned to the sequence { fn}. If this series is convergent, then the sequence { f,} is called z-transfor- 
mable, and it is denoted by 

F(z) =Z{fn}. (15.106) 
{fn} is called the original sequence, F(z) is the transform, z denotes a complex variable and F(z) is a 
complex-valued function. 
E /,—1 (n=0,1,2,...). The corresponding infinite series is 


F(z) = È o. (15.107) 


It represents a geometric series with common ratio 1/z, which is convergent if < 1 and its sum is 








> 1. Therefore, the sequence (1) is z-transformable for |-| « 1, 


F(z)— πι It is divergent for 

















i.e., for every exterior point of the unit circle |z| — 1 in the z plane. 

2. Properties 

Since the transform F'(z) according to (15.105) is a power series of the complex variable 1/2, the prop- 
erties of the complex power series (see 14.3.1.3, p. 750) imply the following results: 

a) For a z-transformable sequence { f,,}, there exists a real number R such that the series (15.105) is 
absolutely convergent for |z| > 1/R and divergent for |z| < 1/R. The series is uniformly convergent 
for |z| > 1/Ro > 1/R. Ris the radius of convergence of the power series (15.105) of 1/z. If the series 
is convergent for every |z| > 0, then R = oo. For non z-transformable sequences there is R = 0. 

b) If {f,} is z-transformable for |z| > 1/R, then the corresponding transform F(z) is an analytic func- 
tion for |z| > 1/R and it is the unique transform of { f,,}. Conversely, if F(z) is an analytic function 


15.4 Z-Transformation 795 





for |z| > 1/R and is regular also at z = oo, then there is a unique original sequence { f,} for F(z). 
Here, F(z) is called regular at z — oo, if F(z) has a power series expansion in the form (15.105) and 
F(co) = fo. 

3. Limit Theorems 

Analogously to the limit properties of the Laplace transformation ((15.7b), p. 770), the following limit 
theorems are valid for the z-transformation: 

a) If F(z) = Z{ fn} exists, then 








fo = lim F(z). (15.108) 
Here z can tend to infinity along the real axis or along any other path. Since the series 
1 1 
z{F(2)- fo} = fi foe fa er (15.109) 
1 1 1 
z {F(2) — fo — A) = fat far T fa denetg (15.110) 











are obviously z transforms, analogously to (15.108) one gets 


1 

fı = lim 2{F(2) — fo}, fo= Jim 2 [FG) —fo- As} ΗΕ (15.111) 
The original sequence { fn} can be determined from its transform F(z) in this way. 
b) If lim fn exists, then 

Jim fn = lim (2 — 1)F(z). (15.112) 
However the value lim fn from (15.112) can be determined only if its existence is guaranteed, since the 
above statement is not reversible. 
B /,—(-1" (n=0,1,2,...). Then Z{f,} = 
not exist. 
15.4.1.3 Rules of Calculations 


In applications of the z-transformation it is very important to know how certain operations defined on 
the original sequences affect the transforms, and conversely. For the sake of simplicity here the notation 
F(z) 2 Z(f,) for |z| » 1/R is used. 


1. Translation 


2 7 z 
1 and lim (— Ὁ LI 








— 0, but lim (—1)" does 
2 noo 


Forward and backward translations are distinguished. 
1. First Shifting Theorem: Z{f,.}=2*F(z)  (k=0,1,2,...), (15.113) 
here f, , — 0 is defined for n — k « 0. 


k-1 ν 
2. Second Shifting Theorem: Z(/,,4) = 2" [Fe -5f (3) | (k —1,2,...). (15.114) 
v=0 d 


2. Summation 


1 
For |z| > ma: (1, =) 
r |z| > max R 


z [£ η) = FO. (15.115) 


796 16. Integral Transformations 





3. Differences 
For the differences 


Afa = fati— fa, A™fn=A(A™ fn)  (m=1,2,...; A°fa = fn) (15.116) 
the following equalities hold 
Z{Afn} = (2-1F(z)-zfo, 
Z{A? fa} = (2-1)? F(z) — 2(2 -1) fo —zAfo, 
: = : (15.117) 
k-1 
Z{A" fa} = (2-1)FF(z)-—2z E (2—1) 1A" fy. 
v=0 
4. Damping lA 
For an arbitrary complex number \ # 0 and |z| > R 
Z{A fha} =F (=) ; (15.118) 


5. Convolution 
The convolution of two sequences { fn} and {gn} is the operation 


fn * In = Yo fo Inv (15.119) 


If the z-transformed functions Z{ f,} = F(z) for |z| > 1/R; and Z{g,} = G(z) for |z| > 1/Rg exist, 
then 


Zh fn * ga) — F(z)G(z) (15.120) 


for |z| > max (E z) . Relation (15.120) is called the convolution theorem of the z-transformation. 


Ri Ba 


It corresponds to the rules of multiplying two power series. 
6. Differentiation of the Transform 





IF (z 
Zí(nf,) = —2° a ). (15.121) 
Higher-order derivatives of F(z) can be determined by the repeated application of (15.121). 
7. Integration of the Transform 
Under the assumption fo = 0, 
nl _ fF 
s {5 - | "Qa. (15.122) 
T J 


15.4.1.4 Relation to the Laplace Transformation 
Describing a discrete function f(t) (see 15.4.1.1, p. 794) as a step function, then 
f(t) 2 f(nT) 5 f. for nT <t<(n4+1)T (n-0,,2,...; T»0, T const) (15.123) 
holds. Using the Laplace transformation (see 15.2.1.1, 1., p. 770) for this piecewise constant function, 
for T — 1 yields: 
oo Mtl 


ο ΙΟ) F9) = È | re tdt= 5h 


The infinite series in (15.124) is called the discrete Laplace transformation and is denoted by D: 


DEF} = Ῥ{4}- ΣΟ διε’. (15.125) 


ene — e—(n+l)p ise τ» co 


2 fre. (15.124) 





p 
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After the substitution of e? = z in (15.125) D{f,} represents a series with powers of 1/z, which is 
a so-called Laurent series (see 14.3.4, p. 752). The substitution e? = z suggested the name of the z 
transformation. With this substitution from (15.124) one finally gets the following relations between 
the Laplace and z-transformation in the case of step functions: 


pF(p) = (1 = =) F(z) (15.126a) o p£(f(0) - (1 - ‘) Z{fa}. α5.1360) 


In this way the relations of z-transforms of step functions (sce Table 21.15, p. 1128) can be transformed 
into relations of Laplace transforms of step functions (see Table 21.13, p. 1109), and conversely. 


15.4.1.5 Inverse of the Z- Transformation 
The inverse of the z-transformation is to find the corresponding unique original sequence { f;,} from its 
transform F(z): 

Z^ (F(3) - 0.) (15.127) 
There are different possibilities for the inverse transformation. 
1. Using Tables 
If the function F(z) is not given in tables, then one can try to transform it to a function which is given 
in Table 21.15. 
2. Laurent Series of F(z) 
Using the definition (15.105), p. 794 the inverse transform can be determined directly if a series expan- 
sion of F(z) with respect to 1/z is known or if it can be determined. 


1 
3. Taylor Series of F (=) 


1 
Since F (2) is a series of increasing powers of z, from (15.105) and using the Taylor formula follows 


1 d^ 1 
= aa F (5) 

n! dz” x 
4. Application of Limit Theorems 
Using the limits (15.108) and (15.111), p. 795, the original sequence {fn} can be directly determined 
from its transform F(z). 
ο 22 
παρα 
1. Partial fraction decomposition (see 1.1.7.3, p. 15) of F(z)/z yields functions which are contained in 
Table 21.15. 





(n =0,1,2,...). (15.128) 


2Ξ0 


E F(z) 


. Using the previous four methods: 

















2 (z—-2)0—1?» 2-92. (α ΠΤ; Τ᾽ a 
2z 2z 2 
F(z) = 72 IFT zi and therefore f, =2(2"—n—1) forn> 0. 
2. By division F(z) gets a series with decreasing powers of z: 
22 1 1 1 1 1 
πλ αερα σα 5 GI IL. ει: (15.129) 








From this expression one gets fo = f1 — 0, f2 — 2, f3 — 8, f4 — 22, f; — 52, fg — 114, ..., but not a 
closed expression is obtained for the general term fy. 


1 
3. For formulating F ( ) and its required derivatives, (see (15.128)) it is advisable to consider the 
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x 
— 


partial fraction decomposition of F( 











r(2) -2 2z 2 Ρ()-ο ΝΗ 
1 | 1-232 (1-z) 1-z' S 3 B ΡΕ 
dF (-) " i i dF 5) 
dz unos mca πε ρε ge bec 
1 1 
2 2 
= 1.6. = 4 r z = 
a (1-225 (1—2) (1—2z)" m ᾿ 
ΦΡ :) er(-) 
z Sh = i ie \2/ — 48 for z=0 





from which fo, fi, fo, fs, ... are easily obtained considering (15.128). 
4. Application of the limit theorems (see 15.4.1.2, 3., p. 795) gives: 











fo = lim F(z) — lim. ΠΠ EC =0, (15.131a) 
fi = lim 2(F(z) — fo) = lim a =Q; (15.131b) 
200 2300 23 — A2? E 5z — 2 f 
„3 
fo = lim 2? GO fo fiz) = jim aT = oa ον (15.131c) 
-.. οᾱ 1 20.8 2z 2 
fs = jim 2 (τω fo hz fo A) = jim z (3 ΠΕ 5) —8, ...(15.131d) 


where the Bernoulli-l'Hospital rule is applied (see 2.1.4.8, 2., p. 56). The original sequence 1 f,) can 
be determined successively. 


15.4.2 Applications ofthe Z- Transformation 


15.4.2.1 General Solution of Linear Difference Equations 


A linear difference equation of order k with constant coefficients has the form 








OnYn+k + Ok—1Yntk—1 +++ + G2Yn42 + G1Yn41 + GoYn = Jn (n=0,1,2...). (15.132) 
Here k is a natural number. The coefficients a; (4 — 0,1,..., k) are given real or complex numbers and 
they do not depend on n. Here ag and a; are non-zero numbers. The sequence {gn} is given, and the 
sequence {yp} is to be determined. 
To determine a particular solution of (15.132) the values yo, y1,..., Yx—-1 have to be previously given. 
Then the next value y, can be determined for n = 0 from (15.132). Next one gets yx41 for n = 1 from 
Ui. Uo. e. Yr and from (15.132). In this way all values y, can be calculated recursively. However 
a general solution can be given for the values y, with the z-transformation, using the second shifting 
theorem (15.114) applied for (15.132): 


ayz* [Y(2) =y- yz me gk-iz 670] +: + az[Y (2) — yo]  aoY (2) 7 G(z). (15.133) 
Here one denotes Y (z) = Z(y,) and G(z) 2 Z(g,). Substituting ayz^ -- ay 4z^7 az ap — 


p(z), the solution of the so-called transformed equation (15.133) is 


MOL HRS ox" > ga (15.134) 


p(z) j=i+1 
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As in the case of solving linear differential equations with the Laplace transformation, there is the 
similar advantage of the z-transformation that initial values are included in the transformed equation, 
so the solution contains them automatically. The required solution {yn} = Z-!( Y (z)) follows from 
(15.134) by the inverse transformation discussed in 15.4.1.5, p. 797. 


15.4.2.2 Second-Order Difference Equations (Initial Value Problem) 
The linear second-order difference equation has the form 

Yn+2 + A1Yn+1 + A0Yn = Yn, (15.135) 
where yo and y; are given as initial values. Using the second shifting theorem for (15.135) the trans- 
formed equation is 

1 

2? [Y (z) — yo — dp aiz[Y (z) — yo] + aoY (z) = G(z). (15.136) 

Substituting z? + aız + ao = p(z), the transform is 


1 z(z +a) z 
Glz) +0 H. 
p(z) p(2) p(z) 
If the roots of the polynomial p(z) are o4 and as, then a4 Z 0 and a» ¥ 0, otherwise ao is zero, and 
then the difference equation could be reduced to a first-order one. By partial fraction decomposition 
and applying Table 21.15 for the z-transformation one gets from 


Y(z) = (15.137) 








1 2 2 
z — ( = ) for o1 £ as, 
e Q1 — 09 NZ —03 Z — 0» 
7 = 2 
p(z) 9 for o4 — a», 
(z — 01) 
n n 
at — az 


91-92 for 
2-1 tas} tpi ae. ο ον (15.138a) 


p(z) na?! for ay = ap. 


Since po = 0, by the second shifting theorem there is 


= [s] d [35] = {Pnsa} (15.138b) 


and by the first shifting theorem 


z^ [xs] -z sl = {pp}. (15.138c) 


Substituting here p_; = 0, based on the convolution theorem one gets the original sequence with 


Yn = >> Pv-19n—v + Yo(Pn+1 + 41Pn) + Pn - (15.138d) 


v=0 


Since p_1 = po = 0, this relation and (15.138a) imply that in the case of o4 Z a it follows 








n ν-1 ν-1 n+l n+l n n n n 
ay —05 , aj? τα, α-αδὶ, , αἲ-- αν 15 
n= N uu + Yo Har Hy : (15.138e) 
v=o Q1 — ag Q1 — Q3 Q1 — 03 Q1 — a2 
This form can be further simplified, since a — —(o1 4- 03) and ag — 0402 (see the root theorems of 
Vieta, 1.6.3.1, 3., p. 44), so 
ν-1 ν-1 n-1 n—1 


al — 05 at — ay 


2 oq - a 
Un Σ Ün—v : à Joao + Y1 : (15.1386) 
v=2 








Q1 — 03 " 01 — 03 Q1 — Q2 
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In the case of ay = az similarly 
n 
Yn = >> Gn—v(v — lat? — yoao(n — 1)ot ? 4 yino 1. (15.138g) 
v=2 


In the case of second-order difference equations the inverse transformation of the transform Y (z) can 
be performed without partial fraction decomposition using correspondences such as, e.g., 


zg- { z \ Lama sinh bn (15.139) 





2? — 2az cosh b 4- a? sinh n 
and the second shifting theorem. By substituting a; — —2a cosh b, and ag — a? the original sequence 
of (15.137) becomes: 
1 - v—2 -; nos n-1l.: 
τη ΣΟ θα. να” 7 sinh(v — 1)b — yoa" sinh(n — 1)b + ya” sinh nb| . (15.140) 
sin = 


This formula is useful in numerical computations especially if ay and a; are complex numbers. 
Remark: Notice that the hyperbolic functions are also defined for complex variables. 


15.4.2.3 Second-Order Difference Equations (Boundary Value Problem) 


It often happens in applications that the values y, of a difference equation are needed only for a finite 
number of indices 0 € n € N. In the case of a second-order difference equation (15.135) both boundary 
values yo and yy are usually given. To solve this boundary value problem one starts with the solution 
(15.138f) of the corresponding initial value problem, where instead of the unknown value y; it is to 
introduce yy. Substituting n = N into (15.138f), y; can be obtained which depends on yo and yy: 
N 
y= do yoao(o1 | — a2) -- yy (a1 — 03) — Σία -αΐ-Ἴθν.ν]. (5.141) 
η = 


Substituting this value into (15.138f) 


1 ^ ( ν-1 En 1 at uni ay ( ν-1 vol) 
Yn = —— (ot -a -ν-------π--- att -a - 
Yn Q1 — ag νο 1 2 Une Q1 — ag a? = αἴ νο 1 2 9u-v 
1 
HUN [uo(a o — oa) -- yx (o — at]. (15.142) 
1 —03 


The solution (15.142) makes sense only if a’ — ag’ 4 0 holds. Otherwise, the boundary value prob- 
lem has no general solution, but analogously to the boundary value problems of differential equations 
eigenvalues and eigenfunctions emerge. 


15.5 Wavelet Transformation 
15.5.1 Signals 


If a physical object emits an effect which spreads out and can be described mathematically, e.g., by a 
function or a number sequence, then it is called a signal. 

Signal analysis means to characterize a signal by a quantity that is typical for the signal. This means 
mathematically: The function or the number sequence, which describes the signal, will be mapped into 
another function or number sequence, from which the typical properties of the signal can be clearly 
seen. For such mappings, of course, some informations can also be lost. 

The reverse operation of signal analysis, i.e., the reconstruction of the original signal, is called signal 
synthesis. 

The connection between signal analysis and signal synthesis can be well represented by an example 
of Fourier transformation: A signal f(t) (t denotes time) is characterized by the frequency w. Then, 
formula (15.143a) describes the signal analysis, and formula (15.143b) describes the signal synthesis: 
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F(w) = i e ™ F(t) dt (15.143a) and f(t) = = / et F (w) dw. (15.143b) 


15.5.2 Wavelets 


The Fourier transformation has no localization property, i.e., if a signal changes at one position, then 
the transform changes everywhere without the possibility that the position of the change could be 
recognized “at a glance”. The basis of this fact is that the Fourier transformation decomposes a signal 
into plane waves. These are described by trigonometric functions, which oscillate with the same period 
for arbitrary long time. However, for wavelet transformations there is an almost freely chosen function 
ib, the wavelet (small localized wave), that is shifted and compressed for analysing a signal. 

Examples are the Haar wavelet (Fig. 15.28a) and the Mexican hat (Fig. 15.28b). 


B A Haar wavelet: y(t) A y(t) 
1 df 0Oxr«l, 133 1 
w=4-1 if lzzrzl, (15.144) |a 


0 otherwise. m 
0 V t 


2 


WB B Mexican hat: | 0| 1 t 
: lt ae 
dues ts 2/2 (15.145) i = » 
NH a 
— (1— a?2)e-*/2, (15.146) 


Figure 15.28 


Generally, it holds that every function ~ comes into consideration as a wavelet if it is quadratically 
integrable and its Fourier transform WV(w) according to (15.143a) results in a positive finite integral 


πω) 


Ιω 








dw. (15.147) 


Concerning wavelets, the following properties and definitions are essential: 
1. For the mean value of the wavelet: 


i: w(t) dt = 0. (15.148) 
2. The following integral is called the k-th moment of a wavelet w: 
i= J tolt) dt. (15.149) 


The smallest positive integer n such that 4n Æ 0, is called the order of the wavelet v. 
W For the Haar wavelet (15.144), n — 1, and for the Mexican hat (15.146), n — 2. 


3. When up = 0 for every k, w has infinite order. Wavelets with bounded support always have finite 
order. 


4. A wavelet of order n is orthogonal to every polynomial of degree € n — 1. 


15.5.3 Wavelet Transformation 


For a wavelet y(t) a family of curves can be formed with parameter a: 


p=) (απ). (15.150) 





802 15. Integral Transformations 





In the case of |a| > 0 the initial function v(t) is compressed. In the case of a « 0 there is an additional 
reflection. The factor 1/,/|a] is a scaling factor. 


The functions w(t) can also be shifted by a second parameter b. Then a two-parameter family of curves 
arises: 








ος «(52 (a,breal; a #0). (15.151) 


la| a 
The real shifting parameter b characterizes the first moment, while parameter a gives the deviation of 
the function Ya p(t). The function Ya p(t) is called a basis function in connection to the wavelet trans- 
formation. 
The wavelet transformation of a function f(t) is defined as: 








£f.) ^ c f FOvaal)at= —— f OW (=) dt. (15.1522) 


y lal ce j 


For the inverse transformation: 


fies / / Eu fF Usa) 3s da db. (15.152b) 


—00 —oo 
Here c is a constant dependent on the special wavelet v. 
W Using the Haar wavelets (15.146) gives 


a 1 if b<t<b+a/2, 
«( )- —1 if b+a/2<t<b+a, 





a 0 otherwise 


and therefore 


1 b+a/2 b+a 
Lyf(a,b) = Ja (/ f(t) dt - [fe a) 


2 pora/2 2 pb+a 
Z “a a E aja- a eun f a) (15.153) 


The value £, f (a, b) given in (15.153) represents the difference of the mean values of a function f(t) 


a 
over two neighboring intervals of length 3 connected at the point b. 


Remarks: 
1. The dyadic wavelet transformation has an important role in applications. As basis functions are 
used the functions 


1 t— 2j 
wO = Feo (SP), (15.154) 
i.e., different basis functions can be generated from one wavelet (t) by doubling or halving the width 
and shifting by an integer multiple of the width. 





2. A wavelet v(t) is called an orthogonal wavelet, if the basis functions given in (15.154) form an 
orthogonal system. 

3. The Daubechies wavelets have especially good numerical properties. They are orthogonal wavelets 
with compact support, i.e., they are different from zero only on a bounded subset of the time scale. 
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They do not have a closed form representation (see [15.9]). 


15.5.4 Discrete Wavelet Transformation 
15.5.4.1 Fast Wavelet Transformation 


The integral representation (15.152b) is very redundant, and so the double integral can be replaced 
by a double sum without loss of information. Considering this idea at the concrete application of the 
wavelet transformation one needs 

1. an efficient algorithm of the transformation, which leads to the concept of multi-scale analysis, and 
2. an efficient algorithm of the inverse transformation, i.e., an efficient way to reconstruct signals from 
their wavelet transformations, which leads to the concept of frames. 

For more details about these concepts see [15.9], [15.1]. 

Remark: The great success of wavelets in many different applications, such as 

e calculation of physical quantities from measured sequences 

e pattern and voice recognition 

e data compression in news transmission 

is based on “fast algorithms”. Analogously to the FFT (Fast Fourier Transformation, see 19.6.4.2, 
p. 993) one talks here about FWT (Fast Wavelet Transformation). 


15.5.4.2 Discrete Haar Wavelet Transformation 


An example of a discrete wavelet transformation is the Haar wavelet transformation: The values f; (i — 
1,2,..., N) are given from a signal. The detailed values d; (i — 1,2,..., N/2) are calculated as: 


Si = Ja + fai), di= Ja — fri). (15.155) 


The values d; are to be stored while the rule (15.155) is applied to the values s;, i.e., in (15.155) the 
values f; are replaced by the values s;. This procedure is continued, sequentially so that finally from 


1 1 
ο n n ] n+l n n 56 


a sequence of detailed vectors is formed with components df. Every detailed vector contains infor- 
mation about the properties of the signals. 


Remark: For large values of N the discrete wavelet transformation converges to the integral wavelet 
transformation (15.152a). 


15.5.5 Gabor Transformation 

Time-frequency analysis is the characterization of a signal with respect to the contained frequencies 

and time periods when these frequencies appear. Therefore, the signal is divided into time segments 

(windows) and a Fourier transform is used. It is called a Windowed Fourier Transformation (WFT). 

The window function should be chosen so that a signal is con- 

sidered only in the window. Gabor applied the window function 
2 


g(t) = : e 202 (15.157) 
πσ 








(Fig. 15.29). This choice can be explained as g(t), with the 
“total unit mass”, is concentrated at the point t = 0 and the 


. width of the window can be considered as a constant (about 
Figure 15.29 20). 
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The Gabor transformation of a function f (t) then has the form 
Gf(w,s) = / f(t)g(t — λε να. (15.158) 


This determines, with which complex amplitude the dominant wave (fundamental harmonic) e^! oc- 
curs during the time interval [s — c, s + a] in f, i.e., if the frequency w occurs in this interval, then it 
has the amplitude |G f (w, s)|. 


15.6 Walsh Functions 
15.6.1 Step Functions 


Orthogonal systems of functions have an important role in the approximation theory of functions. For 
instance, special polynomials or trigonometric functions are used since they are smooth, i.e., they are 
differentiable sufficiently many times in the considered interval. However, there are problems, e.g., the 
transition of points of a rough picture, when smooth functions are not suitable for the mathematical 
description, but step functions, piecewise constant functions are more appropriate. Walsh functions 
are very simple step functions. They take only two function values +1 and —1. These two function 
values correspond to two states, so the Walsh functions can be implemented by computers very easily. 


15.6.2 Walsh Systems 


Analogously to trigonometric functions also periodic step functions can be considered. The interval 
I = [0,1) is used as a period interval and it is divided into 2* equally long subintervals. Suppose Sp, 
is the set of periodic step functions with period 1 over such an interval. The different step functions 
belonging to S» can be considered as vectors of a finite dimensional vector space, since every function 


g E Sn is defined by its values go, g1, g2, . .. , g2n 1 in the subintervals and it can be considered as a 
vector: 
g” = (g0, 91, 92, - +- 92 1) (15.159) 


The Walsh functions belonging to S,, form an orthogonal basis with respect to a suitable scalar product 
in this space. The basis vectors can be enumerated in many different ways, so one can get many different 
Walsh systems, which actually contain the same functions. There are three of them which should be 
mentioned: Walsh-Kronecker functions, Walsh-Kaczmarz functions and Walsh-Paley functions. 


The Walsh transformation is constructed analogously to the Fourier transformation, where the role 
of the trigonometric functions is taken by the Walsh functions. One gets, e.g., Walsh series, Walsh 
polynomials, Walsh sine and Walsh cosine transformations, Walsh integral, and analogously to the fast 
Fourier transformation there is a Fast Walsh Transformation. For an introduction in the theory and 
applications of Walsh functions see [15.6]. 


16 Probability Theory and 
Mathematical Statistics 


When experiments or observations are made, various outcomes are possible even under the same con- 
ditions. Probability theory and statistics deal with regularity of random outcomes of certain results 
with respect to given experiments or observations. (In probability theory and statistics, observations 
are also called experiments, since they have certain outcomes.) It is supposed, at least theoretically, 
that these experiments can be repeated arbitrarily many times under the same circumstances. Its ap- 
plication have these disciplines of mathematics in the statistic assessment of mass phenomena. The 
mathematical handling of random phenomena is also summarized in the notion stochastics. 


τ 5 
16.1 Combinatorics 

From the elements of a given set often new sets, systems or sequences are composed. Depending on 
the way how to do it, one gets the notions permutation (ordering), combination (selection), and partial 
permutation or arrangement. The notion arrangement combines ordering and selection. The basic 
problem of combinatorics is to determine how many different choices or arrangements are possible with 
the given elements. 


16.1.1 Permutations 


1. Definition 
A permutation of n elements is an ordering of the n elements. 
2. Number of Permutations without Repetition 
The number of different permutations of n different elements is 

Py =n, (16.1) 
W ina classroom 16 students are seated on 16 places. There are 16! different possibilities for seating. 
3. Number of Permutations with Repetitions 


The number P,,) of different permutations of n elements containing k identical elements (k < n) is 


n! 
QI 
Pw = πι (16.2) 
W In à classroom 16 school-bags of 16 students are placed on 16 chairs. Four of them are identical. 
There are 16!/4! different placements of the school-bags. 
4. Generalization 
The number P, 1^2) of different permutations of n elements containing m different types of ele- 
ments with multiplicities ky, k2,...,km respectively (kı + k2 +... + km = n) is 


n! 
pP (Erk2rokm) — : 16.3 
E kaı!ka! . . . km! ( ) 
H Suppose one composes five-digit numbers from the digits 4, 4, 5, 5, 5. One can compose P5?) — 
5! . 
ΖΒ 10 different numbers. 


16.1.2 Combinations 
1. Definition 


A combination is a choice of k elements from n different elements not considering the order of them. 
This is called a combination of k-th order and one distinguishes between combinations with and without 
repetition. 
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2. Number of Combinations without Repetition 
The number C, ? of different possibilities to choose k elements from n different elements not consid- 
ering the order is 

Cc, = (3) with 0<k<n (sce binomial coefficient in 1.1.6.4, 3., p. 13), (16.4) 


if any element is chosen at most once. This is called a combination without repetition. 


30 eus ΙΙ 
W There are ( A = 27405 possibilities to choose an electoral board of four persons from 30 partici- 
pants. 


3. Number of Combinations with Repetition 


The number of possibilities to choose k elements from n different ones, repeating each element arbi- 
trarily times and not considering the order is 


c, 9 - ( m = Ἰ (16.5) 


In other words, the number of different selections of k elements chosen from n different elements is 
considered, where each of n can be chosen more then once. 


k+6-1\ : ; 
k ) different results are possible. Consequently, for two dice there 


H Rolling k dice, Cg = ( 


are C4?) = H — 2] different results. 


16.1.3 Arrangements 


1. Definition 
An arrangement is an ordering of k elements selected from n different ones, i.e., arrangements are 
combinations considering the order. 


2. Number of Arrangements without Repetition 


The number V, of different orderings of k different elements selected from n different ones is 





Va) = a(t) —n(n—1)(n—2)...(n—- k-1) (0€ kx n). (16.6) 
W How many different ways are there to choose a chairman, his deputy, and a first and a second assistant 


T" : : . [30 
for them from 30 participants at an election meeting? The answer is ( 4 ) 4! = 657720. 


3. Number of Arrangements with Repetition 
An ordering of k elements selected from n different ones, where any of the elements can be selected 
arbitrarily many times, is called an arrangement with repetition. Their number is 

ye = nk. (16.7) 
E A: In a soccer-toto with 12 games there are 3” different outcomes. 


B B: With the digital unit called a byte which contains 8 bits can be represented 25 — 256 different 
symbols (see for example the well-known ASCII table). 
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16.1.4 Collection of the Formulas of Combinatorics (see Table 16.1) 


Table 16.1 Collection of the formulas of combinatorics 

















Type of choice or Number of possibilities 
selection of kfrom | without repetition with repetition 
n elements (k € n) (k € n) 

n! 
Permutations P, = ni(n=k)| PM = a 

n+k—-1 
Combinations c,) = " σι = Bor 
k k 

Arrangements V,) = k! 0 Vv, = nk 





16.2 Probability Theory 


16.2.1 Event, Frequency and Probability 
16.2.1.1 Events 
1. Different Types of Events 


All the possible outcomes of an experiment are called events in probability theory, and they form the 
fundamental probability set A. 
There are to be distinguished the certain event, the impossible event and random events. 

The certain event occurs every time when the experiment is performed, the impossible event never oc- 
curs; a random event sometimes occurs, sometimes does not. All possible outcomes of the experiment 
excluding each other are called elementary events (see also Table 16.2). Here the events of the funda- 
mental probability set A are denoted by A, B,C,..., the certain event by J, the impossible event by 
O. Some operations and relations between the events are denoted as given in Table 16.2. 


2. Properties of the Operations 
The fundamental probability set forms a Boolean algebra with complement, addition, and multiplica- 


tion defined in Table 16.2, and it is called the field of events. 


1. a) 


2. a) 


A 


ΑΗ 


ΑΗ 


+B=B+A, 
-A-— A, 
+(B+C)=(A+B)+C, 





(16.8) 1. b) 
(16.10) 2. b) 
(16.12) 3. b) 
(16.14) 4. b) 
(16.16) 5. b) 
(16.18) 6. b) 
(16.20) 7. b) 
(16.22) 8. b) 


The following rules are valid: 








AB = BA. (16.9 
AA — A. (16.11 
A(BC) = (AB)C. (16.13 
AA =O (16.15 
A+BC =(A+B)(A+C). (16.17 
AB=A+B (16.19 
A=I-A. (16.21 
AB-C-(A-C)B-C) (1623 
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9.32 OCA, (16.24) 9.b) ACI. (16.25 


10. From A C B follows a) A= AB (16.26) and b B=A+BA_ and conversely. (16.27 


11. Complete System of Events: A system of events Aa (a € 0, @ is a finite or infinite set o 
indices) is called a complete system of events if the following is valid: 


11.4) AgAg=O fora # B (16.28) ad 11. b) $ A=]. (16.29 


acd 


Table 16.2 Relations between events 








Name Nota- Definition 
tion 

1. | Complementary event of A: A A occurs exactly if A does not. 

2. | Sum of events A and B: A+B | A+Bisthe event which occurs if A or B or both occur. 

3.| Product of the events A AB AB is the event which occurs exactly if both A and B 
and B: occur. 

4. | Difference of the events 4| A— B | A— B occurs exactly if A occurs and B does not. 
and B: 

5. | Event as a consequence of € A C B means that from the occurrence of A follows 
the other: the occurrence of B. 

6. Elementary or simple E From E = A + B it follows that E = Ao E = B. 
event: 

7. | Compound event: Event, which is not elementary. 

8. | Disjoint or exclusive events | AB = O | The events A and B cannot occur at the same time. 
A and B: 




















E A: Tossing two coins: Elementary events for the separate toss- 
ing: See the table on the right. 








: . : . Head | Tail | 
1. Elementary event for tossing both coins, e.g.: First coin shows I. Coin | An | An 
head, second shows tail: 411422. @ Get. | Aa, || ey 














Compound event for tossing both coins: First coin shows head: 
Ay = Ay Aa t 41422 
2. Compound event for tossing one coin, e.g., the first one: First coin shows head or tail: A11 + Ayg = I. 
Head and tail on the same coin are disjoint events: A11.419 — O. 
W B: Lifetime of light-bulbs. 
Defining the elementary events A,: the lifetime f satisfies the inequalities (n — 1)At < t < nAt (n = 
1,2,..., and At > 0, arbitrary unit of time). 
Compound event A: The lifetime is at most nAt, i.e., A = Σ Αν. 


νξι 


16.2.1.2 Frequencies and Probabilities 


1. Frequencies 

Let A be an event belonging to the field of events A of an experiment. If event A occurred n4 times 
repeating the experiment n times, then ny is called the frequency, and na/n = ha is called the relative 
frequency of the event A. The relative frequency satisfies certain properties which can be used to built 
up an axiomatic definition of the notion of the probability P(A) of event A in the field of events A. 

2. Definition of the Probability 

A real function P defined on the field of events is called a probability if it satisfies the following proper- 
ties: 
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1. For every event A € A 


0< P(A)<1, and 0<hg<l. (16.30) 
2. For the impossible event O and the certain event [ 
P(O)—0, P(I) 21, and ho=0, hy =1. (16.31) 


3. If theevents A; € A. (i — 1,2,...) are finite or countably many mutually exclusive events 
(A; Ay — O for i Æ k), then 
P(A, + Ag+ asa) — P(A1) 4- P(A3) +..., and hA Ag = ha, + ha, ees (16.32) 
Remark: The axiomatization of the probability theory, which is based on the assumption of three 
conditions of the preceding kind, succeeded in 1933 A.N. KOLMOGOROFF (see [16.11]). 


3. Rules for Probabilities 


1. BCA yields P(B) < P(A). (16.33) 
2. P(A)+P(A)=1. (16.34) 
3.a) For n mutually exclusive events A; (i — 1,...,n; A:Ax = O,i # k), holds 

P(A, + Ao  ... A4) 2 P(Ai) + P(Aa) +... + P(An). (16.35a) 
3.b) In particular for n = 2 holds 

P(A + B) = P(A) + P(B). (16.35b) 
4. a) For arbitrary events A; (¢ = 1,...,n), holds 

P(A +--+ + An) = P(A) +-+- + P(An) - P(AA3) — ++» — P(A1A;) 

P(A243) — +++ — P(A2An) — +++ — P(An-1An) 





+P(A,A2A3) + +--+ P(AyAgAn) +++: + P(An—-2An-1An) — 





+(-1)""1 P(A, Ag... An). (16.36a) 
4.b) In particular for n = 2 holds P(A; + A2) = P(A1) + P(A2) — P(A1A2). (16.36b) 
5. Equally likely events: If every event A; (i = 1,2,...,n) ofa finite complete system of events occurs 
with the same probability, then 
1 
P(A;) 2 -. 16. 
(4) = = (16.37) 

If Aisasum of m (m X n) events A; (i — 1,2,...,n) with the same probability of a complete system, 
then 

P(A) = (16.38) 
4. Examples of Probabilities 

1 

A: The probability P(A) to get a 2 rolling a fair die is: P(A) = 6 
E B: What is the probability of guessing four numbers for the lotto “6 from 49", i.c., 6 numbers are 
to be chosen from the numbers 1,2,...,49. 


If 6 numbers are drawn, then there are (Ὁ possibilities to choose 4. On the other hand there are 


49 


E = iy possibilities for the false numbers. Altogether, there are ( o 


6—4 
draw 6 numbers. Therefore, the probability P(A4) is: 


) different possibilities to 








43 
P(A) - i = ae = 0.0968 %. 
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Similarly, the probability P(As) to get a direct hit is: 


1 ; 
P(Ag) = ayy = 0.715 - 1077 = 7.15- 107° %. 
(5) 
HC: What is the probability P(A) that at least two persons have birthdays on the same day among 
k persons? (The years of birth must not be identical, and one supposes that every day has the same 
probability of being a birthday.) 
It is easier to consider the complementary event A: All the k persons have different birthdays. One 
gets: 
=, 365 365-1 365-2 365 —k +1 
P(A . . ur 
(4) 365 960 365 365 

From this it follows that 








3605 : 964 - 363 - .. . - (365 — k + 1) 
365% 
Some numerical results: Ke a = 30 ie 
P(A) | 0.117 0.411 0.507 0.706 0.994 
It is to see that the probability that among 23 and more persons at least two have the same birthday is 
greater than 50 96. 


16.2.1.3 Conditional Probability, Bayes Theorem 


1. Conditional Probability 
The probability of an event B, when it is known that some event A has already occurred, is called a 
conditional probability and it is denoted by P(B|A) or P4(B) (read: The probability that B occurs 
given that A has occurred). It is defined by 
P(AB) 
P(A) ° 
The conditional probability satisfies the following properties: 
a) If P(A) Z 0 and P(B) 4 0 holds, then 
P(B|A4)  P(A|B) 


Ρ(Α) --1-- Ρ(Α) --1 














P(B|A) — P(A) £0. (16.39) 








P(B) P(A) (16.40a) 

b) If P(A, Ag A3... An) 4 0 holds, then 

P(A145... A4) 2 P(Ai)P(A2| A)) ... P(As| Ai 42 . .. A5 1). (16.40b) 
2. Independent Events 
The events A and B are independent events if 

P(A|B) = P(A) and P(B|A) = P(B) (16.41a) 
holds. In this case, it is 

P(AB) = P(A)P(B). (16.41b) 


3. Events in a Complete System of Events 
If A is a field of events and the events B; € A with P(B;) » 0 (i — 1,2,...) form a complete system 


of events, then for an arbitrary event A € A the following formulas are valid: 
a) Total Probability Theorem 


P(A) =) P(A|Bi) P(Bi). (16.42) 
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b) Bayes Theorem with P(A) 20 


(— P(AJBy)P(Bi) i 
P(B,|A) = Y PAWBOPUS) ασε" (16.43) 


E Three machines produce the same type of product in a factory. The first one gives 20 96 of the total 
production, the second one gives 30 96 and the third one 50 96. It is known from past experience that 
5 96, 4 96, and 2 96 of the product made by each machine, respectively, are defective. Two types of 
questions often arise: 
a) What is the probability that an article selected randomly from the total production is defective? 
b) If the randomly selected article is defective, what is the probability that it was made, e.g., by the 
first machine? 
Using the following notation gives: 
e A; denotes the event that the randomly selected article is made by the i-th machine (i = 1, 2,3) with 
P(A) = 0.2, P(A2) = 0.3, P(A3) = 0.5. The events A; form a complete system of events: 
9 EX — 0, A + Ap + Az = I. 
e A denotes the event that the chosen article is defected. 
e P(A|A1) = 0.05 gives the probability that an article produced by the first machine is defective; 
analogously P(A|A2) = 0.04 and P(A|A3) = 0.02 hold. 
Now, the answers to the questions are: 
a) P(A) 2 P(A)P(A[A) + P(A2)P(Al Aa) + P(As)P(AlAs) 

= 0.2- 0.05 + 0.3 - 0.04 + 0.5 - 0.02 = 0.032. 
P(A|A1) 0.05 


—0.2— 4 — 0.31. 


b) P(A1|A) — P(A1) P(A) 0.032 


16.2.2 Random Variables, Distribution Functions 


To apply the methods of analysis in probability theory, the notions of variable and function are neces- 
sary. 


16.2.2.1 Random Variable 


A set of elementary events is to be described by a random variable X. 'The random variable X can be 
considered as a quantity which takes its values x randomly from a subset R of the real numbers. 

If R contains finitely or countably many different values, then X is called a discrete random variable. In 
the case of a continuous random variable, R can be the whole real axis or it may contain subintervals. 
For the precise definition see 16.2.2.2, 2., p. 812. There are also mixed random variables. 

W A: Assigning the values 1, 2, 3, 4to the elementary events A11, A12, A21, 422, respectively, in example 
A, p. 808, then a discrete random variable X is defined. 

W B: Thelifetime T ofarandomly selected light-bulb is a continuous random variable. The elementary 
event T' — t occurs if the lifetime T' is equal to t. 


16.2.2.2 Distribution Function 


1. Distribution Function and its Properties 
The distribution of a random variable X can be given by its distribution function 

F(r)—- P(X €x) for —oo€ x € oo. (16.44) 
It determines the probability that the random variable X takes a value between —oo and z. Its domain 
is the whole real axis. The distribution function has the following properties: 
1. F(—oo) = 0, F(+oo) =1. 
2. F(a) is a non-decreasing function of x. 
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3. F(x) is continuous on the right. 

Remarks: 

1. From the definition it follows that P(X = a) = F(a) — lim F (a). 
r—aG— 


2. In the literature, also the definition F(v) — P(X < x) is often used. In this case 
P(X =a)= lim, F(x) — F(a). 
ra 


2. Distribution Function of Discrete and Continuous Random Variables 
a) Discrete Random Variable: A discrete random variable X, which takes the values x; (i = 
1,2,...) with probabilities P(X = x;) = p; (i = 1,2,...), has the distribution function 

F(x) = Op. (16.45) 

πισω 

b) Continuous Random Variable: A random variable is called continuous if there exists a non- 
negative function f(x) such that the probability P(X € S) can be expressed as P(X € S) = fg f(x)dx 
for any domain S such that it is possible to consider an integral over it. This function is the so-called 
density function. A continuous random variable takes any given value x; with 0 probability, so one 
rather considers the probability that X takes its value from a finite interval (a, 0]: 


b 
P(ac X «5)— [ro dt. (16.46) 
A continuous random variable has an everywhere continuous distribution function: 
F(a) = P(X <2) = / f(t) dt. (16.47) 


F'(x) = f(x) holds at the points where f(x) is continuous. 

Remark: When there is no confusion about the upper integration limit, often the integration variable 
is denoted by x instead of t. 

3. Area Interpretation of the Probability, Quantile 

By introducing the distribution function and density function in (16.47), the probability P(X <x) = 
F(x) can be represented as an area between the density function f(t) and the x-axis on the interval 
—oo « t € x (Fig. 16.1a). 


f(t) f(t) 
F(x) 








oy 


0 x t 0| Xs 
a) b) 
Figure 16.1 
Often there is given (frequently in 96) a probability value a. If 
Ρ(Χ»α)-α (16.48) 
holds, the corresponding value of the abscissa x = za is called the quantile or the fractile of order a 
(Fig. 16.1b). This means the area under the density function f(t) to the right of aq is equal to a. 
Remark: In the literature, the area to the left of x, is also used for the definition of quantile. 
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In mathematical statistics, for small values of a, e.g., a = 5% or a = 1%, is also used the notion 
significance level of first type or type 1 error rate. The most often used quantile for the most important 
distributions in practice are given in tables (Table 21.16, p. 1131, to Table 21.20, p. 1138). 


16.2.2.3. Expected Value and Variance, Chebyshev Inequality 


For a coarse characterization of the distribution of a random variable X , mostly the parameters expected 
value, denoted by ji, and variance, denoted by o? are used. The expected value can be interpreted with 
the terminology of mechanics as the abscissa of the center of gravity of a surface bounded by the curve of 
the density function f(a) and the x-axis. The variance represents a measure of deviation of the random 
variable X from its expected value ju. 


1. Expected Value 
If g(X) is a function of the random variable X, then g(.X) is also a random variable. Its expected value 
or expectation is defined as: 


a) Discrete Case: E(g(X)) 2 M g(vx)py, if the series V7 |g(vy)|py. exists. (16.492) 
k k=1 
+00 es 
b) Continuous Case: E(g(X)) — J g(x) f(a) dx, if / [g(x)| f (x) dx. exists. (16.49b) 
The expected value of the random variable with g(.X) — X is defined as 
+00 
pix = E(X) 2 M map, or / uf (x) dx, (16.50a) 
k EC 


if the corresponding sum or integral with the absolute values exists. Because of (16.49a,b) 
E(aX +b) =apx+b (a,b const) (16.50b) 
is also valid. Of course, it is possible that a random variable do not have any expected value. 


2. Moments of Order n 


Furthermore one introduces: 


a) Moment of Order n: E(X”), (16.51a) 
b) Central Moment of Order n: E((X — ux)"). (16.51b) 


3. Variance and Standard Deviation 
In particular, for n — 2, the central moment is called the variance or dispersion: 


Yu τ ux) p or 


E(X — ux) 2 D(X) 2 ok — 4 + (16.52) 


if the expected values occurring in the formula exist. The quantity ox is called the standard deviation. 
The following relations are valid: 


D?(X) = 0% = E(X?) — p, D?(aX +6) = a?D?(X). (16.53) 
4. Weighted and Arithmetical Mean 
In the discrete case, the expected value is obviously the weighted mean 

E(X) =piti - ... 9 pan (16.54) 
of the values z,...,c, with the probabilities py as weights (k — 1,...,n). The probabilities for the 
uniform distribution are py = pp =... = Pn = 1/n, and E(X) is the arithmetical mean of the values 
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Tk: 
21 221... 1-2π 
n ` 


E(X) = (16.55) 


In the continuous case, the density function of the continuous uniform distribution on the finite interval 


a, 9] is 





: fora<a<b 
f(x) = δα... (16.56) 
0 otherwise, 
and it follows that 
b 
1 3. atb »  (b—a)? 
E(X) - L— f zdr = τν Oe ae (16.57) 


5. Chebyshev Inequality 
If the random variable X has the expected value ju and standard deviation a, then for arbitrary \ > 0 
the Chebyshev inequality (see 1.4.2.10, p. 31) is valid: 


P 1 
P(|X — u| > ào) < Xx (16.58) 
That is, it is very unlikely that the values of the random variable X are farther from the expected value 
js than a multiple of the standard deviation (A large). 


16.2.2.4 Multidimensional Random Variable 


If the elementary events mean that n random variables X4,..., X, take mreal values z1,..., r4, then 
a random vector X. — (X1, Xs,..., X4) is defined (see also random vector, 16.3.1.1, 4., p. 830). The 
corresponding distribution function is defined by 


P(i;oigmQ)ePXs€umegXy € 4). (16.59) 
The random vector is called continuous if there is a function f(t1,...,tn) such that 
21 Zn 
Ε(αι,...., Tn) = / θα / f(&,... t5) dt ... día (16.60) 


holds. The function f(ti,...,tn) is called the density function. It is non-negative. If some of the 
variables z1,...,x, tend to infinity, then one gets the so-called marginal distributions. Further inves- 
tigations and examples can be found in the literature. 

'The random variables X4,..., X, are independent random variables if 


F(zi...,24) 2 Fi(zi)Fo(z2)...Fu(za), f(ti,.--, tr) = Sti) Jalin) (16.61) 


16.2.3 Discrete Distributions 


1. Two-Stage Population and Urn Model 

Suppose one has a two-stage population with N elements, i.e., the population to be considered has two 
classes of elements. One class has M elements with a property A, the other one has N — M elements 
which does not have the property A. If investigating the probabilities P(A) = p and P(A) = 1 — p for 
randomly chosen elements, then one has to distinguish between two cases: When selecting n elements 
one after the other, either the previously selected element is replaced before selecting the next one, or 
itis not. The selected n elements, which contain k elements with the property A, is called the sample, 
n being the size of the sample. This can be illustrated by the urn model. 


2. Urn Model 
Suppose there are a lot of black balls and white balls in a container. The question is: What is the 
probability that among n randomly selected balls there are k black ones. If putting every chosen ball 
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back into the container after the determination of its color, then the number k of black ones among the 
chosen n balls has a binomial distribution. If one does not put back the chosen balls and n < M and 
n < N — M, then the number of black ones has a hypergeometric distribution. 


16.2.3.1 Binomial Distribution 

Suppose in an experiment only the two events A and A are possible and the experiment is repeated 
n times and the accompanying probabilities are P(A) = p and P(A) = 1 — p every time, then the 
probability that A takes place exactly k times is 





Wp (k) = (o *(1— p)-*  (k—0,1,2,...,n). (16.62) 
For every choice of an independent element from the population, the probabilities are 
M _ N-M 
P(A) = — = P(A)= Ξ]--ρςξα. 16. 
(A)= 37 =p, P(A) N p=4q (16.63) 


The probability of getting an element with property A for the first k choices, then an element with 
the remaining property A for the n — k choices is p*(1 — p)"~*, because the results of choices are 
independent of each other. The sequence of the choices plays no role because the combinations have 
the same probability independently of the order of the choices, and these events are mutually exclusive, 
so one adds 


n n! 
μαμα ομως 16.64 
(;) k!(n— k)! (16:64) 
equal numbers to get the required probability. 
A random variable X,, for which P(X„ = k) = Wp (k) holds, is called binomially distributed with 
parameters n and p. 
1. Expected Value and Variance 


E(Xn) =H =n: p, (16.65a) D?(X,) 2 o? 2 n- p(1— p). (16.65b) 


2. Approximation of the Binomial Distribution by the Normal Distribution 
If Xn has a binomial distribution, then 


ο Xn — E(X4) e 5 
E a) = vale: ) ae (16.656) 


This means that, if n is large, the binomial distribution can be well approximated by a normal distri- 
bution (see 16.2.4.1, p. 818) with parameters px = E(X4) and o? = D?(X,,), if p or 1 — p are not too 
small. The approximation is the more accurate the closer p is to 0.5 and the larger n is, but acceptable if 
np 7 4and n(1— p) 7 4 hold. For very small p or 1 — p, the approximation by the Poisson distribution 
(see (16.68) in 16.2.3.3) is uscful. 

3. Recursion Formula 

The following recursion formula is recommended for practical calculations with the binomial distribu- 
tion: 


Wr(k +1) = 





LI F, ο W(k). (16.65d) 
4. Sum of Binomially Distributed Random Variables 

If X, and X,, are both binomially distributed random variables with parameters n, p and m, p, then 
the random variable X = Xn + Xm is also binomially distributed with parameters n 4 m, p. 

Fig. 16.2a,b,c represents the distributions of three binomially distributed random variables with pa- 
rameters n — 5; p — 0.5, 0.25, and 0.1. Since the binomial coefficients are symmetric, the distribution 
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is symmetric for p = q = 0.5, and the farther p is from 0.5 the less symmetric the distribution is. 
16.2.3.2 Hypergeometric Distribution 


Just as with the binomial distribution, a two-stage population with N elements is considered, i.e., the 
population has two classes of elements. One class has M elements with a property A, the other one has 
N — M elements which does not have the property A. In contrast to the case of binomial distribution, 
the chosen ball of the urn model is not replaced before choosing the next one. 

The probability that among the n chosen balls there are k black ones is 


MY(N—M 
. : k n—k . . 
P(X = k) = Wi, (k) = MN with (16.66a) 
o) 
O<k<n, k<M, n-k<N-M. (16.66b) 


If also n € M and n € N — M hold, then the random variable X with the distribution (16.66a) is 
called hypergeometrically distributed. 
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Figure 16.2 


1. Expected Value and Variance of the Hypergeometric Distribution 
0 S — E) 
τ k n—k M : 
u= E(X) = E E: eie. (16.672) 
k 
a G — k) 
n k]Nn—k My? 
2 p B 2 ge. 2 
o? = D(X) - EQ) - IEQOP - 3 (^ (n=) 
k 


M M\ N-n : 
πι (1 — x) Mea" (16.67b) 








2. Recursion Formula 

4 (n — K)(M — k) : 

Wr, v(k +1) = ——  _W yj, n(k). 16.67c 
ink +1) = Gyo Mont ket) en) (16:676) 

In Fig. 16.3a,b,c three hypergeometric distributions are represented for the cases N = 100, M = 50, 

25 and 10, for n = 5. These cases correspond to the cases p = 0.5; 0.25, and 0.1 of Fig. 16.2a,b,c. 
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There is no significant difference between the binomial and hypergeometric distributions in these ex- 
amples. If also M and N — M are much larger than n, then the hypergeometric distribution can be 
well approximated by a binomial one with parameters as in (16.63). 


0.7 0.7 0.7 

0.6 0.6 0.6 

0.5 0.5 0.5 

04 p=0.50 04 p=0.25 04 p=0.10 
0.3 0.3 0.3 

0.2 0.2 0.2 

0.1 0.1 0.1 

0.0 0.0 i 

012345 κ 012345 κ 00012345 k 


a) b) c) 
Figure 16.3 


16.2.3.3 Poisson Distribution 


If the possible values of a random variable X are the non-negative integers with probabilities 
X 
P(X-k)- ΠΕ (k 20,1,2,...; A» 0), (16.68) 


then it has a Poisson distribution with parameter A. 


1. Expected Value and Variance of the Poisson Distribution 


E(X)=), (16.69a) D(X) =), (16.69b) 


2. Sum of Independent Poisson Distributed Random Variables 


If X, and X» are independent Poisson distributed random variables with parameters Àj and As, then 
the random variable X = X, + X» also has a Poisson distribution with parameter A — A, 4- As. 


3. Recursion Formula 
"T _ A os 
P(X =k +1) = 77 >P(X =). (16.696) 


4. Connection between Poisson and Binomial Distribution 


The Poisson distribution can be obtained as a limit of binomial distributions with parameters n and p 
if n — oo, and p (p — 0) changes with n so that np = à = const, i.e., the Poisson distribution is a 
good approximation for a binomial distribution for large n and small p with A = np. In practice, one 
uses it if p < 0.08 and n > 1500p hold, because the calculations are easier with a Poisson distribution. 
Table 21.16, p. 1131, contains numerical values for the Poisson distribution. Fig. 16.4a,b,c repre- 
sents three Poisson distributions with A — np — 2.5, 1.25 and 0.5, i.e., with parameters corresponding 
to Figs. 16.2 and 16.3. 


5. Application 


The number of independently occurring point-like discontinuities in a continuous medium can usually 
be described by a Poisson distribution, e.g., number of clients arriving in a store during a certain time 
interval; number of misprints in a book, the rate of radioactive decay, etc. 
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16.2.4 Continuous Distributions 
16.2.4.1 Normal Distribution 
1. Distribution Function and Density Function 


A random variable X has a normal distribution if its distribution function is 


(t=)? 
20? dt. (16.70a) 





T 
1 
P(X € x) 2 F(x) 2 —— fe 
πη, 
-σο 
Then it is also called a normal variable, and the distribution is called a (p, o?) normal distribution. The 
function 
f(t) E" (16.70b) 
= e 2c (UD 
cv 2n 
is the density function of the normal distribution. It takes its maximum at t = p and it has inflection 
points at u + ø (see (2.59), p. 73, and Fig. 16.5a). 








p(t) 
(x) 








=y 
=Y 


u 


0| u-0 m 
a) b) 


Figure 16.5 


2. Expected Value and Variance 
The parameters u and g? of the normal distribution are its expected value and variance, respectively, 
i.e., 


1 Y 5 (e 


E(X) = — = | te 3? dr-y, (16.71a) 


-roo 2 


(71) 
D?(X) = E[(X — p)?] = σσ] (x — u)?e^ x7 dr = 0?. (16.71b) 
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If the normal random variables X1 and X» are independent with parameters ji, 01 ἃπι] μα, σα, resp., 
then the random variable X — kj X1-- koX3 (Ki, ko real constants) also has a normal distribution with 


parameters u = kıyı + kouz, o = V k1?012 + k2?02?. 


-H 





t 
By the substitution T = in (16.70a), the calculation of the values of the distribution function 
of any normal distribution is reduced to the calculation of the values of the distribution function of 
the (0, 1) normal distribution, which is called the standard normal distribution. Consequently, the 
probability P(a < X < b) of a normal variable can be expressed by the distribution function d(x) of 
the standard normal distribution: 


o 


P(ac Xxb)-ó (=) -Φ (=) i (16.72) 


16.2.4.2 Standard Normal Distribution, Gaussian Error Function 


1. Distribution Function and Density Function 
From (16.70a) with u = 0 and o? = 1 follows the distribution function 


P(X < 1) = (£) = i J 7 = [oat (16.73a) 


of the so-called standard normal distribution. Its density function is 


(t 1 e 

t)- —e 2, 

T Υπ 

it is called the Gaussian error curve (Fig. 16.5b). 
The values of the distribution function ®(x) of the (0, 1) normal distribution are given in Table 21.17, 
p. 1133. Only the values for the positive arguments x are given, while the values for the negative 
arguments can be got from the relation 


@(—x) 5 1— (x). (16.74) 
2. Probability Integral 


The integral (x) is also called the probability integral or Gaussian error integral. In the literature the 
functions $9(x) and erf (r) are sometimes denoted as error integral with the following definitions: 


(16.73b) 


Bo (x 


1 7 9 1 2 f p 
t) = — e αἱ -- Φ(α) -- -, (16.754) οτε(α) -- ---. | ε-5 ἀἑ-- 3: Φο(να). (16.750) 
van à 2 vel 


With erf the error function is denoted. 
16.2.4.3 Logarithmic Normal Distribution 
1. Density Function and Distribution Function 


The continuous random variable X has a logarithmic normal distribution, or lognormal distribution 
with parameters uz and ø? if it can take all positive values, and if the random variable Y, defined by 
Y — log X, (16.76) 
has a normal distribution with expected value iz, and variance ø? (see also remark b) on page 820). 
Consequently, the random variable X has the density function 
0 for 1, 


f=} loge (κ) foc ded. 


un (16.77a) 
ἴσιν 2π 202 
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and the distribution function 


0 for «<0, 
logg 


F(x) = 1 / f Cu n (16.77b) 
e |. exp οσα dt for x90. 


In practical applications either the natural or the decimal logarithm is used. 
2. Expected Value and Variance 


Using the natural logarithm one gets the expected value and the variance of the lognormal distribution 
as: 





2 
σ 

jt = exp (m + i) g^ (exp o2 — 1) exp (2u: + σὲ). (16.78) 
3. Remarks 
a) The density function of the lognormal distribution is continuous everywhere and it has positive 
values only for positive arguments. Fig. 16.6 shows the density functions of lognormal distributions 
for different zz and oy. Here has been used the natural logarithm. 
b) Here the values ji; and o? are not the expected value and variance of the lognormal random variable 
itself, but of the variable Y = log X, while p and o? are in conformity with (16.78) expected value and 
variance of the random variable X. 
c) The values of the distribution function F(x) of the lognormal distribution can be calculated by the 
distribution function (x) of the standard normal distribution (see (16.73a)), in the following way: 


log z — u 

Ε(α) --Φ (=z) . (16.79) 
σι 

d) The lognormal distribution is often applied in lifetime analysis of economical, technical, and biolog- 

ical processes. 

e) The normal distribution can be used in additive superposition of a large number of independent ran- 


dom variables, and the lognormal distribution is used for multiplicative superposition of a large number 
of independent random variables. 
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16.2.4.4 Exponential Distribution 
1. Density Function and Distribution Function 


A continuous random variable X has an exponential distribution with parameter A (A > 0) if its density 
function is (Fig. 16.7) 
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0 fo t«0 
f(t) = e for t>0, (16.80a) 


consequently, the distribution function is 


f 0 for «<0, 4 
F(z) = i f(Odt = { ate ee (16.80b) 


2. Expected Value and Variance 


μπα ο--. (16.81) 


Usually, the following quantities are described by an exponential distribution: Length of phone calls, 
lifetime of radioactive particles, working time of a machine between two stops in certain processes, 
lifetime of light-bulbs or certain building elements. 


16.2.4.5 Weibull Distribution 


1. Density Function and Distribution Function 


The continuous random variable X has a Weibull distribution with parameters a and 6 (a > 0, 8 > 0), 
if its density function is 


0 for t<0, 


ΠΟΥ ΕΛ ji (16.822) 
s (s) exp |- (5) | for t>0 


and so its distribution function is 


0 for «<0, 
Bye 1— exp |- (5) | fo «>0. VOR) 
B 
2. Expected Value and Variance 
u- Br (1 3), =e (r(1+=)-re(i+-)]. (16.83) 
Q Q Q 


Here I(x) denotes the gamma function (see 8.2.5, 6., p. 514): 
Γ(α) -- / fedt for x0. (16.84) 
0 


In (16.82a), a is the shape parameter and £ is the scale parameter (Fig. 16.8, Fig. 16.9). 
Remarks: 

a) The Weibull distribution becomes an exponential distribution for a = 1 with A = 3 
b) The Weibull distribution also has a three-parameter form by introducing a position parameter y. 
Then the distribution function is: 


F(x) = 1 — exp |- (3) } (16.85) 
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c) The Weibull distribution is especially useful in life expectancy theory, because, e.g., it describes the 
functional lifetime of building elements with great flexibility. 


16.2.4.6 x? (Chi-Square) Distribution 
1. Density Function and Distribution Function 


Let X1, X9,..., X, be n independent (0,1) normal random variables. Then the distribution of the 
random variable 
x! - XP 4 X22 +---4 X,? (16.86) 


is called the X? distribution with n degrees of freedom. Its distribution function is denoted by Fy2(x), 
and the corresponding density function by f,2(t). 


m. t 
t2 ^e for(t» 0) 


fa(t) 24 mar (5) (16.872) 
0 for t € 0. 
1 [a t 
Fela) = P(x? € x) - — r [2 lo-3dt (sz 0). (10.870) 
πο 


2. Expected Value and Variance 
E(x?) =n, (16.88a) D?(x?) = 2n. (16.88b) 


3. Sum of Independent Random Variables 
If X4 and X» are independent random variables both having a x? distribution with n and m degrees of 
freedom, then the random variable X = Xı + X2 has a X? distribution with n +m degrees of freedom. 


4. Sum of Independent Normal Random Variables 


If X4, Xo...., X4 are independent, (0, 7) normal random variables, then 
n 1 t 
X = Σ Χρ has the density function f(t) = are (3) : (16.89) 
X =Ċ15 X2 ρωσ. f(t) =% f (5) (16.90) 
=> i 1as the density function ) -- -- =z), j 
nma » gi Y Aga 


12 2ἱ 0 
X= Xj? has the density function f(t) = ashe (5) ; (16.91) 
nz c σ 
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5. Quantile 


For the quantile (see 16.2.2.2, 3., p. 812) x2, of the x? distribution with m degrees of freedom (Fig. 


16.10), 
P(X > Xm) =a. 


Quantiles of the x? distribution can be found in Table 21.18, p. 1135. 


felt) f,(t) 








(16.92) 
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16.2.4.7 Fisher F Distribution 


1. Density Function and Distribution Function 


If X4 and X3 are independent random variables both having x? distribution with m4 and ma degrees 


of freedom, then the distribution of the random variable 


Xi Χο 


mi/ ma 


Finim2 = 


(16.93) 


is a Fisher distribution or F distribution with m1, mz degrees of freedom. The density function is 





mi ma m 
ee me τ E: t *) tr : fort > 0 
Th m: E 2 
frat <2 2 r C) r (=) m, m 2713 
LP Xe eS) 
0 for t € 0. 


Forx € 0 holds Fre(x) — P(Fmı,m € £) = 0, for z > 0: 
Fro) =P Emma c2) 


m m m] 
--- --- T 12 1t 
212 ) ( ) : 

T3 


my me2\ , mi 4 m2 
ΠΠ (Tam)? ? 


2. Expected Value and Variance 


ο ΓΗ τί 


2 2 


mma 2ma3? (m, 4- ma — 2) 





E(Fm, m3) = (16.95a) D* (Fo, ma) = 


mg —2° 


ma(ma — 2)?(m3 — 4) 


(10.944) 


(16.940) 


(16.958) 





824 106. Probability Theory and Mathematical Statistics 





3. Quantile 
The quantiles (see 16.2.2.2, 3., p. 812) ta,mı,mz Of the Fisher distribution (Fig. 16.11) can be found in 
Table 21.19, p. 1136. 


16.2.4.8 Student t Distribution 
1. Density Function and Distribution Function 


If X is a (0, 1) normal random variable and Y is a random variable independent from X and it has a 
x? distribution with m — n — 1 degrees of freedom, then the distribution of the random variable 


X 
Ta 4 (16.96) 


y Y/m 


is called a Student t distribution or t distribution with m degrees of freedom. The distribution function 
is denoted by Fs(x), and the corresponding density function by fs(t). 




















(* + -) 
1 9 1 
t) = — A 16.97a 
fs( πε o ae (16.97a) 
Ισ 
m 
: j το) - a 
= <az)= = j. 
Fs(z) 2 P(T < 2) Í ΠΠ r [ "m (16.97) 
2 (: + =) 
m 
£,(t) f(t) 
α α 
ο 2: 2 
» » 
a) ol tam t b) —lo/2, m 0| Ttto/2,m t 
Figure 16.12 
Es] 2. Expected Value and Variance 
E(T)=0 (m»1) (16.982) D(T),-— (m»2) (16.950) 


3. Quantile 
The quantiles tom and ty/2,m of the t distribution (Fig. 16.12a,b), for which 


P(T > tam) =a (16.99a) or P(|T| > ta/2m) = (16.99b) 


holds, are given in Table 21.20, p. 1138. 

The Student ¢ distribution, introduced by Gosset under the name Student, is used in the case of sam- 
ples with small sample size n, when only estimations can be given for the mean and for the standard 
deviation. The standard deviation (16.98b) no longer depends on the deviation of the population from 
where the sample is taken. 
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16.2.5 Law of Large Numbers, Limit Theorems 

The law of large numbers gives a relation between the probability P(A) of a random event A and its 
relative frequency n4/n with a large number of repeated experiments. 

1. Law of Large Numbers of Bernoulli 

The following inequality holds for arbitrary given numbers ¢ > 0 and 7 > 0 


1 1 
P( ms P(A) < e) 21-34, (16.100a) if n>—— (16.100») 
For other similar theorems see [16.5]. 





n ΠΟ 


E How many times should a not necessarily fair die be rolled if the relative frequency of the 6 should 
be closer to its probability than 0.1 with a probability of at least 95 % ? 

Now, € — 0.01 and rj — 0.05, so 4c?r — 2- 1075, and according to the law of large numbers of Bernoulli 
n > 5-10* must hold. This is an extremely large number, which can be reduced, if the distribution 
function is known (see [16.12]). 


2. Central Limit Theorem of Lindeberg—Levy 


If the independent random variables Xj,..., Xn all have the same distribution with an expected value 
ji and a variance o?, then the distribution of the random variable 


f n 
-> Xiu 
ye (16.101) 
^ σ/ψη ` 


tends to the (0, 1) normal distribution for n — oo, i.e., for its distribution function F;(y) follows 


1 f$ 8 
dim, Fal) = Je | eT dt. (16.102) 


If n > 30 holds, then (y) can be replaced by the (0, 1) normal distribution. Further limit theorems 
can be found in [16.5], [16.7]. 

W Given a sample of 100 items from a production of resistors. It is supposed that their actual resistance 
values are independent and they have the same distribution with deviation o? — 150. 'The mean value 
for these 100 resistors is T = 1050 Q. In which domain is the true expected value yz with a probability 
of 99 96 ? 

Looking for an ¢ such that P(|X — u| < £) = 0.99 holds. Supposing (see (16.101)) that the random 


Us has a (0.1) normal distribution. From P(|Y| x ) 2 P(-A€Y XA)-P(YX 
A) — P(Y < —A), and from P(Y € —3) 2 1— P(Y < 4) it follows that P(Y| € A) — 2P(Y € 
à) — 1 = 0.99. 

So, P(Y € A) — (A) = 0.995 and from Table 21.17, p. 1133, one gets \ = 2.58. Since σ/ν100 -- 
1.225 there is with a 99 % probability: |1050 — u| < 2.58 - 1.225, i.e., 1046.8 Q < u < 1053.20. 


variable Y = 





16.2.6 Stochastic Processes and Stochastic Chains 


Many processes occurring in nature and those being studied in engineering and economics can be real- 
istically described only by time-dependent random variables. 

W The electric consumption of a city at a certain time ¢ has a random fluctuation that is dependent 
on the actual demand of the households and industry. The electric consumption can be considered 
as a continuous random variable X. When the observation time t changes, electric consumption is a 
continuous random variable at every moment, so it is a function of time. 


The stochastic analysis of time-dependent random variables leads to the concept of stochastic pro- 
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cesses, which has a huge literature of its own (see, e.g., [16.7], [16.9]). Some introductory notions will 
be given next. 


16.2.6.1 Basic Notions, Markov Chains 


1. Stochastic Processes 
A set of random variables depending on one parameter is called a stochastic process. The parameter, 
in general, can be considered as time t, so the random variable can be denoted by X, and the stochastic 
process is given by the set 
Gut e T). (16.103) 
The set of parameter values is called the parameter space T, the set of values of the random variables 
is the state space Z. 
2. Stochastic Chains 
If both the parameter space and the state space are discrete, i.e., the state variable X, and the param- 
eter t can have only finite or countably infinite different values, then the stochastic process is called a 
stochastic chain. In this case the different states and different parameter values can be numbered: 
Z5112,...,541,...,] (16.104) 
T= Το ο πο μις es} with 0 <to< ti < 22. < ῃ tind Steals (16.105) 


The times to, t1,... are not necessary equally spaced. 
3. Markov Chains, Transition Probabilities 
If the probability of the different values of X;,,,, in a stochastic process depends only on the state at 
time tm, then the process is called a Markov chain. The Markov property is defined precisely by the 
requirement that 

PUK cee = im+1|Xto = to, Xn =1h,... Ata = im) te P(Xtinas = Ímai| Xu, Bu im) 


for all m € {0,1,2,...} and for all i, i1,...,¢myi € Z. (16.106) 
Consider a Markov chain and times tm and tm41. The conditional probabilities 
P(X&,4, — j| Xu, = i) 2 dij (tm ἐπ) (16.107) 


are called the transition probabilities of the chain. 'The transition probability determines the probability 
by which the system changes from the state X;,, — i at £,, into the state Xy, ,, — j at fi. 

If the state space of a Markov chain is finite, i.e., Z = (1,2,..., N), then the transition probabilities 
Dij (f, t2) between the states at times t; and ty can be represented by a quadratic matrix P(t1, t2), by 
the so-called transition matrix: 


pu(tt2) pro(ti,te) τν  pin(ta, t2) 
pa(ti,t2) pea(ti,te) --.. Ῥαν({ι, te) 
P(t, te) = . (16.108) 
pui(h.t2) pw»(tu t2)... pwn(th, te) 
The times tı and tz are not necessarily consecutive. 
4. Time-Homogeneous (Stationary) Markov Chains 
If the transition probabilities of a Markov chain (16.107) do not depend on time, i.e., 
Pij (tm, tm+1) = Pij, (16.109) 
then the Markov chain is called time-homogeneous or stationary. A stationary Markov chain with a 
finite state space Z = {1,2,..., N} has the transition matrix 
pi pa -. PIN 
P21 po ... P2N 
P-|. ; (16.1102) 


DNi DN2 ... DNN 
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where 
a) pij > 0 for alli, j and (16.110b) 
N 
b) Mp =1 for alli. (16.110c) 
ja 


Being independent of time pij gives the transition probability from the state i into the state j during 
time unit. 

W The number of busy lines in a telephone exchange can be modeled by a stationary Markov chain. 
For the sake of simplicity it is supposed that there are only two lines. Hence, the states are i = 0,1,2. 
Let the time unit be, e.g., 1 minute. Suppose the transition matrix pi; is: 


0.7 0.3 0.0 

(pij) Ξ 02. 05 03 (ij = 0, 1,2). 
0.1 0.4 0.5 

In the matrix (pij) the first row corresponds to the state i — 0. The matrix element pi2 = 0,3 (second 

row, third column) shows the probability that two lines are busy at time tm given that one was busy at 

Lcid: 

Remark: Every quadratic matrix P = (pij) of size N x N satisfying the properties (16.110b) and 

(16.110c) is called a stochastic matrir. Their row vectors are called stochastic vectors. 

Although the transition probabilities of a stationary Markov chain do not depend on time, the distri- 

bution of the random variable X, is given at a given time by the probabilities 


N 
P(X, =i) =pi(t) (6—1,2,..., N) (16.111a) with Y^pi(t) 21 (16.111b) 
i=1 
since the process is in one of the states with probability one at any time f. 
5. Probability Vector and Transition Matrix 
Probabilities (16.111a) can be written in the form of a probability vector 
P = (pi(t) pa(t), . .-, p (0). (16.112) 
The probability vector p is a stochastic vector. It determines the distribution of the states of a station- 
ary Markov chain at time period t. Let the transition matrix P of a stationary Markov chain be given 
(according to (16.110a,b,c)). Starting with the probability distribution at time period t determine the 
probability distribution at t 4- 1, that is, calculate p(¢ + 1) from P and p(t): 


p(t+1)=p(t)-P (16.113) 
and furthermore 

p(t + k) = p(t): PF. (16.114) 
Remarks: 
1. For t — 0 it follows from (16.114) that 

p(k) = p(0)P, (16.115) 


that is, a stationary Markov chain is uniquely determined by the initial distribution p(0) and the tran- 
sition matrix P. 

2. If matrices A and B are stochastic matrices, then C = AB is a stochastic matrix, as well. Conse- 
quently, if P is a stochastic matrix, then the powers P* are also stochastic matrices. 

Β A particle changes its position (state) X,(1 € x < 5) along a line in time periods t — 1,2,3,... 
according to the following rules: 

a) If the particle is at x — 2,3,4, then it moves to the right by a unit during the next time unit with 
probability p — 0.6 and to the left with probability 1 — p — 0.4. 
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b) At points z — 1 and x — 5 the particle is absorbed, i.e., it stays there with probability 1. 

c) At time t = 0 the position of the particle is 7 = 2. 

Determine the probability distribution p(3) at time period t — 3. 

By (16.115) the probability distribution p(3) = p(0) - P? holds with p(0) — (0, 1,0,0,0) and with the 
transition matrix 


1 0 0 0 ο 1 0 0 0 0 
04 0 06 0 0 0.496 0 0.288 0 0.216 
P-|0 04 0 06 0 |. Hence P? — | 0.160. 0.192 0 0.288 0.360 
0 0 04 0 06 0.064 0 0.192 0 0.744 
0 0 0 0 1 0 0 0 0 1 


and finally p(3) — (0.496; 0; 0.288; 0; 0.216) . 
16.2.6.2 Poisson Process 


1. The Poisson Process 

In the case of a stochastic chain both the state space Z and the parameter space T are discrete, that 
is, the stochastic process is observed only at discrete time periods to, ti, t2,... . Now, there is studied a 
process with continuous parameter space T, and it is called a Poisson process. 

1. Mathematical Formulation of the Poisson Process The following assumptions are made for 
the mathematical formulation of the Poisson process: 

a) Let the random variable X, be the number of signals in the time interval [0,t); 

b) Let the probability px (t) = P(X, = x) be the probability of x signals during the time interval [0, £). 
Additionally, the following assumptions are required, which hold in the process of radioactive decay 
and many other random processes (at least approximately): 

c) The probability P(X, = x) of x signals in a time interval of length t depends only on x and t, and 
does not depend on the position of the time interval on the time axis. 

d) The numbers of signals in disjoint time intervals are independent random variables. 

e) The probability to get at least one signal in a very short interval of length At is approximately 
proportional to this length. The proportionality factor is denoted by A (A > 0). 

2. Distribution Function By properties a)-e) the distribution of the random variable X; is deter- 
mined. One gets: 


P(X, =2)= 





zx 
ο e, (16.116) 
where pi = At is the expected value and a? = At the variance. 
3. Remarks 
1. From (16.116) the Poisson distribution follows as a special case for t — 1 (see 16.2.3.3, p.817). 
2. To interpret the parameter A or to estimate its value from observed data the following properties 
are useful: 
e Aisthe average number of signals during a time unit, 


es the average distance (in time) between two signals in a Poisson process. 


3. The Poisson process can be interpreted as the random motion of a particle in the state space Z — 
{0,1,2,...}. The particle starts in the state 0, and at every sign it jumps from state i into the next 
state i + 1. Furthermore, for a small interval At the transition probability p;;41 from state 7 into the 
state i + 1 should be: 

Digi © AAt. (16.117) 
À is called the transition rate. 
4. Examples of Poisson Processes 
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HM Radioactive decay is a typical example of a Poisson process: The number of decays (signals) are 
registered with a counter and marked on the time axis. The observation interval should be relatively 
small with respect to the half-period of the radiating matter. 
W Consider the number of calls registered in a telephone exchange until time t and calculate, e.g., the 
probability that at most x calls are registered until time t with the assumption that the average number 
of calls during a time unit is À. 
W 1n reliability testing, the number of failures of a reparable system is counted during a period of duty. 
E Queuing theory considers the number of customers arriving at the counter of a department store, to 
a booking office or to a gasoline station. 
2. Birth and Death Processes 
One of the generalizations of the Poisson process is the assumption that the transition rate A; in (16.117) 
depends on the state i. Another generalization is the transition from state 7 into state 4 — 1 is allowed. 
The corresponding transition rate is denoted by j1;. The state 7 can be considered, e.g., as the number of 
individuals in a population. It increases by one at transition from state i into state i+ 1, and decreases 
by one at transition from i into i — 1. These stochastic processes are called birth and death processes. 
Let p(X; = 7) = p;(t) be the probability that the process is in state i at time t. Analogously to the 
Poisson process for the transition probability holds: 

from i— 1 into i: pi ji λι ιΔί, 

from i41 into 4: pia F2 paa AM, (16.118) 

from i into 4: pi; £& 1 — (Ai - pi) At. 
Remark: The Poisson process is a pure birth process with a constant transition rate. 
3. Queuing 
The simplest queuing system is considered as a counter where customers are served one by one in the 
order of their arrival time. The waiting room is sufficiently large, so no one needs to leave because 
it becomes full. The customers arrive according to a Poisson process, that is, the inter-arrival time 
between two clients is exponentially distributed with parameter A, and these inter-arrival times are 
independent. In many cases also the serving time has an exponential distribution with parameter p. 
The parameters À and u have the following meanings: 
e A: average number of arrivals per time unit, 


1 ; : : 
ο X average inter-arrival time, 
e u: average number of served clients per time unit, 


1 zog 
e —: average serving time. 
H 


Remarks: 


1. Ifthe number of clients standing in the queue is considered as the state of this stochastic process, then 
the above simple queuing model is a birth and death process with constant birth rate \ and constant 
death rate p. 

2. The above queuing model can be modified and generalized in many different ways, e.g., there can be 
several counters where the clients are served and/or the arrival times and serving times follow different 
distributions (see [16.9], [16.19]). 
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16.3 Mathematical Statistics 

Mathematical statistics provides an application of probability theory for given mass phenomena. Its 
theorems allow to make statements with certain probability about properties of given sets, which state- 
ments are based on the results of experiments whose number should be kept low for economical reasons. 


16.3.1 Statistic Function or Sample Function 
16.3.1.1 Population, Sample, Random Vector 


1. Population 

The Population is the set of all elements of interest in a particular study. Any set of things having the 
same property in a certain sense can be considered, e.g., every article of a certain production process or 
all the values of a measuring sequence occurring in a permanent repetition of an experiment. The num- 
ber N of the elements of a population can be very large, even practically infinite. The word population 
is often used to denote also the set of numerical values assigned to the elements. 

2. Sample 

In order not to check the total population about the considered property, data are collected only from 
a subset, from a so-called sample of size n (n < N). Talking about a random choice means that every 
element of the population has the same chance of being chosen. A random sample of size n from a 
finite population of size N is a sample selected such that each possible sample of size n has the same 
probability of being selected. A random sample from an infinite population is a sample selected such 
that each element is selected independently. The random choice can be made by mixing, blind taking 
out or by so-called random numbers. The word sample is used for the set of values assigned to the 
selected elements. 


3. Random Choice with Random Numbers 

It often happens that a random selection is physically impossible on the spot, e.g., in the case of piled 
material, like concrete stabs. Then random numbers are applied for a random selection (see Table 
21.21, p. 1139). 

Most calculators can generate uniformly distributed random numbers from the interval [0, 1]. Pressing 
the key RAN yields a number between 0.00...0 and 0.99...9. The digits after the decimal point form 
a sequence of random numbers. 

Random numbers are often taken from tables. Two-digit random numbers are given in Table 21.21, 
p. 1139. If larger ones are necessary, then one can compose several-digit numbers by writing them after 
each other. 

W A random sample is to be examined from a transport of 70 piled pipes. The sample size is supposed 
to be 10. First the pipes are numbered from 00 to 69. A two-digit table of random numbers is applied 
to select the numbers. Then the way is fixed how to choose the numbers, e.g., horizontally, vertically 
or diagonally. If during this process random numbers occur repeatedly, or they are larger than 69, then 
they are simply omitted. The pipes corresponding to the chosen random numbers are the elements of 
the sample. If there is a several-digit table of random numbers, they can be decomposed into two-digit 
numbers. 


4. Random Vector 

A random variable X can be characterized by its distribution function, by its parameters, where the 
distribution function itself is determined completely by the properties of the population. These are 
unknown at the beginning of a statistical investigation, so one want to collect as much information as 
possible with the help of samples. Usually the investigation is not restricted to one sample but more 
samples are applied (with same size n if it is possible, for practical reasons). The elements of a sample 
are chosen randomly, so the realizations take their values randomly, i.e., the first value of the first 
sample is usually different from the first value of the second sample. Consequently, the first value of 
a sample is a random variable itself denoted by X;. Analogously, the random variables X5, X3,... X, 
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can be introduced for the second, third,..., n-th sample values, and they are called sample variables. 
Together, they form the random vector 


X = (Xi, Xo,..., Xn). (16.119a) 
Every sample of size n with elements x; can be considered as a vector 
μα 2525); (16.119b) 


as a realization of the random vector. 


16.3.1.2 Statistic Function or Sample Function 


Since the samples are different from each other, their arithmetic means x are also different. They can 
be considered as realizations of a new random variable denoted by X which depends on the sample 
variables X1, Xo, ... , Xs. 





1. sample: £11, 12, ... zq& with mean 7. 
2. sample: x91, 200, ... to, with mean T3. 
"en -o oa g (16.120) 
m-th sample: m1, Um2, --» Umn With mean Fm. 


The realization of the j-th sample variable in the i-th sample is denoted by vij; (i — 1,2,...,m; 7 = 
1,2, ;f1). 

A function of the random vector X — (Xi, X5,..., X) is again a random variable, and it is called a 
statistic or sample function. The most important sample functions are the mean, variance, median and 
range. 


1. Mean 
The mean X of the random variables X; is: 
E μ. Ἡ 
X=- X. (16.121a) 
n iz 
The mean @ of the sample (2, £2,..., En) İS 
1 n 
T=- Y ti (16.121b) 
TL {1 


It is often useful to introduce an estimate value xo in the calculations of the mean. It can be chosen 
arbitrarily but possibly close to the mean z. If, e.g., v; (i = 1,2,...) are several-digit numbers in a 
long measuring sequence, and they differ only in the last few digits, it is simpler to do the calculations 
only with the smaller numbers 


Ži = Ti — πρ. (16.121c) 
Then follows 
1 n 
T= to + = Σ Zi = To +Z. (16.121d) 
Dic 


2. Variance 
The variance 62 of the random variables X; with mean X is defined by: 
L ΙΙ 
8? 2.—— MX; - Xy. (16.1222) 


nli 
The realization of the variance with the help of the sample (21,12, ... , 25) is 
2 1 Š πα : 
s = 2-2). (16.122b) 


n—lzi 
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It is proven that the estimation of the variance of the original population gives a more accurate estima- 
tion by dividing n — 1 than by dividing n. With the estimated value xo follows 


n 


z — 2» zi z? — n(T — t0)? 
Zai ει. {ει . (16.122c) 
n—1 n—1 


Me 


ll 
5 





n 
For xg = T the correction is zy, zi = 0 because z = 0 holds. 
i-l 


3. Median 
Let the n elements of the sample be arranged in ascending (or descending) order. If n is odd, then the 
n+1 





median X is the value of the -th item; if n is even, then the median is the average value of the 


Sth and 5 + 1) -th items, the two items on the middle. 








The median & in a particular sample (£1, £2, ..., £n), whose elements are arranged in ascending (or 
descending) order, is 
Emih if n — 2m 4 1, 
T= 4 Umqi + Im if n — 2m. (16.123) 
2 
4. Range 
R — max X; — min X; (1212. π]ν (16.124a) 
The range R of a particular sample (21, 12, ..., c5) is 
δὲ τι ως Uu (16.124b) 
Every particular realization of a sample function is denoted by a lowercase letter, except the range R, 
i.e., for a particular sample (21, x9, ..., z,) the particular values T, s?, 7, and R are calculated. 
i m i m i m E Choosing a sample of 15 loudspeakers from a running 
production the interesting quantity X is the air gap induc- 
1 | 1.01 | 6 | 1.00}, 11 | 1.00 tion B, measured in Tesla. From the measured data in the 
2 |102 || 7 | 0.99 | 12 | 1.00 table to the left follows: 
3 | 1.007 8 | L01| 13 | 1.02 T — 1.0027 or z — 1.0027 with «y — 1.00; 
4 [098] 9 | 1.01 | 14 | 1.00 s? — 1.2095 - 107^ or s? — 1.2076 - 1074 with xp = 1.00; 
5 |0.99 | 10 | 1.00 || 15 | 1.01 & = 1.00; R = 0.04. 
































16.3.2 Descriptive Statistics 


16.3.2.1 Statistical Summarization and Analysis of Given Data 


In order to describe statistically a property of a certain element this property must be characterized 
by a random variable X. Usually, the n measured or observed values x; of the property X form the 
starting point of a statistical investigation, which is made to find some parameters of the distribution 
or the distribution itself of X. 

Every measured sequence of size n can be considered as a random sample from an infinite population, if 
the experiment or the measurement could be repeated infinitely many times under the same conditions. 
Since the size n of a measuring sequence can be very large, the statistical investigation proceeds as 
follows: 

1. Protocol, Prime Notation The measured or observed values x; are recorded in a protocol list. 
2. Intervals or Classes Grouping the measured n data x; (i = 1,2,...,n) of the sample into k 
subintervals, so-called classes or class intervals of equal length or width h ; usually 10-20 classes. 
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3. Frequencies and Frequency Distribution The absolute frequenciesh; (j =1,2,...,k) are the 
numbers /;j of data (occupancy number) belonging to a given interval Azx;. The ratios h;/n (in %) are 
called relative frequencies. If the values h;/n are represented over the classes as rectangles, then one 
gets a graphical representation of the given frequency distribution, and this representation is called a 
histogram (Fig. 16.13a). The values h;/n can be considered as the empirical values of the probabilities 
or the density function f(x). 





























Table 16.3 Frequency table 4. Cumulative Frequency Adding the absolute or rel- 
ative frequencies, gives the cumulative absolute or relative 
Class hi | hi/n(96) | Fi (96) frequency 
50 — 70 1 0.8 0.8 hy thot: +h; . . 
71-90} 1] 08 16) 0 Fy= (1 -- 1,2... κ). (16.125) 
a E E : 2 ia A graphical representation of the empirical distribution 
131 —150 15 120 22. 4 function, which can be considered as an approximation 
151 - 17 22 17.6 40.0 of the unknown underlying distribution function F(x) is 
171 — 190 30 | 240 64.0 shown in Fig. 1613h: : 
191 910 27| 21.6 85.6 W Suppose during a study n = 125 measurements have 
211 — 230 9 7.2 92.8 been performed. The results spread in the interval from 
231 — 250 6 48 97.6 50 to 270, so it is reasonable to divide this interval into 
251 — 270 3 24 100.0 k = 11 classes with a length h = 20. The frequency table 
see Table 16.3. 
F,/% 

100 

80 

60 

40 

20 

) 70 110 150 190 230 270 x b) 70 110 150 190 230 270 x 
a 


Figure 16.13 
16.3.2.2 Statistical Parameters 


After summarizing and analyzing the data of the sample as given in 16.3.2.1, p. 832, it follows the 
approximation of the parameters of the distribution belonging to the random variable by the following 
parameters: 


1. Mean 
Using all measured data from the sample directly, the sample mean is 
1 n 
T=- r (16.126a) 
n isl 


Using the means T; and frequencies hy of the classes 


1 k 
g= n (16.126b) 
nii 
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2. Variance 
Using all measured data directly the sample variance is 
1 n 
gi Σι -- π)». (16.1272) 


n=1 





Using the means T; and frequencies hj of the classes 


97 -- 





Ια 
My - zy. (16.127b) 
n—174 

gm 
The class midpoint uj (the midpoint of the corresponding interval) is also often used instead of z}. 
3. Median 


The median 3 of a distribution is defined by 


1 

P(X <z)= z (16.128a) 

The median may not be a uniquely determined point. The median of a sample is 
Όπιει, ifn = 2m +1, 
=< Όπιει 1 Tm ifn = 2m. (16.128b) 
2 

4. Range 

R = Tmax — Tmin- (16.129) 


5. Mode or Modal Value 


is the data value that occurs with greatest frequency. It is denoted by D. 


16.3.3 Important Tests 


One of the fundamental problems of mathematical statistics is to draw conclusions about the population 
from the sample. There are two types of the most important questions: 

1. The type of the distribution is known, and one wants to get some estimate for its parameters. A 
distribution can be characterized mostly quite well by the parameters jj and o? (here p is the exact value 
of the expected value, and o? is the exact variance), consequently one of the most important questions 
is how good an estimation can be given for them, based on the samples. 

2. Some hypotheses are known about these parameters, and it is desirable to check if they are true. 
The most often occurring questions are: 

a) Is the expected value equal to a given number or not? 

b) Are the expected values for two populations equal or not? 

c) Docs the distribution of the random variable with p and o? fit a given distribution or not? etc. 
Because in observations and measurements, the normal distribution has a very important role, it is to 
be discussed the goodness of a fit test for a normal distribution. The basic idea can be used for other 
distributions, too. 


16.3.3.1 Goodness of Fit Test for a Normal Distribution 

There are different tests in mathematical statistics to decide if the data of a sample come from a normal 
distribution. Here a graphical one is discussed based on normal probability paper , and a numerical 
one based on the use of the chi-square distribution (*x? test”). 

1. Goodness of Fit Test with Probability Paper 

a) Principle of Probability Paper The z-axis in a right-angled coordinate system is scaled equidis- 
tantly, while the y-axis is scaled on the following way: It is divided equidistantly with respect to Z, but 
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scaled by 
1 f 
y = (z) = V / e 2 dt. (16.130) 


If a random variable X has a normal distribution with expected value j4 and variance o?, then for its 
distribution function (see 16.2.4.2, p. 819) 








F(x) 2 6 (= = £) — d(z) (16.1312) 
holds, i.e., (16.131c) 
zat (16.131b) 
σ 


must be valid, and so there is a linear relation between x and z and (16.131c). 


b) Application of Probability Paper 
Considering the data of the sample, calculat- 
ing the cumulative relative frequencies accord- 
ing to (16.125), and sketch these onto the proba- 
bility paper as the ordinates of the points with 
abscissae the upper class boundaries. If these 
points are approximately on a straight line, 
(with small deviations) then the random variable 
can be considered as a normal random variable 
(Fig. 16.14). 

As it can be seen from Fig. 16.14, the distribu- 
tion to which the data of Table 16.3 belong, can 
be considered as a normal distribution. Further- 
more one can see that u ~ 176, o ~ 37.5 (from 














the x values belonging to the 0 and +1 values of 0 τ 
2). μ-σ µ µσ 

Remark: The values F; of the relative cumula- 

tive frequencies can be plotted more easily on the Figure 16.14 


probability paper, if its scaling is equidistant with respect to y, which means a non-equidistant scaling 
for the ordinates. 

2. x?test for Goodness of Fit 

It is to check if a random variable X can be considered as normal in the sense of 16.2.4.2, p. 819. The 


range of X is divided into k classes and the upper limit of the j-th (j = 1,2,...,) class is denoted by 
£j . Let pj be the "theoretical" probability that X is in the j-th class, i.e., 
pi = F(&j) — F(8j-1) (10.132) 


where F(x) is the distribution function of X (j = 1,2,...,k; & is the lower limit of the first class with 
F'(&) = 0). Because X is supposed to be normal, then 


F(£;) = 8 655) (16.132b) 


must hold, where d(x) is the distribution function of the standard normal distribution (see 16.2.4.2, 
p. 819). The parameters jz and o? of the population are usually not known. Therefore z and s? are to 
be used as an approximation of them. The decomposition of the range of X should be made so that the 
expected frequencies for every class should exceed 5, i.e., if the size of the sample is n, then np; > 5. 

Now, considering the sample (21, 12, . .. , 5) of size n and calculating the corresponding frequencies /;; 
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(for the classes given above), then the random variable 


k (hj — np;)? 
{2a (16.132c) 
i np; 

has approximately a y? distribution with m = k—1 degrees of freedom, if jj and o? are known, m — k—2 
if one of them is estimated from the sample, and m = k — 3 if both are estimated by 7 and s?. 
The test, if a random variable X has a normal distribution (x?-approximation test) consists in the com- 
parison of the experimental x2 of the sample with the corresponding χάπι from Table 21.18, p. 1135 
for a given statistical significance level a and m degrees of freedom. 
Now one decides a significant level a and one takes out of the Table 21.18, p. 1135 the quantile X k-i 
of the corresponding x? distribution (i depends on the number of unknown parameters). This means 
P(x? > xl,,)- a holds. Comparing the value y% from (16.132c) and this quantile, and if 

X$ «Xia (16.132d) 
holds, one can accept the assumption that the sample came from a normal distribution. This test is 
also called the x? test for goodness of fit. 
Bl The following x? test is based on the example on p. 833. The sample size is n. — 125, with the 
mean T — 176.32 and variance s? — 36.70. These values are used as approximations of the unknown 
parameters jz and c? of the population. Determining the test statistic X2 according to (16.132c) after 
performing the calculations according to (16.132a) and (16.132b), yields the data in Table 16.4. 


Table 16.4 x? test 









































7" mm hecno? 
Éj hj &-n Φ (=+) Pj npj (np 
σ σ πρ 
70 | 1 —2.90 0.0019 0.0019 | 0.2375 
90| 1 —2.35 0.0094 0.0075 | 0.9375 
| 2) -161) 0.0351 | 0.0257 | 3.2125 f 129750 | 900005 
130 | 9 —1.26 0.1038 0.0687 | 8.5857 
150 | 15 —0.72 0.2358 0.1320 | 16.5000 0.1635 
170 | 22 —0.17 0.4325 0.1967 | 24.5875 0.2723 
190 | 30 0.37 0.6443 0.2118 | 26.4750 0.4693 
210 | 27 0.92 0.8212 0.1769 | 22.1125 1.0803 
230 | 9 1.46 0.9279 0.1067 | 13.3375 1.4106 
250 | 6 2.01 0.9778 0.0499 | 6.2375 
270 | 3 } 9 | 255| 09946 | 0.0168 | 2.1000 j 3.3375 0:0526 
x2 = 3.4486 
It follows from the last column that x2 — 3.4486. Because of the requirement np; > 5, the number of 
classes is reduced from k = 11 to k* = k—4 = 7. Since for the calculation of the theoretical frequencies 
np; the estimated values % and s? of the sample are used instead of u and o? of the population, so the 


number of degrees of freedom of the corresponding X? distribution is reduced by 2. The critical value is 
the quantile 24+ 19. For a = 0.05 one gets x05, — 9.5 from Table 21.18, p. 1135, so because of the 
inequality x3 < X0.o5; 4 there is no contradiction to the assumption that the sample is from a population 
with a normal distribution. 


16.3.3.2 Distribution of the Sample Mean 


Let X be a continuous random variable. Suppose arbitrarily many samples of size n can be taken 
from the corresponding population. Then the sample mean is also a random variable X, and it is also 
continuous. 
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1. Confidence Probability of the Sample Mean 

If X has a normal distribution with parameters jj and c?, then X is also a normal random variable with 
parameters j1 and c? /n, i.e., the density function f(x) of X is concentrated more around p than the 
density function f(x) of the population. For any value € 7 0 


E = eyn 
P(|X — ul « e) 230 (2) - 1, P(|X - u| < £) = 28 (=) =] (16.133) 
σ σ 
Table 16.5 Confidence level holds. Consequently with increasing sample size n the probability 
for the sample mean that the sample mean is a good approximation of ju is also increasing. 





1 = 1 1 
p(x- PE 7) M Fore = 50 from (16.133) P(X -μς σσ) = 20 (3v7) ai, 
2 and for different values of n one gets the values listed in Table 16.5. 

















1 38.29 % As can be seen from Table 16.5, e.g., that with a sample size n = 49, 
4 68.27 % the probability that the sample mean 7 differs from yz by less than 
16 95.45 % do. i 
25 98.76 % oo is 99.95 %. 
49 99.96 % 





2. Sample Mean Distribution for Arbitrary Distribution of the Population 

The random variable X has an approximately normal distribution with parameters jz and o?/n for any 
distribution of the population with expected value jz and variance o?. This fact is based on the central 
limit theorem. 


16.3.3.3 Confidence Limits for the Mean 


1. Confidence Interval for the Mean with a Known Variance c? 
If X is a random variable with parameters j4 and c?, then according to 16.3.3.2, p. 836, X is approxi- 
mately a normal random variable with parameters jz and o?/n. Then substitution of 


GI n (16.134) 


Z= 
yields a random variable Z which has approximately a standard normal distribution, therefore 





P(|Z| < €) = f ote) dx = 26(e) — 1. (16.135) 


If the given significance level is a, namely, 

P(|Z| xe) 21- a, (16.136) 
is required, then £ = e(a) can be determined from (16.135), e.g., from Table 21.17, p. 1133, for the 
standard normal distribution. From |Z| € &(o) and from (16.134) the relation 


(a (16.137) 











=T} 
νη 


follows. The values z 3 ---ε(α) in (16.137) are called confidence limits for the expected value and the 





interval between them is called a confidence interval for the expected value ji with a known c? and given 
significance level a. In other words: The expected value ju is between the confidence limits (16.137) 
with a probability 1 — a. 

Remark: If the sample size is large enough, then s? can be used instead of o? in (16.137). The sample 
size is considered to be large, if n 100, but in practice, depending on the actual problem, it is consid- 
ered to be sufficiently large if n 7 30. If n is not large enough, then in the case of a normally distributed 
population the ¢ distribution is applied to determine the confidence limits as in (16.140). 
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2. Confidence Interval for the Expected Value with an Unknown Variance σ2 


If the population is approximately normally distributed and its variance c? is unknown, then it is re- 

placed by the sample variance s? and instead of (16.134) one gets the random variable 

X-u 
8 





T- Jn, (16.138) 


which has a t distribution (see 16.2.4.8, p. 824) with m = n — 1 degrees of freedom. Here n is the size 


of the sample. If n is large, e.g., n > 100 holds, then T can be considered as a normal random variable 
as Z in (16.134). For a given significance level a holds 


P(Ir| «e - n pp E M E e)=1-a (16.139) 


From (16.139) follows that ¢ = e(a,n) = ta/am—1, Where ty/2n—1 is the quantile of the t distribution 
(with n — 1 degrees of freedom) for the significance level a (Table 21.20, p. 1138). From |T| = ta/2n—1 





s 
HZTE Van (16.140) 
follows. The values 7 + s /an—1 are called the confidence limits for the expected value p of the 


vn 
distribution of the population with an unknown variance o? and with a given significance level a. The 
interval between these limits is the confidence interval. 
WA sample contains the following 6 measured values: 0.842; 0.846; 0.835; 0.839; 0.843; 0.838. Then 
i — 0.8405 and s — 0.00394 is obtained. 
What is the maximum deviation of the sample mean z from the expected value yz of the population 
distribution, if the significance level o is given as 5 96 or 1 96 ? 
1. a — 0.05: one reads from Table 21.20, p. 1138, that ta/2;5 — 2.57, consequently 
[X — u| € 2.57 - 0.00394//6 — 0.0042. Thus, the sample mean 7 = 0.8405 differs from the expected 
value jt by less than 40.0042 with a probability 95 96. 

a = 0.01: ta/o5 = 4.03; |X — u| < 4.03 - 0.00394//6 = 0.0065, i-e., the sample mean 7 differs from 
u by less than +0.0065 with a probability 99 %. 


16.3.3.4 Confidence Interval for the Variance 


If the random variable X has a normal distribution with parameters 14 and c?, then the random variable 














2 
x! 2 (n- 155 (16.141) fe(x) 


has a x? distribution with m — n — 1 degrees of 

freedom, where n is the sample size and s? is the 

sample deviation. fy2(x) denotes the density func- α/2 α/2 
tion of the y? distribution in Fig. 16.15, and one 
sees that 





2 2 a 0 2 2 x 
PO? < xa) Ρα χο. (16.149) X, X, 
Thus, using the quantiles of the x? aes Figure 16.15 
(see Table 21.18, p. 1135) gives 
Xa = Xi afin) x = Hanai (16.143) 


Considering (16.141) one gets the following estimation for the unknown variance c? of the population 
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distribution with a significance level a: 


=] 2 zd 2 
me gee =u, (16.144) 
Xa/2;n—1 X1—-a/2;1—1 
The confidence interval given in (16.144) for c? is fairly large for small sample sizes. 
Wl For the numerical example (p. 838) with 6 measured values and for a = 5% one gets from Table 
21.18, p. 1135, X oos, — 0.831 and Xĝ gzs: = 12.8, so it follows from (16.144) that 0.625 - s « o € 
2.453 - s with s — 0.00394. 


16.3.3.5 Structure of Hypothesis Testing 

A statistical hypothesis testing has the following structure: 

1. First a hypothesis H is to be developed that the sample belongs to a population with some given 
properties, e.g., 

H: The population distribution has a normal distribution with parameters p and g? (or another given 
distribution), or 

H: For the non-known p an approximative value (estimate value) yọ is inserted which can be got ,e.g., 
by rounding off the sample mean z, or 

H: Two populations have the same expected value, 41 — H2 = 0, etc. 

2. Aconfidence interval B is to be defined, based on the hypothesis H (in general with tables). The 
value of the sample function should be in this interval with the given probability, e.g., with probability 
99% for a = 0.01. 

3. Calculating the value of the sample function and accepting the hypothesis if this value is in the 
given interval B, otherwise rejecting it. 

W Test of the hypothesis H: 1 — jio with a significance level a. 


X 
The random variable T = LE Vn has at distribution with m — n — 1 degrees of freedom according 


to 16.3.3.3, p. 837. It follows fom this that this hypothesis is to reject, if © is not in the confidence 
interval given by (16.140), i-e., if 


5 
|z- pol > Vm ta/2m-1 (16.145) 


holds. One says that there is a significant difference. For further problems about tests see [16.16]. 


16.3.4 Correlation and Regression 

Correlation analysis is used to determine some kind of dependence between two or more properties of 
the population from the experimental data. The form of this dependence between these properties is 
determined with the help of regression analysis. 

16.3.4.1 Linear Correlation of two Measurable Characters 

1. Two-Dimensional Random Variable 

In the following the formulas are mostly used for continuous random variables, but it is easy to replace 


them by the corresponding formulas for discrete variables. Suppose that X and Y, as a two-dimensional 
random variable (X,Y), have the joint distribution function 


rz y 
F(z,y) - PX € z, Y € y) =f [tow dz dy, (16.146) 
Fi(z)- P(X €z,Y «oo) Fy(y)-P(X «oo,Y € y). (16.146b) 


The random variables X and Y are called independent of each other if 
F(x, y) = Fi(x) - Fo(y) (16.147) 
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holds. The fundamental quantities assigned to X and Y determined from their joint density function 
f(x,y) are: 
1. Expected Values 





Ux = 9-] fet (x,y) dady, (16.148a) py = E(Y) =f fy f(x,y)dvrdy,  (16.148b 
2. Variances 

o% = E((X — ux)?), (16.1492) oy = E((Y — uy))). (16.149b 
3. Covariance 

oxy — E((X — ux)(Y — uy)). (16.150 
4. Correlation Coefficient 

iyo. (16.151 

OXOy 


It is assumed that every expected value above exists. The covariance can also be calculated by the 
formula 


oxy — E(XY) — uxuy where E(XY) x fart x,y) dx dy. (16.152 


—o0o0—oo 





The correlation coefficient is a measure of the linear dependence of X and Y, because of the following 
facts: 

All points (X,Y) are on one line with probability 1 if o% , — 1 holds. If X and Y are independent 
random variables, then their covariance is equal to zero, oxy = 0. From oxy = 0, it does not follow 
that X and Y are independent, but it does if they have a two-dimensional normal distribution which is 
defined by the density function 


fida 1 | 1 (5 — px)? 
vy exp 
πσχσνψ] -- ον 2(1 — oy) 
exv(s —ux)y— uv) | (y— 222) 
σχσν o ` 








2 
ox 


2 





(16.153) 


2. Test for Independence of two Variable 

In practical problems often the question arises of whether the variables X and Y can be considered inde- 
pendent with oxy = 0, if the sample with size n and with the measured values (x;, y;) (i = 1,2,...,7) 
comes from a two-dimensional normal distributed population. The test is performed in the following 
way: 

1. Setting up the hypothesis H: oxy = 0. 


2. Determining a significance level a and determining the quantile tam of the t distribution from Table 
21.20, p. 1138, for m — n — 2. 

3. Calculation of the empirical correlation coefficients rzy and calculation of the test statistics (sample 
function) y 


{αι - z)(yi — V) 


Tuy Vn — 2 
cap 
γἱ-τ 


4. Rejection of the hypothesis if |t| 7 t4, holds. 


t= (16.154a) with Try = (16.154b) 
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16.3.4.2 Linear Regression for two Measurable Characters 


1. Determination of the Regression Line 
If a certain dependence is detected between the variables X and Y by the correlation coefficient, then 
the next problem is to find the functional dependence Y = f(X). Here mostly linear dependence is 
considered. 
In the simplest case of linear regression it is supposed that for any fixed value of x the random variable 
Y in the population has a normal distribution with the expected value 

E(Y) —- a bx (16.155) 
and a variance o? independent of x. The relation (16.155) means that the mean value of the random 
variable Y depends linearly on the fixed value of x. The values of the parameters a, b and c? of the 
population are usually unknown. To estimate them approximately from a sample with values (z;, y;) 
(i = 1,2,...,n) the least squares method can be used. The least squares method requires that 


X [yi — (a + ba,)|? = min! (16.156) 


is valid so one gets the estimates 


n 








0 Meo -» i = 
— , ᾱ-π-ἶπ, P= 55 rey) with (16.1572) 
» {αι -- zy "o 
i=1 
a he a ee 2 1 Ὁ κ . 
T=- Sou, y= — Nw s, = —— Ymi- y. (16.157b) 
nd n zi n—lzi 


The empirical correlation coefficient rey is given in (16.154b). The coefficients à and b are called regres- 
sion coefficients. The line y(x) = à + ba: is called the regression line. 

2. Confidence Intervals for the Regression Coefficients 

The next question is, after the determination of the regression coefficients à and b, how well do the 
estimates approximate the theoretical values a and b. Therefore the test variables are to be formed as 


b= (6-02? (16.158a) and t 


Ssyn=2 2 ovn 
Sy le K Syl — r2, δή 


These are realizations of random variables having a t distribution with m — n — 2 degrees of freedom. 
For a given significance level a the quantile 15/5, is taken from Table 21.20, p. 1138, and because 


P( |t| < ta/2m) = 1 — o holds for t — t, and t — ty : 
Sy L— δη dos? 
Ne (16.1591 
s,Vn-—2- ντ >) 


Sy /1— r2, 
s,vn—2' 

A confidence region for the regression line y = a + bx can be determined with the confidence interval 

given in (16.159a,b) for a and 6 (see Lit. [16.4], [16.11]). 


(16.158b) 


|b — b| < ta/an—2 (16.159a) |a@ — a| < ta/zn-2 





842 16. Probability Theory and Mathematical Statistics 





16.3.4.3 Multidimensional Regression 


1. Functional Dependence 
Suppose that there is a functional dependence between the characters X1, X2,..., Xn, and Y, which is 
described by the theoretical regression function 


5 
y — αι, ὅθι... Ὁ) ΞΕ So aj 9; (a1, Boios Bn): (16.160) 
j=0 
The functions gj(£1, £2, .. ., &n) are known functions of n independent variables. The coefficients aj 


in (16.160) are constant multipliers in this linear combination. The expression (16.160) is also called 
linear regression, although the functions g; can be arbitrary. 





E The function f (x1, £2) = ao + a124 4- 313 + a381? + Gagan” + G52122, which is a complete quadratic 
polynomial of two variables with g9 = 1, 9, = 71, g2 = 2, 93 = οι), g4 = πο), and gs = 21%, is an 
example for a theoretical linear regression function. 








2. Writing in Vector Form 


It is useful to write formulas in vector form in the multidimensional case 


X= (21, 29,...,2n)", (16.161) 
so, (16.160) now has the form: 
y — f(x) = Vo ajgy(x). (16.162) 
j-0 


3. Solution and Normal Equation System 
The theoretical dependence (16.160) cannot be determined by the measured values 


(x9, 5), (513, δὴ (16.1632) 
because of random measuring errors. Looking for the solution in the form 
y = f(x) 7 Y ájg;(x) (16.163b) 
j=0 


and using the least squares method (see 16.3.4.2, 1., p. 841) the coefficients à; as the estimations of the 
theoretical coefficients a; can be determined, from the equation 


N 


D [i - 7 (x9)] = mint. (16.163c) 


il 
Introducing the notation 


fig h g(x) οι (κ) ... s (x9?) 
a pa 2 α-]9 (x?) σι (x?) ΜΝ (x) (16.1634) 
ἂν fn so (x97). gr (2) 2. ga (x) 
one gets from (16.163c) the so-called normal system of equations 
GTGà - GTf (16.1630) 


to determine 4. The matrix GTG is symmetric, so the Cholesky method (see 19.2.1.2, p. 958) is espe- 
cially good to solve (16.163e). 

HM Consider the sample whose result is given in the next table. Determine the coefficients of the re- 
gression function (16.164): 
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21 5 3 5 3 


0.5 0.5 0.3 0.3 fF (a3, 22) — ao + ayy + ager. (16.164) 
f(r,z) L5 35 62 32 








ds 1.5 15 05 
kc d _ | 3.5 [13 05 
a= (5) | f7|o2|: 8711 5 03 (16.165) 
“a 3.2 1 3 03 
and (16.163e) is 
4ão + 16ã + 1.6 àz = 14.4, a = 7.0, 
16àg + 684, + 6.4 do = 58.6, i.e., ἄι-- 0.25, (16.166) 
1.6àg 4- 6.4a4 4- 0.68à5 — 5.32, ü9 — —11. 
4. Remarks . 
1. To determine the regression coefficients one starts with the interpolation f (x?) = fili = 
1,2,..., N), i.e., with 
Gà — f. (16.167) 


In the case s < N, (16.167) is an over determined system of equations which can be solved by the 
Householder method (see 19.6.2.2, p. 985). The multiplication of (16.167) by GT to get (16.163e) is 
also called Gauss transformation. If the columns of the matrix G are linearly independent, i.e., rank 
G = s+1 holds, then the normal system of equations (16.163e) has a unique solution, which coincides 
with the result of (16.167) got by the Householder method. 

2. Also in the multidimensional case, it is possible to determine confidence intervals for the regression 
coefficients with the t distribution, analogously to (16.159a,b). 

3. Bythe help of the F distribution (see 16.2.4.7, p. 823), it is possible to use a so-called equivalence test 
to analyze the assumption (16.163b). This test shows if a solution in the form (16.163b) but with less 
terms yields a sufficient approximation to the theoretic regression function (16.160). (see Lit. [16.10]). 


16.3.5 Monte Carlo Methods 
16.3.5.1 Simulation 


Simulation methods are based on constructing equivalent mathematical models. These models are then 
easily analyzed by computer. In such cases, one talks about digital simulation. A special case is given 
by Monte Carlo methods when certain quantities of the model are randomly selected. These random 
elements are selected by using random numbers. 


16.3.5.2 Random Numbers 

Random numbers are realizations of certain random quantities (see 16.2.2, p. 811) satisfying given 
distributions. In this way it is possible to distinguish different kinds of random numbers. 

1. Uniformly Distributed Random Numbers 


These numbers are uniformly distributed in the interval [0, 1], they are realizations of the random vari- 
able X with the following density function fo(x) and distribution function F(x): 


. - 0 for 0€ x, 
fo(x) — { T mE OU = 7 <1, Fo(t)= 42 for 0<2<1, (16.168) 
0 otherwise; 1l fr z21 


1. Method of the Inner Digits of Squares A simple method to generate random numbers is 
suggested by J. v. Neumann. It is also called the method of the inner digits of squares, and it starts 
from a decimal number z € (0,1) which has 2n digits. First z? is formed, getting a decimal number 
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which has 4n digits. Erasing its first and its last n digits, so one again has a number with 2n digits. To 
get further numbers, this procedure is repeated. In this way 2n digit decimal numbers are produced 
from the interval [0, 1] which can be considered as random numbers with a uniform distribution. The 
value of 2n is selected according to the largest number representable in the computer. For example, 
choosing 2n — 10. This procedure is seldom recommended, since it produces more smaller numbers 
than it should. Several other different methods have been developed. 

W 2» — 4: 
z — 29 — 0,1234, 22 — 0,01[5227 56, 

z= % = 0,5227, 2? = 0,27/3215 29, 

Z — zo — 0,3215 etc. 

The first three random numbers are zo, z; and 2ο. 




















2. Congruence Method The so-called congruence method is widely used: A sequence of integers 
z (i — 0,1,2,...) is formed by the recursion formula 


Zi =C- zi modm. (16.169) 


Here zọ is an arbitrary positive number and c and m denote positive integers, which must be suitably 
chosen. For z;,4 one takes the smallest non-negative integer satisfying the congruence (16.169). The 
numbers z;/m are between 0 and 1 and can be used for uniformly distributed random numbers. 

3. Remarks 

a) Choosing m — 2", where r is the number of bits in a computer word, e.g., r = 40. Then the number 
c is to be chosen in the order of ym. 

b) Random number generators using certain algorithms produce so-called pseudo-random numbers. 
c) On calculators and also in computers, “ran” or “rand” is used for generating random numbers. 


2. Random Numbers with other Distributions 


To get random numbers with an arbitrary distribution function F(x) the following procedure is in use: 
Consider a sequence of uniformly distributed random numbers &1, £,... from [0,1]. With these num- 
bers the numbers 7; = F'~1(&;) are formed for i = 1,2,... . Here F^! (x) is the inverse function of the 
distribution function F(x). Then one gets: 


Pim <2) = P(F HG) < 1) = PE < Fle) = | folt) d= Fla), (16.170) 
0 


i.e., the random numbers m, n2, . . . satisfy a distribution with the distribution function F(x). 


3. Tables and Application of Random Numbers 

1. Construction Random number tables can be constructed in the following way. Ten identical 
chips are indexed by the numbers 0, 1,2,...,9. Placing them into a box and shuffle them. One of them 
is then selected, and its index is written into the table. Then the chip is to be replaced into the box, 
again shuffling, and choosing the next one. In this way a sequence of random numbers is produced, 
which is written in groups (for easier usage) into the table. In Table 21.21, p. 1139, four random 
digits form a group. 

In the procedure, it must be guaranteed that the digits 0,1,2,...,9 always have equal probability. 


2. Application of Random Numbers The use of a table of random numbers is demonstrated with 
an example. 

W Suppose one chooses randomly n — 20 items from a population of N — 250 items. The objects 
are renumbered from 000 to 249. Then a number is chosen in an arbitrary column or row in Table 
21.21, p. 1139, and a rule is determined of how the remaining 19 random numbers should be chosen, 
e.g., vertically, horizontally or diagonally. Only the first three digits are considered from these random 


16.3 Mathematical Statistics 845 





numbers, and they are used only if they form a number smaller than 250. 


16.3.5.3 Example of a Monte Carlo Simulation 


The approximate evaluation of the integral 


i= Joo) dz (16.171) 


is considered as an example of the use of uniformly distributed random numbers in a simulation. Two 
solution methods are discussed here. 
1. Applying the Relative Frequency 
Supposing 0 < g(x) < 1 holds; this condition can always be guaranteed by 
a transformation (see (16.175), p. 845). Then the integral J is an area inside 
the unit square E (Fig. 16.16). Considering the numbers of a sequence 
of uniformly distributed random numbers from the interval [0, 1] in pairs 
as the coordinates of points of the unit square E, one gets n points P; (i = 
1,2,...,n). Denoting by m the number of points inside the area A, then 
considering the notion of the relative frequency (see 16.2.1.2, p. 808) for 
the integral follows: 

1 

m 
Figure 16.16 "Eo dos (16.172) 
0 





To achieve relatively good accuracy with the ratio in (16.172), a very large number of random numbers 
is necessary. This is the reason why one is looking for possibilities to improve the accuracy. One of 
these methods is the following Monte Carlo method. Some others can be found in the literature (see 
Lit. [16.20]). 

2. Approximation by the Mean Value 

To determine (16.171), one starts with n uniformly distributed random numbers &1, £,...,& as the 
realization of the uniformly distributed random variable X. Then the values g; = g(&) (i = 1,2,...,n) 
are realizations of the random variable g(X ), whose expectation according to formula (16.49a,b), p. 813, 


1s: 
co 


1 
1:55 ; 
BUX) = ο ος - 3o (16.173) 
S 0 | jl 
This method, which uses a sample to obtain the mean value, is also called the usual Monte Carlo 
method. 


16.3.5.4 Application of the Monte Carlo Method in Numerical 
Mathematics 
1. Evaluation of Multiple Integrals 


First, it is to show how to transform a definite integral of one variable (16.174a) into an expression 
which contains the integral (16.174b). 


b 1 
P= J h(x) de (6174) 1- / g(r)dz with Oxg(z)«1.  (16174b) 
a 0 
Then the Monte Carlo method given in 16.3.5.3 can be applied introducing the following notation: 
xz —a-d(b-—a)u, m= min h(x), M = max h(x). (16.175) 
αΕ[α,δὶ σε[α,δ] 


Then (16.174a) becomes 
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h(a 4- (b — a)u) — m 
M-m 





I* — (M — m)(b — a) ps du + (b — a)m, (16.176) 
h(a 4- (b — a)u) — m 
M-m 
W The approximate evaluation of multiple integrals with Monte Carlo methods is demonstrated by an 

example of a double integral 


V= / h(w,y)dady with h(a,y) > 0. (16.177) 
j 


where the integrand — g(u) satisfies the relation 0 € g(u) < 1. 


S denotes a plane surface domain given by the inequalities a € x < band yi(x) < y € p2(x), where 
pı(x) and φο(α) denote given functions. Then V can be considered as the volume of a cylindrical solid 
K, which stands perpendicular to the x, y plane and its upper surface is given by h(z, y). If h(x,y) € e 
holds, then this solid is in a block Q given by the inequalities a < «<b, e<y<d,0<2z< 
e (a,b,c,d,e const). After a transformation similar to (16.175), one gets from (16.177) an expression 
containing the integral 


y= ΠΟΙ with 0 € g(u,v) € 1, (16.178) 
ge 


where V* can be considered as the volume of a solid K* in the three-dimensional unit cube. The integral 
(16.178) is approximated by the Monte Carlo method in the following way: 
Triplets of the numbers of a sequence of uniformly distributed random numbers from the interval [0, 1] 
are considered as the coordinates of points P; (¢ = 1,2,...,n) of the unit cube, and count how many 
points among P; belong to the solid K*. If rm point belong to A*, then analogously to (16.172) 
m 
VIX, (16.179) 
n 
Remark: In definite integrals with one integration variable one should use the methods given in 19.3, 
p. 963. For the evaluation of multiple integrals, the Monte Carlo method is still often recommended. 


2. Solution of Partial Differential Equations with the Random Walk Process 


























The Monte Carlo method can be used for the approximate solution of partial differential equations with 
the random walk process. 
y | a) Example of a Boundary Value Problem: Con- 
sider the following boundary value problem as an ex- 
r ample: 
$E ὃ2 = 

y Au= 2 +2 =0 for (x,y) € G, (16.180a) 

y-r- x * Ox? 
yl- S u(x,y) — n for (x,y) ETL. (16.180b) 
Here G is a simply connected domain in the x, y plane; 























' I' denotes the boundary of G (Fig.16.17). Similarly 
i > to the difference method in paragraph 19.5.1, p. 976 
0 | x1 x x*1 x G is covered by a quadratic lattice, where it can be 





assumed, without loss of generality, that the step size 
Figure 16.17 can be chosen as h — 1. 


In this way interior lattice points P(x, y) and boundary points R; are obtained. The boundary points 
Ri, which are at the same time also lattice points, are considered in the following discussion as points 
of the boundary T of G, i.e.: 


u(R;) = f(R) (i—1,2,...,N) (16.181) 
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b) Solution Principle: One imagines that a particle starts a random walk from an interior point 
P(x,y). That is: 
1. The particle moves randomly from P(x, y) to one of the four neighboring points. To each of these 
four grid points 1/4 is assigned as the probability to move into them. 
2. If the particle reaches a boundary point R;, then the random walk terminates there with probability 
one. 
It can be proven that a particle starting at any interior point P reaches a boundary point R; after a 
finite number of steps with probability one. Denoting by 

p(P, Ri) = p(x, y), Ri) (16.182) 
the probability that a random walk starting at P(x,y) will terminate at the boundary point Ri, then 
one gets 

p(R; R;)-—1, p(R;R;)—0 for izj and (16.183) 








The equation (16.184) is a difference equation for p((x, y), R;). If starting n random walks from the 
point P(x,y), from which m; terminates at R; (m; < n), then 
mi 
p(x, y), Ri) D LIE 
n 
The equation (16.185) gives an approximate solution of the differential equation (16.180a) with the 
boundary condition (16.181). The boundary condition (16.180b) will be fulfilled if substituting 


(16.185) 


(P) = v2) = 3: fp. 9), Re) (16.186) 


because of (16.184), v(R;) = x f(Ri)p(R;, Ri) = f(R;)- 


To calculate v(, y) (16.184) is multiplied by f(2;). Summation yields the following difference equation 
for v(x, y): 


o(z,y) = Zle 1,y) + ole Log) + oley = 1) + oley + 1). (16.187) 


Starting n random walks from an interior point P(x, y), and among them m; terminate at the boundary 
point R; (i — 1,2,..., N), then an approximate value is obtained at the point P(x, y) of the boundary 
value problem (16.180a,b) by 


v(z, y) ez D (16.188) 
i=l 








16.3.5.5 Further Applications of the Monte Carlo Method 


Monte Carlo methods as stochastic simulation, sometimes called methods of statistical experiments, are 
used in many different areas. For examples: 

e Nuclear techniques: Neutrons passing through material layers. 

e Communication: Separating signals and noise. 

e Operations research: Queueing systems, process design, inventory control, service. 

For further details of these problem areas see for example [16.15]. 
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16.4 Calculus of Errors 


Every scientific measurement, giving certain numerical quantities — regardless of the care with which 
the measurements are made — is always subject to errors and uncertainties. There are observational 
errors, errors of the measuring method, instrumental errors and often errors arising from the inherent, 
random nature of the phenomena being measured. Together they compose the measurement error. 
All measurement errors arising during the measuring process are called deviations. As a consequence 
a measured quantity represented by a number of significant digits can be given only with a rounding 
error, i.e., with a certain statistical error, which is called the uncertainty of the result. 

1. The deviations of the measuring process should be kept as small as possible. On this basis it is to 
be evaluated the possible best approximation, what can be done with the help of smoothing methods 
which have their origin in the Gaussian least squares method. 

2. The uncertainties have to be estimated as well as possible, what can be done with the help of meth- 
ods of mathematical statistics. 

Because of the random character of the measuring results they can be considered as statistical samples 
(see 16.2.3, 1., p. 814) with its probability distribution, whose parameters contain the desired informa- 
tion. In this sense, measurement errors can be seen as sampling errors. 


16.4.1 Measurement Error and its Distribution 


16.4.1.1 Qualitative Characterization of Measurement Errors 
Qualifying the measurement errors by their causes, the following three types of errors are distinguished: 
1. Rough errors are caused by inaccurate readings or confusion; they are excludable. 





2. Systematic measurement errors are caused by inaccurately scaled measuring devices and by the 
method of measuring, where the method of reading the data and also the measured error of the mea- 
surement system can play a role. They are not always avoidable. 

3. Statistical or random measurement errors can arise from random changes of the measuring con- 
ditions that are difficult or impossible to control and also by certain random properties of the events 
observed. 


In the theory of measurement errors usually it is assumed that the rough errors and the systematic 
measurement errors are excludable, so that only the statistical properties and the random measure- 
ment errors are included into the calculation of the errors. 


16.4.1.2 Density Function of the Measurement Error 


1. Measurement Protocol 

Το calculate the characterization of the uncertainties it must be supposed that the measured results 
are listed in a measurement record as a prime notation and that the relative frequencies or the density 
function f(a) or the cumulative frequencies or the distribution function F(v) (see 16.3.2.1, p. 832) of 
the uncertain values are available. The realization of the random variable X under consideration is 
denoted by 2. 

2. Error Density Function 

Special assumptions about the properties of the measurement error result in certain special properties 
of the density function of the error distribution: 

1. Continuous Density Function Since the random measurement errors can take any value in a 
certain interval, they are described by a continuous density function f(x). 

2. Even Density Function If measurement errors with the same absolute value but with different 
signs are equally likely, then the density function is an even function: f(—x) — f(x). 

3. Monotonically Decreasing Density Function If a measuring error with larger absolute value 
is less likely than an error with smaller absolute value, then the density function f(x) is monotonically 
decreasing for x > 0. 
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4. Finite Expected Value The expected value of the absolute value of the error must be finite: 
E(X|) - J le|f (z) dz « oo. (16.189) 


Different properties of the errors result in different types of density functions. 

3. Normal Distribution of the Error 

1. Density Function and Distribution Function In most practical cases it can be supposed that 
the distribution of the measurement error is a normal distribution with expected value u = 0 and 


variance o°, i.e., the density function f(x) and the distribution function F(x) of the measurement 
error are: 








we i 2 
1 -# 1 t 7 
a) = : 20? — (16.190: i £) = -207 dt = (=) 
f(x) 5 o. ( a) and F(x) sm e 20 dt— z) (16.190b) 


Here (x) is the distribution function of the standard normal distribution (see (16.73a), p. 819, and 
Table 21.17, p. 1133). In the case of (16.190a,b) one speaks about normal errors. 

2. Geometrical Representation The density function (16.190a) is represented in Fig. 16.18a with 
inflection points and points at the center of gravity, and its behavior is shown in Fig. 16.18b when the 
variance changes. The inflection points are at the abscissa values +ø; the centers of gravity of the half- 








areas are at +7. The maximum of the function is at x = 0 and it is 1/(o v27). The curve widens as o? 
increases, the area under the curve always equals one. This distribution shows that small errors occur 
often, large errors only seldom. 








Figure 16.18 


4. Parameters to Characterize the Normally Distributed Error 


Beside the variance c? or the standard deviation c which is also called the mean square error or standard 
error, there are other parameters to characterize the normally distributed error, such as the measure 
of accuracy h, the average error or mean error n, and the probable error y. 


1. Measure of Accuracy Beside the variance c?, the measure of accuracy 
h=—z (16.191) 
is used to characterize the width of the normal distribution. A narrower Gauss curve results in better 


accuracy (Fig. 16.18b). Replacing o by the experimental value & or 64 obtained from the measured 
values, the measure of accuracy characterizes the accuracy of the measurement method. 
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2. Average or Mean Error The expected value y of the absolute value of the error is defined as 


= E(|X|) = 2 [ sf) duco cu (16.192 
0 


π 
3. Probable Error The bound » of the absolute value of the error with the property 


P(X| <1 =5 (16.193a 


2 
is called the probable error. It implies that 


nz de = 26( a Ef e? dt = (16.193b 


where zm is the distribution function of the standard normal distribution. The condition (16.193b 
is a non-linear equation which can be solved approximately by the help of a computer algebra system 
to get y/o. The result is 


1 & 0.6745 . (16.193c 





4. Given Error Bounds If an upper bound a > 0 of an error is given, then with (16.193b) can be 
calculated the probability that the error is in the interval [—a, a]: 


P(X| X a) =20 (2) Eh (16.194) 


5. Relations between Standard Deviation, Average Error, Probable Error, 
and Accuracy If the error has a normal distribution, then the following relations hold using 
(16.193c): 


1 
η-- Jae. (16.1952) y = 0.67450, (16.1955) h= Eg (16.195c) 


16.4.1.3 Quantitative Characterization of the Measurement Error 


1. True Value and its Approximations 

The true value «x, of a measurable quantity is usually unknown. Therefore the expected value of the 
random variables, whose realizations are the measured values x; (i = 1,2,...,m), is chosen as an esti- 
mated value of £y. Consequently, the following means can be considered as an approximation of £y. 
1. Arithmetical Mean 


Me 
R 


k 
(16.196a) or T= Y hj, (16.196b) 

T ici j=l 

if the measured values are distributed into k classes with absolute frequencies h; and class means 

Tj (j = 1525s). 

2. Weighted Mean 


=Y gm Y o (16.197) 
ici ici 


Here the single measured values are weighted by the weighting factors g; (g; > 0) (see 16.4.1.6, 1., 
p. 854). 

2. Error of a Single Measurement in a Measurement Sequence 

1. The True Error of a Single Measurement in a Measurement Sequence is the difference 
between the true value z,, and the measuring result. Because this is usually unknown, the true error 
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ει οἱ the i-th measurement with the result x; is also unknown: 

£j = Ly — %. (16.198) 
2. The Mean Error or Apparent Error of a Single Measurement in a Measurement Se- 
quence is the difference of the arithmetical mean and the measurement result z;: 

Ui = Y — m. (16.199) 
3. Mean Square Error of a Single Measurement or Standard Error of a Single 

Measurement Since the expected value of the sum of the true errors e; and the expected value 

of the sum of the apparent errors v; of n measurements are zero (independently of how large they are), 
they are characterized by the sum of the error squares: 


2 


=e’, (16.200a) v= Sv. (16.200b) 


1 i=1 


M: 


D 


From a practical point of view only the value of (16.200b) is interesting, since only the values of v; can 
be determined from the measuring process. Therefore, the mean square error of a single measurement 
of a measurement sequence is defined by 


ὅ- Στό - 1). (16.200c) 


The value & is an approximation of the standard deviation c of the error distribution. 
One gets for & — c in the case of normally distributed error: 


P(|e| € 6) = 28(1) — 1 = 0.6826. (16.200d) 
That is: The probability that the absolute value of the true value does not exceed oc, is about 68%. 
4. The Probable Error of a single measurement is the number y, for which 
1 
P(|le| € y) 2 z. 
(de $535 
That is: The probability that the absolute value of the error does not exceed », is 5096. "The abscis- 
sae +y divide the area of the left and the right parts under the density function into two equal parts 
(Fig. 16.18a). 
In the case of a normally distributed error in accordance to (16.193c) holds 
y £: 0.6745 a z 0.67450 — 5. (16.201b) 
5. Average Error of a single measurement is the number η, which is the expected value of the 
absolute value of the error: 


(16.2012) 





n = Ε(ε|) -- / |x| f(a) dx. (16.202a) 


/2 
In the case of a normally distributed error follows 7 = 4/—o and 
π 


Ρ(ε| «η) - 26 (2) -1-20 (5) — 1 = 0.5751 : (16.202b) 
e π 


The probability that the error does not exceed the value 7 is about 57.6 %. The centers of gravity of 
the left and right areas under the density function (Fig. 16.18a) are at abscissae +7. In the case of 
normally distributed errors: 


2 2 
n= 20 x a7 =ñ. (16.202c) 
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3. Error of the Arithmetical Mean of a Measurement Sequence 

The error of the arithmetical mean z of a measurement sequence is given by the errors of the single 
measurement: 

1. Mean Square Error or Standard Deviation 


OAM = » d fi n—1)- "i (16.203) 
i=1 
2. Probable Error 
Jam 0.6745 > u VAT n — 1)] — 0.6745 M (16.204) 


3. Average Error 


Nam © 0.7979 Σ» m n — 1)] 2 0. Tm. (16.205) 


4. Accessible Level of Error Since the three types of errors defined above (16.203)-(16.205) are 
directly proportional to the corresponding error of the single measurement (16.200c), (16.201b) and 
(16.202c) and they are proportional to the reciprocal of the square root of n, it is not reasonable to 
increase the number of the measurements after a certain value. It is more efficient to improve the 
accuracy h of the measuring method (16.191). 

4. Absolute and Relative Errors 

1. Absolute Uncertainty, Absolute Error The uncertainty of the results of measurement is char- 
acterized by errors £;i, Vi, Ci, Yi, Mi, Or €, V, O, Y, N, which measure the reliability of the measurements. 
The notion of the absolute uncertainty, given as the absolute error, is meaningful for all these types of 
errors and for the calculation of error propagation (see 16.4.2, p. 854). They have the same dimension 
as the measured quantity. 

The word “absolute” error is introduced to avoid confusion with the notion of relative error. Often the 
notation Ax; or Ax is used. The word “absolute” has a different meaning from the notion of absolute 
value: It refers to the numerical value of the measured quantity (e.g., length, weight, energy), without 
restriction of its sign. 

2. Relative Uncertainty, Relative Error The relative uncertainty, given by the relative error, 
is a measure of the quality of the method of measurement with respect to the numerical value of the 
measured quantity. In contrast to the absolute error, the relative error has no dimension, because it is 
the quotient of the absolute error and the numerical value of the measured quantity. If this value is not 
known, one replaces it by the mean value of the quantity x: 





Az; Ax; 
due — ea, (16.2062) 
T T 
The relative error is given mostly as a percentage and it is also called the percentage error: 
6x;/% = da; - 100%. (16.206b) 


5. Absolute and Relative Maximum Error 
1. Absolute Maximum Error If the quantity z is to be determined and is z a function of the mea- 
sured quantities 21, %2,...,Un, ie., 2 = f(#1,%2,..., Un), then the resulting error must be calculated 
taking also the function f into consideration. There are two different ways to examine errors. The 
first approach is that statistical error analysis is applied by smoothing the data values using the least 
squares method (min 7(z; — z)?). In the second approach, an upper bound Azmax is determined for 
the absolute error of the quantities. With n independent variables z;, holds: 
n 
Nga = Σ Z fen Teci rina Ati (16.207) 
1 


i=1 
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where the mean value z; should be substituted for xi. 
2. Relative Maximum Error The relative maximum error is the absolute maximum error divided 
by the numerical value of the measured value (mostly by the mean of z): 


AZmax AZmax a 
di 2c e. m (16.208) 


2 zZ 








16.4.1.4 Determining the Result of a Measurement with Bounds on the 


Error 
A realistic interpretation of a measurement result is possible only if the expected error is also given; 
error estimations and bounds are components of measurement results. It must be clear from the data 
what is the type of the error, what is the confidence interval and what is the significance level. 
1. Defining the Error The result of a single measurement is required to be given in the form 























c=ajtArerajtea, (16.209a) 
and the result of the mean has the form 
v=G+ Atay τε σΑΜ. (16.209b) 
Here for Ax the most often used standard deviation & is applied. ^ and 7] could also be used. 
X-—mzx 
2. Prescription of Arbitrary Confidence Limits The quantity T = fu u has a t distribution 
OAM 


(16.97b) with f — n — 1 degrees of freedom in the case of a population with a distribution N (p, o?) 
according to (16.96). For a required significance level a or for an acceptance probability S — 1 — o the 
confidence limits for the unknown quantity z, — ji with the t quantile ta/2;f are 


Ji — X c tay? OAM. (16.210) 





That is, the true value z,, is in the interval given by these limits with a probability S = 1 — a. 
Mostly it is of interest keeping the size n of the measurement sequence at its lowest possible level. The 
length 215/5.,6 Ay of the confidence interval decreases for a smaller value of 1 — o and also for a larger 





number n of measurements. Since &4y decreases proportionally to 1//n and the quantile t5/5;; with 


f =n-—1 degrees of freedom also decreases proportionally to 1/,/n (for values of n between 5 and 10, 
see Table 21.20, p. 1138) the length of the confidence interval decreases proportionally to 1/n for such 
values of n. 


16.4.1.5 Error Estimation for Direct Measurements with the Same 


Accuracy 
If there is the same variance o; for all n measurements, one talks about measurements with the same 
accuracy h = const. In this case, the least squares method results in the error quantities given in 
(16.200c), (16.201b), and (16.202b). 
W Determine the final result for the measurement sequence given in the following table which contains 
n — 10 direct measurements with the same accuracy. 



































a; | 1.592 | 1.581 | 1.574 | 1.566 | 1.603 | 1.580 | 1.591 | 1.583 | 1.571 | 1.559 
v-10|—12| —1| +6] +14] —23 0|-11| -δ 4+9] 421 
υ «1061 144 1 36] 196] 529 0. 121 9 81| 441 
T — 1.580, 6 = | Mov? /(n — 1) 2 0.0131, &4y — 6/ V/n — 0.004. 
i=l 














Final result: x = T + am = 1.580 + 0.004. 
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16.4.1.6 Error Estimation for Direct Measurements with Different 
Accuracy 


1. Weighted Measurements 

If the direct measurement results x; are obtained from different measuring methods or they represent 
means of single measurements, which belong to the same mean z with different variances 6;?, then a 
weighted mean 


zo =Y onj} (16.211) 
i=1 i=1 


is calculated, where g; is defined as 
53 
δ 
0) Sas 16.212 
9: Ge ( ) 
Here c is an arbitrary positive value, mostly the smallest ¢;. It serves as a weight unit of the deviations, 
i.e., for 6; = ¢ it is g; = 1. It follows from (16.210) that a larger weight of a measurement results in a 
smaller deviation 6;. 
2. Standard Deviations 


The standard deviation of the weight unit is estimated as 


a9) = Dain? /(n zu. (16.213) 
i=1 


It must be sure that 69 — 6. In the opposite case, if ¢% > G, then there are x; values which have 
systematic deviations. 
The standard deviation of the single measurement is 
a) a 
i σ ow, ' 
G9 = = —46é;, (16.214) 
gi σ 





where 6; 9 < 6; can be expected. 
The standard deviation of the weighted mean is: 


siy = sf s - ie (o = DX). (16.215) 
i=l 


i=l i=l 





3. Error Description 

The error can be described as it is represented in 16.4.1.4, p. 853, either by the definition of the error 
or by the t quantile with f degrees of freedom. 

W The final results of measurement sequences (n = 5) with different means 7; (i = 1,2,...,5) and 
with different standard deviations 96 4j, are given in Table 16.6. 

Calculating (T;)m = 1.5830 and choosing ap = 1.585 and & = 0.009 with z; = T; — £o, gi = 6? /6;? one 


gets Z = —0.0036 and 7 = zo + Z = 1.582. The standard deviation is 69) = Yan? /(n -1)- 
Vici 


0.0088 < 6 and o, = Gay = 0.0027. The final result is x = F +o, = 1.585 + 0.0027. 











16.4.2 Error Propagation and Error Analysis 


Measured quantities appear in final results often in the form of functional dependencies. One talks 
about error propagation. If the error is small, then a Taylor expansion with respect to the error can be 
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Table 16.6 Error description of a measurement sequence 





Ti ΖΑΜι 92 M, gi Zi gii zi gizi? 
1.573 0.010 | 1.0-10-4 | 0.81 | —1.2-10-2 | 9.71073 | 1.44-107* | 1.161074 
1.580 0.004 | 1.6-10-° | 5.06 | —5.0-10-3 | —2.5-107? | 2.50-1075 | 1.26-1074 
1.582 0005 | 2.5.40? | 3.24 | —3.0-10? | —9.7-10? | 9.0.1075 | 2.91-10- 
1.589 0.009 | 8.1105 | 1.00 | +4.0-1073 | 4.0-107? | 1.6.1075 | 1.6-10-5 
1.591 0.01 | L21-107^ | 0.66 | +6.0-1078 | 3.9107? | 3.6107 | 2.37.1079 


n n 
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— 1.583 | — 0.009 —10.7 — 3.6107? =3.1-10-4 





























used neglecting the terms of second and higher order. 
16.4.2.1 Gauss Error Propagation Law 


1. Problem Formulation 

Suppose it is to determine the numerical value and the error of a quantity z given by the function 
z — f(ri,22,..., x4) of the independent variables x; (j — 1,2,...,k). The mean value z; obtained 
from nj measured values is considered as realizations of the random variable z;, with variance o;?. The 
question is how the errors of the variables affect the function value f(z1, 9, .. . , y). It is assumed that 
the function f (x1, £2, . . . , £p) is differentiable and its variables are stochastically independent. However 
they may follow any type of distribution with different variances ση”. 

2. Taylor Expansion 

Since the error represents relatively small changes of the independent variables, the function f (21, x, . . . , 
x) can be approximated in the neighborhood of the mean z; by the linear part of its Taylor expansion 
with the coefficients a;, so its error A f is: 


Af = f(zi 22,... 2.) — fX 92... 04), (16.2162) 
k k 
Af x df = E tı + PT +: PEZI dr; = X --- of 11} -- αμα, (16.216b) 
ὅσα T Lk τι ὅτι ΞΙ 
where the partial derivatives O f /O; are taken at (74,9, .. . , 2). 


The variance of the function is 


k 
σι = On. + On ++ ak Orp = Σ α σαι”. (16.217) 


3. Approximation of the Variance oF 

Since the variances of the independent variables x; are unknown, they can be approximated by the 
variance of their mean, which is determined from the measured values aj (l = 1, 2, . . . , n4) of the single 
variables as follows: 


nj 
3 Gam) 
52. 151 : 
4 mm, 16.21 
Ons n(n; — 1) (10:218) 


Using these values one can form an approximation of o;?: 


k 
83 — Y ajo. (16.219) 
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The formula (16.219) is called the Gauss error propagation law. 
4. Special Cases 


1. Linear Case An often occurring case is the summation of the values of the errors of linearly oc- 
curring error quantities with a; — 1: 


Gp = GR + B+ +--+ Gp. (16.220) 


W The pulse length is to be measured at the output of a pulse amplifier of a detector channel for spec- 
trometry of radiation, whose error can be deduced for three components: 

1. statistical energy distribution of the radiation of the part passing through the spectometer with an 
energy Eo, which is characterized by 6g, 

2. statistical interference processes in the detector with Gpet, 

3. electronic noice of the amplifier of the detector impulse 94. 

The total pulse length has the error 


85 — 2, + ey + G3. (16.221) 


2. Power Rule The variables x; often occur in the following form: 





Z = f(11,22,...24) — axi < £3? ... Ep. (16.222) 
By logarithmic differentiation the relative error is 

1 da: da: dtp 

τω” (16.223) 

f qi 22 Ëg 


from which by the error propagation law follows for the mean relative error: 


(16.224) 





Bl Suppose the function f (£1, £2, £3) has the form f (x1, £2, £3) = /2125?v3?, and the standard devia- 
tions are Ori, Ceg and Ogg. The relative error is then 
O35 στι 


(52) *62) «62 


5. Difference to the Maximum Error 

Declaring the absolute or relative maximal error (16.207), (16.208) means that no smoothing for the 
values of the measurement is been made. For the determination of the relative or absolute error with 
the error propagation laws (16.219) or (16.222), smoothing between the measurement values zj means 
that a confidence interval for a previously given level is to be determined. This procedure is given in 


16.4.1.4, p. 853. 
16.4.2.2 Error Analysis 


The general analysis of error propagation in the calculations of a function y(x;), when quantities of 
higher order are neglected, is called error analysis. In the framework of the theory of error analysis one 
investigates using an algorithm, how an input error Az; affects the value of y(x;). In this context one 
also talks about differential error analysis. 











1&5, 








2 Tı 


In numerical mathematics, error analysis means the investigation of the affect of errors of methods, of 
rounding, and of input errors to the final result (see [19.24]). 


17 DynamicalSystems and Chaos 


17.1 Ordinary Differential Equations and Mappings 


17.1.1 Dynamical Systems 
17.1.1.1 Basic Notions 


1. The Notion of Dynamical Systems and Orbits 

A dynamical system is a mathematical object to describe the development of a physical, biological 
or another system from real life depending on time. It is defined by a phase space M, and by a one- 
parameter family of mappings v: M — M, where t is the parameter (the time). In the following 
discussion the phase space is often IR", a subset of it, or a metric space. The time parameter t is from 
R (time continuous system) or from Z or from Z, (time discrete system). Furthermore, it is required 
for arbitrary x € M that 

a) e? (x) = x and 

b) v! (q*(x)) — q'^*(x) for all t, s. The mapping v! is denoted briefly by v. 

Hereafter, the time set is denoted by T, hence, I = R, L = R}, I = Z or I = Z4. IfI = R, then the 
dynamical system is also called a flow; if l = Z or I = Z,, then the dynamical system is discrete. In 
case l’ = R and I = Z, the properties a) and b) are satisfied for every t € I, so the inverse mapping 
(τα -- wg also exists, and these systems are called invertible dynamical systems. 








If the dynamical system is not invertible, then y™(A) means the pre-image of A with respect to v, 
for an arbitrary set A C M and arbitrary t » 0, i.e, o^ '(A) = {x € M: t(x) € A}. If the mapping 
t: M > M is continuous or k times continuously differentiable for every t € I' (here M C R”), then 
the dynamical system is called continuous or C^-smooth, respectively. 

For an arbitrary fixed x € M, the mapping t > yt (x), t € T, defines a motion of the dynamical 
system starting from x at time t — 0. The image y(x) of a motion starting at x is called the orbit (or 
the trajectory) through x, namely (x) — (v(v) er. Analogously, the positive semiorbit through z is 
defined by 4*(r) — (vf(x))i2o and, if 7 Z R4 or DP z Z,, then the negative semiorbit through x is 
defined by 5^ (x) — (vf (x))izo- 

The orbit y(x) is a steady state (also equilibrium point or stationary point) if y(x) = {x}, and it is 
T-periodic if there exists a T € ΙΓ, Τ > 0, such that ^T (z) = t(x) for allt € I, and T € T is the 
smallest positive number with this property. The number T is called the period. 

2. Flow ofa Differential Equation 


Consider the ordinary linear planar differential equation 


i= f(x), (17.1) 
where f: M — R" (vector field) is an r-times continuously differentiable mapping and M = R” or 
M is an open subset of R”. From now on, the Euclidean norm || - || is used in R”, i.e., for arbitrary E 


n 
x € R”, £ — (z,...,2,), its norm is ||r|| 7 ,|9 27. If the mapping f is written componentwise 
Vil 


f = (fi,---,fn), then (17.1) is a system of n scalar differential equations z; — fi(1,..., v4), à — 
ey Me 





The Picard—Lindel6f theorem on the local existence and uniqueness of solutions of differential equations 
and the theorem on the r-times differentiability of solutions with respect to the initial values (see [17.11]) 
guarantee that for every zo € M, there exist a number & 7» 0, a sphere Bs(xo) — («: || — xol| < ô} in 
M and a mapping e: (—6,£) x Bs(vo) — M such that: 

1. y(-,-) is (r + 1)-times continuously differentiable with respect to its first argument (time) and r- 
times continuously differentiable with respect to its second argument (phase variable); 
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2. For every fixed x € Bs(co), vy (-, x) is the locally unique solution of (17.1) in the time interval (--ε, ε) 

dp 
ot 
(0, x) = x, and every other solution with initial point x at time t = 0 coincides with y(t, x) for all 
small |t]. 


which starts from x at time t = 0, i.e., (t,x) = ġ(t,x) = f(p(t,x)) holds for every t € (—e€,€), 


Suppose that every local solution of (17.1) can be extended uniquely to the whole of R. Then there 
exists a mapping y: R x M — M with the following properties: 

1. y(0,%) =a forall «eM. 

2. p(t + s,x) = v(t, p(s, x)) for allt, s € Randall z € M. 

3. (p(-,-) is continuously differentiable (r + 1) times with respect to its first argument and r times with 
respect to the second one. 

4. For every fixed x € M, y(-, x) is a solution of (17.1) on the whole of R. 


Then the C’-smooth flow generated by (17.1) can be defined by v: — c(t,-). The motions g(-, x): 
IR — M of a flow of (17.1) are called integral curves. 


E The equation 














$—c(y—-2) g-—rz—-y-zz, z-zy-—bz (17.2) 
is called a Lorenz system of convective turbulence (see also 17.2.4.3, p. 887). Here o » 0, r > 0 and 
b > 0 are parameters. A C^? flow on M = R? corresponds to the Lorenz system . 

3. Discrete Dynamical System 
Consider the difference equation 

vta — gx), (17.3) 

which can also be written as an assignment x —> (x). Here o: M — M is a continuous or r times 


continuously differentiable mapping, where in the second case M C IR". If y is invertible, then (17.3) 
defines an invertible discrete dynamical system through the iteration of v, namely, 


gt =o---oy, for t>0, gt=plo---op}, for t<0, y°=id. 
eS (17.4) 
t times, —t times, 


If y is not invertible, then the mappings y* are defined only for t > 0. For the realization of v see 
(5.74), p. 333. 
W A: The difference equation 

Ti+1 = QT; (1 — z), t=0,1,... (17.5) 
with parameter a € (0, 4] is called a logistic equation. Here M — [0,1], and q: [0,1] — [0,1] is the 
function y(x) = az(1 — x) for a fixed a. Obviously, y is infinitely many times differentiable, but not 
invertible. Hence (17.5) defines a non-invertible dynamical system. 


E B: The difference equation 





Tui =Y +1- ar, Yr =bn t=0,41,..., (17.6) 
with parameters a > 0 and b Æ 0 is called a Hénon mapping. The mapping y: R? — R? corresponding 
to (17.6) is defined by y(x, y) = (y+1—az?, bx), it is infinitely many times differentiable and invertible. 
4. Volume Contracting and Volume Preserving Systems 
The invertible dynamical system {%* }er on M C R” is called volume depressing or dissipative (volume 
preserving or conservative), if the relation vol(y’(A)) < vol(A) (vol(y*(A)) = vol(A)) holds for every 
set A C M with a positive n-dimensional volume vol(A) and every t 7 0 (t€ I). 

W A: Let ¢ in (17.3) be a C’-diffeomorphism (i.e.: o: M — M is invertible, M C R” open, y and 
yt are C"-smooth mappings) and let D(x) be the Jacobi matrix of y in x € M. The discrete system 
(17.3) is dissipative if | det Dy(x)| < 1 for all z € M, and conservative if | det Dop(x) | = 1 in M. 
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E B: For the system (17.6) Do(v,y) — Ti a and so | det Dy(x,y)| = b. Hence, (17.6) is 


dissipative if |b| < 1, and conservative if |b] = 1. 
The Hénon mapping can be decomposed into three mappings (Fig. 17.1): First, the initial domain 


is stretched and bent by the mapping i = z vU = y+ 1 — ax? in a area- preserving way, then it is 
contracted in the direction of the z'-axis liri? g= br E = y’ (at |b| « 1), and finally it is reflected with 
respect to the line y” = x” by z” = y", y" = x" 





y y y' y" 
» » » » 
x x x" x 


Figure 17.1 
17.1.1.2 Invariant Sets 


1. a-andw-Limit Sets, Absorbing Sets 
Let (v! er be a dynamical system on M. The set A C M is invariant under {p'}, if y'(A) = A holds 
for allt € I’, and positively invariant under {y'}, if p'(A) C A holds for all t 7 0 from I. 
For every x € M, the w-limit set of the orbit passing through wx is the set 
w(x) ={ye M:4t,ET, tn> +o, q^(r) —y asn— +00}. (17.7) 
The elements of w(a) are called w-limit points of the orbit. If the dynamical system is invertible, then 
for every x € M, the set 
a(zr)-(y € M:3t, € I, t, —oo, q^(z) y asn— oo) (17.8) 
is called the a-limit set of the orbit passing through x; the elements of a(x) are called the a-limit points 
of the orbit. 
For many systems which are volume decreasing under the flow there exists a bounded set in phase 
space such that every orbit reaching it stays there as time increases. A bounded, open and connected 
set U C M is called absorbing with respect to (o! );er, if e^ (U) C U holds for all positive t from T. (U 
is the closure of U.) 


E Consider the system of differential equations 


d y-z(l-a?—4?), g-zzc-y(l-a?—4?) (17.9a) 


in the plane. Using the polar coordinates x = r cos V, y = r sin ð, the solution of (17.9a) with initial 
state (ro, 9g) at time t — 0 has the form 

r(t,ro) — [L+ (r9? — 1) e?]712,. (t, 09) = t + o. (17.9b) 
This representation of the solution shows that the flow of (17.9a) has a periodic orbit with period 27, 
which can be given in the form 4((1,0)) — ((cost, sint), t € [0,27]. The limit sets of an orbit through 
p are: 








(0,0, lp] «1 
1,0)), 1 0,0), 
a) =4 (1,0), |p =1, ana wp) = {MEW BA OO 
20 dp] 1 
Every open sphere B, — f(x, y): a? + y? < r?} with r > 1 is an absorbing set for (17.9). 
2. Stability of Invariant Sets 


Let A be an invariant set of the dynamical system {y*},<r defined on (M, p). The set A is called stable, 
if every neighborhood U of A contains another neighborhood U, C U of A such that y*(U1) C U holds 
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for all t » 0. The set A, which is invariant under (f), is called asymptotically stable if it is stable and 
the following relations are satisfied: 


Yz € M 
dist(z, A) « A 


Here, dist(x, A) = inf p(x, y). 
ye 


3Δ 50 ) : dist(y'(x), A) —+0 for t— +00. (17.10) 


3. Compact Sets 

Let (M, p) be a metric space. A system {U;}ier of open sets is called an open covering of M if every 
point of M belongs to at least one U;. The metric space (M, p) is called compact if it is possible to 
choose finitely many Ui, ..., U;, ftom every open covering {U;}ier of M such that M = Un U --- UUj, 
holds. The set K C M is called compact if it is compact as a subspace. 

4. Attractor, Domain of Attraction 

Let (o! )ier be a dynamical system on (M, p) and A an invariant set for (v). Then W(A) — (x € 
M: w(x) C A} is called the domain of attraction of A. A compact set A C M is called an attractor of 
(oer on M if A is invariant under (o) and there is an open neighborhood U of A such that w(x) = A 
for almost every (in the sense of Lebesgue measure) x € U. 

HB A= ((1,0)) is an attractor of the flow of (17.9a). Here W(A) = R? \ {(0, 0)}. For some dynamical 
systems, a more general notion of an attractor makes sense. So, there are invariant sets A which have 
periodic orbits in every neighborhood of A which are not attracted by A, e.g., the Feigenbaum attractor. 
The set A may not be generated by a single limit set w. A compact set A is called an attractor in the 
sense of Milnor of the dynamical system {y*}:er on M if A is invariant under {pt} and the domain of 
attraction of A contains a set with positive Lebesgue measure. 


17.1.2 Qualitative Theory of Ordinary Differential Equations 
17.1.2.1 Existence of Flows, Phase Space Structure 
1. Extensibility of Solutions 


Besides the differential equation (17.1), which is called autonomous, there are differential equations 
whose right-hand side depends explicitly on the time and they are called non-autonomous: 

i -— f(t,x). (17.11) 
Let f: Rx M — M bea C"-mapping with M C IR". By the new variable z,4,1 :— t, (17.11) can 
be interpreted as the autonomous differential equation @ = f(@n41,2), $441 — 1. The solution of 
(17.11) starting from 2 at time to is denoted by ¢(-, to, zo). In order to show the global existence of 
the solutions and with this the existence of the flow of (17.1), the following theorems are useful. 
1. Criterion of Wintner and Conti If M — R" in (17.1) and there exists a continuous function 

+œ 1 
w: [0, +00) > [1, +00), such that || f(£)|| < w (llæ||) for all x € R” and f τ] dr — --co holds, then 
ο w(r 

every solution of (17.1) can be extended onto the whole of R+. 
W For example, the following functions satisfy the criterion of Wintner and Conti: w(r) = Cr + 1 or 
w(r) = Cr|In r| + 1, where C > 0 is a constant. 
2. Extension Principle Ifa solution of (17.1) stays bounded as time increases, then it can be ex- 
tended to the whole of R,. 
Assumption: In the following discussion the existence of the flow {y'}er of (17.1) is always assumed. 
2. Phase Portrait 
a) If y(t) is asolution of (17.1), then the function y(¢+c) with an arbitrary constant c is also a solution. 
b) Two arbitrary orbits of (17.1) have no common point or they coincide. Hence, the phase space of 
(17.1) is decomposed into disjoint orbits. The decomposition of the phase space into disjoint orbits is 
called a phase portrait. 
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c) Every orbit, different from a steady state, is a regular smooth curve, which can be closed or not 
closed. 

3. Liouville's Theorem 

Let {yt her be the flow of (17.1), D C M C R" be an arbitrary bounded and measurable set, D, : 


= y'(D) and V, :— vol(D;) be the n-dimensional volume of D; (Fig. 17.2a). Then the relation 
1 
a Y= / div f (x) da holds for arbitrary t € R. For n = 3, Liouville’s theorem states: 
Di 
1 
AU = 1 div f (£1, £2, £3) dxı dz da. (17:12) 
Jg. 


T 





) D 


Figure 17.2 


Corollary: If divf(x) < 0 in M holds for (17.1), then the flow of (17.1) is volume contracting. If 
div f(x) = 0 in M holds, then the flow of (17.1) is volume preserving. 


E A: For the Lorenz system (17.2), div f(x,y,z) = —(o +1 +b). Since o > 0and b > 0, divf (x,y,z) < 


d 
0 holds. With Liouville’s theorem, m Y= i --(σ-ε 1-0) dai dog dx — —(e 4- 1-4- b) V; obviously 
Jk 


holds for any arbitrary bounded and measurable set D C RË. The solution of the linear differential 
equation V; = — (ø + 1 + b) V; is V, — Vg - e-(^***"*. so that V; — 0 follows for t — --oo. 





OH 
E B: Let U C R^x R" bean open subset and H: U — RR a C?-function. Then, i; — δ (x,y), ji = 
Yi 


OH 
aq, y) (@=1,2,...,n) iscalled a Hamiltonian differential equation. The function H is called the 


Hamiltonian of the system. If f denotes the right-hand side of this differential equation, then obviously 
"| PH OH 
div f(x,y) = 2,1 z,y)| z 0. Hence, the Hamiltonian differential equations are 
f(x,y) L rop D T Dyn E q 
volume preserving. 


17.1.2.2 Linear Differential Equations 


1. General Statements 
Let A(t) = [a;;(t)]7;-, be a matrix function on R , where every component a;j: R — R is a continuous 
function, and let b: R —> R” be a continuous vector function on R. Then 





t = A(t)x + b(t) (17.13a) 
is called an inhomogeneous linear first-order differential equation in R”, and 
t = A(t)r (17.13b) 


is the corresponding homogeneous linear first-order differential equation. 

1. Fundamental Theorem for Homogeneous Linear Differential Equations Every solution 
of (17.13a) exists on the whole of R. The set of all solutions of (17.13b) forms an n-dimensional vector 
subspace Ly of the C'-smooth vector functions over R.. 
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2. Fundamental Theorem for Inhomogeneous Linear Differential Equations The set of all 
solutions Lr of (17.13a) is an n-dimensional affine vector subspace of the C1-smooth vector functions 
over R of the form Ly = yo + Ly, where ¢ is an arbitrary solution of (17.13a). 

Let q1,..., qn be arbitrary solutions of (17.13b) and & = [y1,..., Gn] the corresponding solution ma- 
triv. Then © satisfies the matrix differential equation Z(t) — A(t)Z(t) on R, where Z € R"^. If 
the solutions ¥),..., pn form a basis of Ly, then ® = [p1,..., Pn is called the fundamental matrix 
of (17.13b). W(t) = det &(t) is the Wronskian determinant with respect to the solution matrix & of 
(17.13b). The formula of Liouville states that: 


W(t) = rank A(t) W(t) (t € R). (17.13c) 


For a solution matrix, either W(t) Z2 0 on R or W(t) Z 0forallt € R. The system qi,..., qs isa 
basis of Ly, if and only if det[yi(t),...,n(t)| 4 0 for at (and so for all t). 


3. Theorem (Variation of Constants Formula) Let 9 be an arbitrary fundamental matrix of 
(17.13b). Then the solution y of (17.13a) with initial point p at time t — 7 can be represented in the 
form 


plt) = 9(t)b(r) p + / d(t)ó(s) !b(s)ds (tc R). (17.13d) 


2. Autonomous Linear Differential Equations 
Consider the differential equation 
Ξ Απ, (17.14) 


where A is a constant matrix of type (n, n). The operator norm (see also 12.5.1.1, p. 677) of a matrix 
A is given by ||A|| = max(||Az||, z € R”, ||x|| € 1), where for the vectors of R” the Euclidean norm 
is again considered. 

Let A and B be two arbitrary matrices of type (n, n). Then 

a) |A + BII < IAI + IBI; — b)IAAI 9 ALAI (A € R), 


ο Axl] < | Allllz]| y € R^), | d) [ABI S ΑΙ, 


ϱ) | ΑΙ -- νλιως, Where \max is the greatest eigenvalue of ATA . 
The fundamental matrix with initial value E, at time t = 0 of (17.14) is the matria-exponential function 


(see 5.3.6.4,4., p. 356) 





At AP? œ Aiți 
ο qp dex il (17.15) 
$ ? 20 ? 


with the following properties: 

a) The series of e“ is uniformly convergent with respect to t on an arbitrary compact time interval 
and absolutely convergent for every fixed t; 

b) le^] x eat. (r2 0); 


1 
c) πίε") = (ey = Ae =e™A (teR); 
d) etA = e^ esA (ste R) 
e) e^t is regular for all t and (e^*)-1 = e74; 


f) if A and B are commutative matrices of type (n, n), i.e., AB = BA holds, then B eĉ = eĉ B and 
gA*B — eA cB; 


g) if A and B are matrices of type (n, n) and B is regular, then eP4P ^ — B e^ p^, 
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3. Linear Differential Equations with Periodic Coefficients 

Considered is the homogeneous linear differential equation (17.13b), where A(t) = [aj;(t)]?j-1 is a T- 
periodic matrix function, i.e., a(t) — aij (t-- T) (Vt € R, à,j — L,...,n). In this case (17.13b) is 
called a linear T-periodic differential equation. 'Then every fundamental matrix @ of (17.13b) can be 
written in the form (t) — G(t)e'*, where G(t) is a smooth, regular T-periodic matrix function and R 
is a constant matrix of type (n, n) (Floquet's theorem). 

Let P(t) be the fundamental matrix of the T-periodic differential equation (17.13b), normed at t = 0, 
i.e., (0) = Ep, and let (t) — G(t)e!* be a representation of it according to Floquet’s theorem. The 
matrix (T) = eT is called the monodromy matrix of (17.13b); the eigenvalues p; of P(T) are the 
multipliers of (17.13b). A number p € C is a multiplier of (17.13b) if and only if there exists a solution 
yp #0 of (17.13b) such that y(t + 7) = py(t) (t € R) holds. 


17.1.2.3 Stability Theory 


1. Lyapunov Stability and Orbital Stability 
Consider the non-autonomous differential equation (17.11). The solution y(t, to, 7) of (17.11) is said 
to be stable in the sense of Lyapunov if: 


Vtr >to Ve >0 Jð = d(e, ty) Yzı E M : . m » 
[21 -- φίέι, ἴο, xo)]l « d lol, t1, £1) = P(t, to, o)]l « € (17.16a) 


ΡΠ 


The solution y(t, to, xo) is called asymptotically stable in the sense of Lyapunov, if it is stable and: 


Vt, > to JA = A(t) Va, EM : . . 
[μαι -- (tr, to, 20) || < ay e(t, tz, 22) — (to, 20 2 0 — (47 185) 
for t => +00. 

For the autonomous differential equation (17.1), there are other important notions of stability besides 
the Lyapunov stability. The solution Y(t, xo) of (17.1) is called orbitally stable (asymptotically orbitally 
stable), if the orbit y(xo) = {y(t, xo), t € R} is stable (asymptotically stable) as an invariant set. A 
solution of (17.1) which represents an equilibrium point is Lyapunov stable exactly if it is orbitally 
stable. The two types of stability can be different for periodic solutions of (17.1). 
E Let a flow be given in RË, whose invariant set is the torus T?. Locally, let the flow be described in 
a rectangular coordinate system by @; = 0, @2 = fo(@1), where fo: R —> R is a 2r periodic smooth 
function, for which: 


VO;€ R. dU, eighborhood of O Vó1,0 € U, 
i ME C LUE 

An arbitrary solution satisfying the initial conditions (;(0), O5(0)) can be given on the torus by 

O,(t) 2 €1(0), O»(t) — Θι(0) 4- fa(O1(0)t (t € R). 
From this representation it can be seen that every solution is orbitally stable but not Lyapunov stable 
(Fig. 17.2b). 
2. Asymptotical Stability Theorem of Lyapunov 
A scalar-valued function V is called positive definite in a neighborhood U of a point p € M C R^, if: 
1. V: U C M > R is continuous. 
2. V(x) > 0 forall z € U \ {p} and V(p)=0. 
Let U C M be an open subset and V : U — R a continuous function. The function V is called a 
Lyapunov function of (17.1) in U, if V(o(t)) does not increase while for the solution y(t) € U holds. 
Let V: U — R be a Lyapunov function of (17.1) and let V be positive definite in a neighborhood U 
of p. Then p is stable. If the condition V(w(t, xo)) = constant (t > 0) always yields y(t, x9) = p for 
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a solution o of (17.1) with (t,x) € U (t 7 0), ie., if the Lyapunov function is constant along a 
complete trajectory, then this trajectory can only be an equilibrium point, and the equilibrium point 
pis also asymptotically stable. 

Bl The point (0,0) is a steady point of the planar differential equation & = y, y = —x — x?y. The 
function V (x,y) = x? + y? is positive definite in every neighborhood of (0,0) and for its derivative 


Sv). y(t)) = —2x(t)?y(t)? < 0 holds along an arbitrary solution for x(t)y(t) z 0. Hence, (0,0) is 
asymptotically stable. 

3. Classification and Stability of Steady States 

Let xo be an equilibrium point of (17.1). In the neighborhood of xo the local behavior of the orbits of 
(17.1) can be described under certain assumptions by the variational equation y = D f(xo)y, where 
D f (vo) is the Jacobian matrix of f in x. If D f(x) does not have an eigenvalue A; with Re A; = 0, 
then the equilibrium point xo is called hyperbolic. The hyperbolic equilibrium point zo is of type (m, k) 
if D f (zo) has exactly m eigenvalues with negative real parts and k = n — 1 — m eigenvalues with 
positive real parts. The hyperbolic equilibrium point of type (m, k) is called a sink, if m = n, a source, 
if k = n, and a saddle point, if m z 0 and k Æ 0 (Fig. 17.3). A sink is asymptotically stable; sources 
and saddles are unstable (theorem on stability in the first approvimation). Within the three topological 
basic types of hyperbolic equilibrium points (sink, source, saddle point) further algebraic distinctions 
can be made. A sink (source) is called a stable node (unstable node) if every eigenvalue of the Jacobian 
matrix is real, and a stable focus (unstable focus) if there are eigenvalues with non-vanishing imaginary 
parts. For n — 3 a classification of saddle points is obtained as saddle nodes and saddle foci. 
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e e e 

Eigenvalues of the ο e ο 

Jacobian matrix ô . e 
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Figure 17.3 
4. Stability of Periodic Orbits 
Let y(t, £o) be a T-periodic solution of (17.1) and 4(xo) — (w(t,co), t € [0 T]) its orbit. Under 
certain assumptions, the phase portrait in a neighborhood of (a) can be described by the variational 
equation ý = D f(p(t,xo)) y. Since A(t) = D f(q(t, xo)) is a T-periodic continuous matrix function 
of type (n, n), it follows from the Floquet theorem (see p. 863) that the fundamental matrix Pz, (t) of 
the variational equation can be written in the form ®,,(t) = G(t)e*, where G is a T-periodic regular 
smooth matrix function with G(0) — E,, and R represents a constant matrix of type (n, n) which is not 
uniquely given. The matrix $,, (T) — e^ is called the monodromy matrix of the periodic orbit y(xo), 
and the eigenvalues pi, ... , pn of e*” are called multipliers of the periodic orbit y(ao). If the orbit (a0) 
is represented by another solution y(t, x1), i.e., if y(wo) = y(71), then the multipliers of y(a9) and (x1) 
coincide. One of the multipliers of a periodic orbit is always equal to one (Andronov-Witt theorem). 
Let p1,..., pni; fn — 1 be the multipliers of the periodic orbit (£o) and let S}, (T) be the monodromy 
matrix of (xo). Then 


Y pi = Tr a (T) and Il pj = det Sa, (T) = exp μ᾽ TrD fet) 
j=l 


ja 
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= exp μ΄ divf etas) at : (17.17) 


Hence, if n = 2, then p2 = 1 and pı = exp (E divf (y(t, zo)) dt). 


E Let c(t, (1,0)) — (cost,sint) be a 27-periodic solution of (17.92). The matrix A(t) of the variational 
equation with respect to this solution is 


—2cos?t —1-— sin2t 
A( = Df(et. 0,0) - (1. ast) 


The fundamental matrix $(,9)(t) normed at t — 0 is given by 
banD = e" cost —sint| — (cost —sintV (e? 0 
aol) = | etsint cost) = \ sint cost 0 1)? 
where the last product is a Floquet representation of $(9)(t). Thus, p1 — €^ and p» — 1. The multi- 
pliers can be determined without the Floquet representation. For system (17.92) div f (z, y) ^ 2—4z?— 
4y? holds, and hence divf (cost, sint) = —2. According to the formula above, p; = exp ( Jo" —2dt) = 
exp(—47). 
5. Classification of Periodic Orbits 
Ifthe periodic orbit ^ of (17.1) has no further multiplier on the complex unit circle besides p, — 1, then 
q is called hyperbolic. The hyperbolic periodic orbit is of type (m, k) if there are m multipliers inside 
and k = n — 1 — m multipliers outside the unit circle. If m > 0 and k > 0, then the periodic orbit of 
type (m, k) is called a saddle point. 
According to the Andronov-Witt theorem, a hyperbolic periodic orbit y of (17.1) of type (n — 1, 0) is 
asymptotically stable. Hyperbolic periodic orbits of type (m, k) with k > 0 are unstable. 
E A: A periodic orbit  — (v(t),t € [0, T]) in the plane with multipliers pı and p2 = 1 is asymptoti- 
cally stable if |p;| < 1, i.e., if JZ div f(p(t)) dt < 0. 
W B: If there is a further multiplier besides p, — 1 on the complex unit circle, then the Andronov— 
Witt theorem cannot be applied. The information about the multipliers is not sufficient for the stability 
analysis of the periodic orbit. 
Bl C: As an example, let the planar system  — —y + x f(x? +y?), y=at+yf(a? +y?) be given 
by the smooth function f: (0, 4-oo) — RR, which additionally satisfies the properties f(1) = f’(1) = 0 
and f(r)(r — 1) « 0 forall r Z 1,r > 0. Obviously, y(t) = (cost, sint) is a 27-periodic solution of the 
system and 
cost — sint 10). ; f 
P10) (t) = ( od ae J F a is the Floquet representation of the fundamental matrix. It follows 
that p; — p» — 1. The use of polar coordinates results in the system * = r f(r?), ) — 1. This repre- 
sentation yields that the periodic orbit y((1,0)) is asymptotically stable. 
6. Properties of Limit Sets, Limit Cycles 
The a- and w-limit sets defined in 17.1.1.2, p. 859, have with respect to the flow of the differential 
equation (17.1) with M C R" the following properties. Let x € M be an arbitrary point. Then: 
a) The sets a(x) and w(x) are closed. 
b) If y+ (x) (respectively y~ (x)) is bounded, then w(x) 4 @ (respectively a(x) # Ø) holds. Furthermore, 
w(x) (respectively a(x)) are in this case invariant under the flow (17.1) and connected. 
W [If for instance, y+ (x) is not bounded, then w(x) is not necessarily connected (Fig. 17.4a). 
For a planar autonomous differential equation (17.1), (i.e., M C R?) the Poincaré-Bendixson theorem 
is valid. 
Poincaré-Bendixson Theorem: Let y(-, p) be a non-periodic solution of (17.1), for which y*(p) is 
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Figure 17.4 

















bounded. If w(p) contains no equilibrium point of (17.1), then w(p) is a periodic orbit of (17.1). 
Hence, for autonomous differential equations in the plane, attractors more complicated than an equi- 
librium point or a periodic orbit are not possible. 

A periodic orbit y of (17.1) is called a limit cycle, if there exists an x ¢ y such that either y C w(x) or 
y C a(x) holds. A limit cycle is called a stable limit cycle if there exists a neighborhood U of y such 
that y = w(x) holds for all x € U, and an unstable limit cycle if there exists a neighborhood U of y such 
that y = a(x) holds for all x € U. 

W A: For the flow of (17.9a), the property y = w(p) for all p # (0,0) is valid for the periodic orbit 
7 ={(cost,sint), t € [0,27)}. Hence, U = R*\{(0,0)} is a neighborhood of y such that with it, y is 
a stable limit cycle (Fig. 17.4b). 

E B: In contrast, for the linear differential equation « = —y, y = x, the orbit y = {(cost, sint), t € 
[0, 2π]} is a periodic orbit, but not a limit cycle (Fig. 17.4c). 

7. m-Dimensional Embedded Tori as Invariant Sets 

A differential equation (17.1) can have an m-dimensional torus as an invariant set. An m-dimensional 
torus T™ embedded into the phase space M C R” is defined by a differentiable mapping g: R™ > R”, 
which is supposed to be 27-periodic in every coordinate O; as a function (01,...,Om) > g(@1,..., Om). 
W In simple cases, the motion of the system (17.1) on the torus can be described in a rightangular co- 
ordinate system by the differential equations Θ, —wi (i —1,2,..., m). The solution of this system 
with initial values (91(0),...,O@m(0)) at time t = 0 is O,(t) = wit +O,(0) (i = 1,2,...,m;t € R). 
A continuous function f: R —> R” is called quasiperiodic if f has a representation of the form f(t) = 
glwit, wat, ..., Wnt), where g is also a differentiable function as above, which is 27-periodic in every 
component, and the frequencies w; are incommensurable, i.e., there are no such integers n; with 


Σ n? > 0 for which nyw, +--+ + nat, — 0 holds. 
i=l 


17.1.2.4 Invariant Manifolds 
1. Definition, Separatrix Surfaces 
Let 7 be a hyperbolic equilibrium point or a hyperbolic periodic orbit of (17.1). The stable manifold 
W*(7) (respectively unstable manifold W“(y)) of y is the set of all points of the phase space such that 
the orbits tending to y ast — +00 (respectively t — —oo) pass through these points: 

W*(7) = {x € M: w(x) = y) and W"(y) 2 (x € M: a(x) = 4}. (17.18) 
Stable and unstable manifolds are also called separatrix surfaces. 
W In the plane, the differential equation 


t=-a, y=yt2? (17.19a) 
is considered. The solution of (17.19a) with initial state (ao, yo) at time t = 0 is explicitly given by 
3 
p(t, £o, Yo) = (exo, eyo + τ (=e). (17.19b) 


For the stable and unstable manifolds of the equilibrium point (0,0) of (17.19a) one gets: 


12 
W*((0, 0)) = {(Xo, Yo): m. p(t, Xo, yo) = (0, 0)} = { (20, Yo): Yo + F ux 0j, 
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μ“((0, 0)) -- {(το, yo): lm. p(t, xo, yo) = (0,0)} — ((xo, yo): xo — 0, yo € R} (Fig. 17.5a). 


YL-W'((0,0)) 


x 
7 (N W'((0,0)) M 


b) 


Figure 17.5 


Let M and N betwo smooth surfaces in IR", and let L, M and L,N bethe corresponding tangent planes 
to M and N through x. The surfaces M and N are transversal to each other if for all x € M A N the 
following relation holds: 

dim L, M + dim LN — n = dim (LM N LN). 
W For the section represented in Fig. 17.5b one can see that dim L,M = 2, dim L,N — 1 and 
dim(L,M n L,N) — 0 holds. Hence, the section represented in Fig. 17.5b is transversal. 
2. Theorem of Hadamard and Perron 
Important properties of separatrix surfaces are given by the Theorem of Hadamard and Perron: 
Let y be a hyperbolic equilibrium point or a hyperbolic periodic orbit of (17.1). 
a) The manifolds W*(7) and W“(7) are generalized C’-surfaces, which locally look like C’-smooth ele- 
mentary surfaces. Every orbit of (17.1), which does not tend to y for t + +00 or t + —oo, respectively, 
leaves a sufficiently small neighborhood of y for t —> +00 or t + —oo, respectively. 


b) If y = xo is an equilibrium point of type (m, k), then W*(29) and W" (xo) are surfaces of dimension 
rn and k, respectively. The surfaces W*(xg) and W" (zo) are tangent at xo to the stable vector subspace 


= {y € R”: cP fy 0 for t£ — +00} of equation j — D f(vo)y (17.20a) 


and d unstable vector subspace 


— (y € R^: ePfG9', 0 for t — —oo) of equation ý = D f(vo)y, respectively. (17.20b) 
c) If y is a hyperbolic periodic orbit of type (m,k), then W*(7) and W“(7) are surfaces of dimension 
m+ land k +1, respectively, and they intersect each other transversally along 7 (Fig. 17.6a). 
W A: To determine a local stable manifold through the steady state (0,0) of the differential equation 
(17.19a) here it is supposed that W$,((0,0)) has the following form: 
W5-((0,0)) = {(z,y): y = A(z), |a | « A, h: (2A, A) > R differentiable}. 
Let (x(t), y(t)) be a solution of (17.19a) lying in Wg,((0,0)). Based on the invariance, for times s near 
to t y(s) = h(x(s)) holds. By differentiation and representation of i and 7 from the system (17.19a) 
one gets the initial value problem h'(x) (~x) = h(x) + z?, h(0) — 0 for 2 unknown function h(x). If 


looking for the solution in the form of a series expansion h(a) = a + a? 34... where h’(0) = 0is 


; : I . 2 
taken under consideration, then one gets by comparing the coefficients az = -3 and ap = 0 for k > 3. 


W B: For the system 

Y y-z(1—22—3?) ᾗ -- ᾱ ἠ-γ(1-- αἳ -- υλ), 2 -- α2 (17.21) 
with a parameter a > 0, the orbit y = { (cost, sint, 0), t € [0, 27]} is a periodic orbit with multipliers 
py — € 75, py — e?" and py — 1. 
In cylindrical coordinates r — r cos, y = rsin V, z = z, with initial values (ro, 0o, z9) at time t — 0, 
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the solution of (17.21) has the representation (r(t, ro), V(t, Vo), e°%*zo), where r(t, ro) and V(t, vo) is the 
solution of (17.9a) in polar coordinates. Consequently, 
W°(y) = {(x,y,z): z = 0} \ {(0,0,0)} and W"(y) 2 ((z,y,z): 3? --y? — 1) (cylinder). 


Both separatrix surfaces are shown in Fig. 17.6b. 





Figure 17.6 


3. Local Phase Portraits Near Steady States for n = 3 

Considered is the differential equation (17.1) with the hyperbolic equilibrium point 0 for n = 3. Set 
A = D f(0) and let det[AE — A] = 4° + pA? + qÀ + r be the characteristic polynomial of A. With the 
notation ô = pq — r and A = —p?q? 4- Ap?r + 4q? — 18pqr + 27r? (discriminant of the characteristic 
polynomial), the different equilibrium point types are characterized in Table 17.1. 

4. Homoclinic and Heteroclinic Orbits 

Suppose 7 and %2 are two hyperbolic equilibrium points or periodic orbits of (17.1). If the separatrix 
surfaces W*(^1) and W"(45) intersect each other, then the intersection consists of complete orbits. 
For two equilibrium points or periodic orbits, the orbit y C W8(y1) A W"(42) is called heteroclinic if 
yı Æ %2 (Fig. 17.7a), and homoclinicif yı = y2. Homoclinic orbits of equilibrium points are also called 
separatrix loops (Fig. 17.7b). 





Figure 17.7 


E Consider the Lorenz system (17.2) with fixed parameters c — 10, b — 8/3 and with variable r. The 
equilibrium point (0, 0, 0) of (17.2) is a saddle for 1 < r < 13.926 . . . , which is characterized by a two- 
dimensional stable manifold W° and a one-dimensional unstable manifold W”. If r = 13.926..., then 
there are two separatrix loops at (0,0,0), i.e., as t — --oo branches of the unstable manifold return 
(over the stable manifold) to the origin (see [17.9]). 


17.1.2.5 Poincaré Mapping 


1. Poincaré Mapping for Autonomous Differential Equations 

Let 4 — (e(t, xo), t € [0 T]) be a T-periodic orbit of (17.1) and © a (n — 1)-dimensional smooth 
hypersurface, which intersects the orbit ^ transversally in zo (Fig. 17.8a). Then, there is a neighbor- 
hood U of zo and a smooth function 7: U — RR such that (zo) = T and (T(x), x) € X for all x € U. 
The mapping P: UN £ > Ð with P(x) = (T(x), x) is called the Poincaré mapping of y at xo. If 
the right-hand side f of (17.1) is r times continuously differentiable, then P is also r times continu- 
ously differentiable . The eigenvalues of the Jacobi matrix DP(xọ) are the multipliers p1, .. . , ps1 of 
the periodic orbit. They do not depend on the choice of xo on y and on the choice of the transversal 
surface. 
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Table 17.1 Steady state types in three-dimensional phase spaces 























































































































Parameter A Type of equili- | Roots of the charac- | Dimension of 
domain brium point teristic polynomial | W* and W% 
stable ImA; = 0 rer ee 
TES Δ«0 meds à. 0, 51,2, dim W* — 3, dinW" — 0 
450 stable Reài2 < 0 
A focus A3 < 0 
WY 
A <0 A>0: 
Parameter A Type of equili- | Roots of the charac- | Dimension of 
domain brium point teristic polynomial | W* and W% 
unstable ImA; = 0 ; rs 1 Duo 
S20 224, A<0 tode A > 0, j=1,2,3 dim W° = 0, dim W” = 3 
q>0 A>0 unstable Red 2 > 0 
focus A3 > 0 
WY 
A <0: A>0: 
Parameter AX Type of equili- | Roots ofthe charac- | Dimension of 
domain brium point teristic polynomial | W* and W% 
T saddle ImA; = 0 rs ο dimn ru 
: aad <0, | A«0 anda Mia <0, tg > 0 dim W° = 2, dim W” = 1 
r<0,q>0 saddle Rerya <0 
B focus A3 > 0 
I 
UT LAU Ut 
Parameter ^ Type of equili- | Roots of the charac- | Dimension of 
domain brium point teristic polynomial | W* and W% 
ς n saddle ImA; —-0 : ΕΝ : ο: 
- = >0, | A<0 ode disc, Aged dim W? — 1, dinW" —2 
r> 0,q>0 A>0 saddle RerA 2 > 0 
focus A3 < 0 
A « 0: A^ » 0: 


E 
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A system (17.3) in M = U can be connected with the Poincaré mapping, which makes sense until the 
iterates stay in U. The periodic orbits of (17.1) correspond to the equilibrium points of this discrete 
system, and the stability of these equilibrium points corresponds to the stability of the periodic orbits 
of (17.1). 





Ey Ξ 
9093) E A - 


eriodic 
a) b) steady state Pit 


Figure 17.8 


H Considered is for the system (17.9a) the transversal hyperplanes 


So = {(r, 0): 7 > 0,0 = Vo} 
in polar coordinate form. For these planes U = Σ) can be chosen. Obviously, r(r) — 2x (Vr > 0) and 
so 

P(r) = [1+ (r -—De* 7, 
where the solution representation of (17.9a) is used. It is also valid that P(X) = »5, P(1) — 1 and 
P'(M)) etr «1. 
2. Poincaré Mapping for Non- Autonomous Time-Periodic Differential Equations 
A non-autonomous differential equation (17.11), whose right-hand side f has period T' with respect 
to t, i.e., for which f(t + T, x) — f(t,z) (Vt € R, Vx € M) holds, is interpreted as an autonomous 
differential equation + = f(s, x), $ — 1 with cylindrical phase space M x (s mod T). Let so € (s mod 
T) be arbitrary. Then, © = M x {so} is a transversal plane (Fig. 17.8b). The Poincaré mapping is 
given globally as P: £ > È over zo —9 q(so + T, so, £o), where y(t, 80, o) is the solution of (17.11) 
with the initial state zo at time sp. 


17.1.2.6 Topological Equivalence of Differential Equations 


1. Definition 
Suppose, besides (17.1) with the corresponding flow (ven, that a further autonomous differential 
equation 

& = g(x), (17.22) 
is given, where g: N + R” is a C’-mapping on the open set N C R”. Of course, the flow {Yt her of 
(17.22) should exist. 
The differential equations (17.1) and (17.22) (or their flows) are called topologically equivalent if there 
exists a homeomorphism h: M — N (i.e., h is bijective, h and A^ are continuous), which transforms 
each orbit of (17.1) to an orbit of (17.22) preserving the orientation, but not necessarily preserving 
the parametrization. The systems (17.1) and (17.22) are topologically equivalent if there also exists a 
continuous mapping T: R x M — R , besides the homeomorphism h: M —> N, such that 7 is strictly 
monotonically increasing at every fixed x € M, maps R onto R, with 7(0, x) = 0 for all z € M and 
satisfies the relation h(w'(z)) — v^? (h(x)) for all z € M and t € R. 
In the case of topological equivalence, the equilibrium points of (17.1) go over into steady states of 
(17.22) and periodic orbits of (17.1) go over into periodic orbits of (17.22), where the periods are not 
necessarily coincident. Hence, if two systems (17.1) and (17.22) are topologically equivalent, then the 
topological structure of the decomposition of the phase spaces into orbits is the same. If two systems 
(17.1) and (17.22) are topologically equivalent with the homeomorphism h: M — N and if h preserves 
the parametrization, i.e., h(q*(x)) — v'(h(x)) holds for every t, x, then (17.1) and (17.22) are called 
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topologically conjugate. 

Topological equivalence or conjugation can also refer to subsets of the phase spaces M and N. Sup- 
pose, e.g., (17.1) is defined on U; C M and (17.22) on Uz C N. Then (17.1) on Uj is called topologically 
equivalent to (17.22) on Us if there exists a homeomorphism h: U; — U2 which transforms the inter- 
section of the orbits of (17.1) with U; into the intersection of the orbits of (17.22) with Uz preserving 
the orientation. 

W A: Homeomorphisms for (17.1) and (17.22) are mappings where, e.g., stretching and shrinking of 
the orbits are allowed; cutting and closing are not. 

The flows corresponding to phase portraits of Fig. 17.9a and Fig. 17.9b are topologically equivalent; 
the flows shown in Fig. 17.9a and Fig. 17.9c are not. 


Figure 17.9 
W B: Consider the two linear planar differential equations (see. [17.11]) 
i — Ar and i — Br with A = ( 2. Ej and B — e m ) The phase portraits of these systems 
close to (0,0) are shown in Fig. 17.10a and Fig. 17.10b. 


1 = 
The homeomorphism h: R? — R? with h(x) = Rx, where R = —= G i) and the function 


V2 1 1 
1 
T: Rx R? > R with r(t, £) = z t transform the orbits of the first system into the orbits of the second 


one. Hence, the two systems are topologically equivalent. 


M. 


AN S MC 


Figure 17.10 
2. Theorem of Grobman and Hartman [ 


Let p be a hyperbolic equilibrium point of (17.1). Then, in a neighborhood of p the differential equation 
(17.1) is topologically equivalent to its linearization y = Df (p)y. 





17.1.3 Discrete Dynamical Systems 
17.1.3.1 Steady States, Periodic Orbits and Limit Sets 


1. Types of Steady State Points 

Let xo be an equilibrium point of (17.3) with M C R”. The local behavior of the iteration (17.3) 
close to ao is given, under certain assumptions, by the variational equation yi41 = Dp(xo)y:, t € D. 1f 
D(a) has no eigenvalue \; with |A;| = 1, then the steady state point £o, analogously to the differential 
equation case, is called hyperbolic. The hyperbolic equilibrium point xo is of type (m, k) if D f (xo) has 
exactly m eigenvalues inside and k = n— m eigenvalues outside the complex unit circle. The hyperbolic 
equilibrium point of type (m, k) is called a sink for m = n, a source for k = n and a saddle point for 
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m > Qand k > 0. Asink is asymptotically stable; sources and saddles are unstable (theorem on stability 
in the first approximation for discrete systems). 

2. Periodic Orbits 

Let (xo) — (e (xo), k — 0,---, T — 1) bea T-periodic orbit (T > 2) of (17.3). If xq is a hyperbolic 
equilibrium point of the mapping y7, then y(xo) is called hyperbolic. 

The matrix Dy? (29) = De(y?1(a0)) --- Dip(xo) is called the monodromy matriz; the eigenvalues p; 
of Dy? (a) are the multipliers of γ(το). 

If all multipliers p; of (xo) have an absolute value less than one, then the periodic orbit ^(xo) is asymp- 
totically stable. 

3. Properties of » -Limit Set 

Every w -limit set w(x) of (17.3) with M = R” is closed, and w(y(x)) = w(x). If the semiorbit 7+ (2) is 
bounded, then w(x) # Ø and w(x) is invariant under y. Analogous properties are valid for a-limit sets. 





W Suppose the difference equation z4441 — —2z,, t — 0,EL,---, is given on IR. with e(r) — —r. Ob- 
viously, the relations w(1) = {1,—1}, w(y(1)) = w(-1) = w(1), and y(w(1)) = w(1) are satisfied 
for r — 1. It is to be mentioned that w(1) is not connected, is different from the case of differential 
equations. 


17.1.3.2 Invariant Manifolds 


1. Separatrix Surfaces 

Let xo be an equilibrium point of (17.3). Then W8(ao) = (y € M: q(y) — xo for i + +00} is called 

a stable manifold and W” (xo) = (y € M: q!(y) — xo for i — —oo) an unstable manifold of xo. Stable 

and unstable manifolds are also called separatriz surfaces. 

2. Theorem of Hadamard and Perron 

The theorem of Hadamard and Perron describes the properties of separatrix surfaces for discrete sys- 

tems in M C R*: 

If vo is a hyperbolic equilibrium point of (17.3) of type (m, k), then W*(xg) and W" (xo) are generalized 

C"-smooth surfaces of dimension m and k, respectively, which locally look like C’-smooth elementary 

surfaces. The orbits of (17.3), which do not tend to xo for i — --oo or i — —oo, leave a sufficiently 

small neighborhood of x for i + +00 or i — —oo, respectively. The surfaces W?(vg) and W" (x) 

are tangent at xo to the stable vector subspace E* — (y € R^" : [Do(xo)|| y — 0 for i —^ —oo) of 

Vii — De(xo)y; and the unstable vector subspace E" — (y € R* : [Do(xo)|!y — 0 fori 4 —oo}, 

respectively. 

W Considered is the following time discrete dynamical system from the family of Hénon mappings: 
in = x +yi— 2, Yiyi = Ti, iE Z. (17.23) 


Both hyperbolic equilibrium points of (17.23) are P, = (v2, V2) and Py = (—v2, — v2). 
Determination of the local stable and unstable manifolds of Pj: The variable transformation r; = 
& - V2, yi — yi -- V2 transforms system (17.23) into the system &41 = €? +2V2 & +, mou =& 
with the equilibrium point (0,0). The eigenvectors a, = (V2 + V3, 1) and ay = (V2 — V3, 1) of the 
Jacobian matrix Df((0,0)) correspond to the eigenvalues \y9 = V2 + V3, so E* = (tas,t € R] 
and EY = {ta,,t € R}. Supposing that Wi.((0,0)) = {(€,): 7 = B(), |€| < A,B: (-A, A) > 
R differentiable}, one looks for f in the form of a power series B(€) = (V3 — V2) € 4- k£? ----. From 
(£i, qi) € Wi, ((0,0)), (£1. mia) € Wis ((0, 0)) follows. 'This leads to an equation for the coefficients 
of the decomposition of 6, where k < 0. The theoretical shape of the stable and unstable manifolds is 
shown in Fig. 17.11a (see [17.12]). 

3. Transverse Homoclinic Points 

The separatrix surfaces W*(rg) and W" (xg) of a hyperbolic equilibrium point xg of (17.3) can intersect 
each other. If the intersection W*(xg) rn W" (ro) is transversal, then every point y € W*(xo)  W"(xg) 
is called a transversal homoclinic point. 
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Figure 17.11 


Fact: If y is a transversal homoclinic point, then the orbit (v (y)) ofthe invertible system (17.3) consists 
only of transversal homoclinic points (Fig. 17.11b). 


17.1.3.3 Topological Conjugation of Discrete Systems 


1. Definition 
Suppose, besides (17.3), a further discrete system 

Ziyi = Y(t) (17.24) 
with : N — N is given, where N C R” is an arbitrary set and 7 is continuous (M and N can be 
general metric spaces). The discrete systems (17.3) and (17.24) (or the mappings y and w) are called 
topologically conjugate if there exists a homeomorphism h: M > N such that y = h-to woh. If (17.3) 
and (17.24) are topologically conjugated, then the homeomorphism h transforms the orbits of (17.3) 
into orbits of (17.24). 
2. Theorem of Grobman and Hartman 
If y in (17.3) is a diffeomorphism p: R” > R"”, and 29 a hyperbolic equilibrium point of (17.3), then 
in a neighborhood of xo (17.3) is topologically conjugate to the linearization y41 = Dy(xo) yt. 


17.1.4 Structural Stability (Robustness) 


17.1.4.1 Structurally Stable Differential Equations 


1. Definition 

The differential equation (17.1), i.e., the vector field f: M — R", is called structurally stable or robust, 
if small perturbations of f result in topologically equivalent differential equations. The precise defini- 
tion of robustness requires the notion of distance between two vector fields defined on M. The further 
investigations are restricted to smooth vector fields on M, which have a common open connected ab- 
sorbing set U C M. Let the boundary QU of U be a smooth (n — 1)-dimensional hypersurface and 
suppose that it can be represented as QU = {x € R”: h(x) = 0}, where h: R" — R is a Cl-function 
with grad h(a) # 0 ina neighborhood of U. Let X! (U) be the metric space of all smooth vector fields 
on M with the C! metric 


P(f.9) = sup I| f(x) — g(x)| + sup | D f(x) - D g(z)]. (17.25) 


(In the first term of the right-hand side || - || means the Euclidean vector norm, in the second one the 
operator norm.) The smooth vector fields f intersecting transversally the boundary QU in the direction 
U, ie., for which grad h(x)" f(x) 4 0, (x € OU) and y!(x) € U (x € OU, t > 0) hold, form the set 
X} (U) C X!(U). The vector field f € X} (U) is called structurally stable if there is ad > 0 such that 
every other vector field g € X} (U ) with p(f, g) « ô is topologically equivalent to f. 

W Consider the planar differential equation g (-, a) 


t y+rla-r? -y), ἡ --αἼ-γ(α--α)-- υ)) (17.26) 
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with parameter a, where |a] « 1. The differential equation g belongs, e.g., to XL (U) with U — ((v, y): 
a? +y? « 2) (Fig. 17.12a). Obviously, p(g (,0), g (,0)) — la| (V2 +1). The vector field g (-, 0) is 
structurally unstable; there exist vector fields arbitrarily close to g (-,0), which are not topologically 
equivalent to g (-, 0) (Fig. 17.12b,c). This is clear, if the polar coordinate representation ? — —r? 4- 





or, à — 1 of (17.26) is considered. For a > 0 there always exists a stable limit cycle r = γα. 


y X M 
SU 





a) b) acd c) o0 
Figure 17.12 


2. Structurally Stable Systems in the Plane 

Suppose the planar differential equation (17.1) with f € Xi(U ) is structurally stable. Then: 

a) (17.1) has only a finite number of equilibrium points and periodic orbits. 

b) All w-limit sets w(x) with x € U of (17.1) consist of equilibrium points and periodic orbits only. 
Theorem of Andronov and Pontryagin: The planar differential equation (17.1) with f € XL (U) 
is structurally stable if and only if: 

a) All equilibrium points and periodic orbits in U are hyperbolic. 

b) There are no separatrices, i.e., no heteroclinic or homoclinic orbits, coming from a saddle and tending 
to a saddle point. 


17.1.4.2 Structurally Stable Time Discrete Systems 

In the case of time discrete systems (17.3), i.e., of mappings o: M — M,let U C M C R^ be 
a bounded, open, and connected set with a smooth boundary. Let Diff!(U ) be the metric space of 
all diffeomorphisms on M with the corresponding U defined C?-metric. Suppose the set Diff 1U )€ 
Diff(U) consists of the diffeomorphisms ¢, for which (U) C U is valid. The mapping y € Dif} (U) 
(and the corresponding dynamical system (17.3)) is called structurally stable if there exists ad > 0 such 
that every other mapping v € Diff l(U ) with p(y, w) < å is topologically conjugate to y. 


17.1.4.3 Generic Properties 


1. Definition 
A property of elements of a metric space (M, p) is called generic (or typical) if the set of the elements 
B of M with this property form a set of the second Baire category, i.e., it can be represented as B = 


Bm, where every set B,, is open and dense in M. 
m=1,2... 


E A: The sets R and I C R (irrational numbers) are sets of second Baire category, but Q C R is not. 

W B: Density alone as a property of “typical” is not enough: Q C R and I C R are both dense, but 

they cannot be typical at the same time. 

E C: There is no connection between the Lebesgue measure À (see 12.9.1, 2., p. 694) of a set from R 

and the Baire category of this set. These B= (| Bywith By — U (an — cM Qn + xxx) , where 
k=1,2,... n>0 k^ k2n 


Q = {ay} represents the rational numbers, is a set of second Baire category (see [17.5], [17.10]). 
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1 
On the other hand, since By 2 By441 and \( Br) < +00 also \(B) = jm A(Bk) € jm 0 
—00 


2 
>o k 1— 1/2 
holds. 
2. Generic Properties of Planar Systems, Hamiltonian Systems 
For planar differential equations the set of all structurally stable systems from X: (U) is open and dense 
in xi (U). Hence, structurally stable systems are typical for the plane. It is also typical that every orbit 
of a planar system from X1 (U) for increasing time tends to one of a finite number of equilibrium points 
and periodic orbits. Quasiperiodic orbits are not typical. Under certain assumptions, in the case of 
Hamiltonian systems, the quasiperiodic orbits of differential equation are preserved in the case of small 
perturbations. Hence, Hamiltonian systems are not typical systems. 
OH . OHo 
ae o 
Of Ojo 
where the Hamiltonian Ho(j1, j2) is analytical. Obviously, this system has the solutions jı — c1, ja = 
C9, O1 — wt ^ c3, Og = Wet + cq with constants c1,..., c4, where w and wz can depend on cı and cz. 
The relation (71, j2) — (c1, c2) defines an invariant torus 7?. Consider now the perturbed Hamiltonian 

Ho(j1, J2) + € Ai (ji, Ja, O1, O2) 

instead of Ho, where H; is analytical and £ > 0 is a small parameter. 
The Kolmogorov-Arnold-Moser theorem (AM theorem) says in this case that if Ho is non-degenerate, 





E Given in R* a Hamiltonian system in action-angle variables j; = 0, ja — 0, O1 = 





: OPH, ; ; : ς $ 
i.e., det ( 8 - # 0, then in the perturbed Hamiltonian system most of the invariant non-resonant 
Jk 
tori will not vanish for sufficiently small € > 0 but will be only slightly deformed. “Most of the tori” 
means that the Lebesgue measure of the complement set with respect to the tori tends to zero if € tends 
to 0. A torus, defined as above and characterized by w; and wy, is called non-resonant if there exists a 
1 P 6 


ω : 
> -z holds for all positive integers p and q. 
ω q^ 


constant c > 0 such that the inequality 
2 








3. Non-Wandering Points, Morse-Smale Systems 
Let (teg be a dynamical system on the n-dimensional compact orientable manifold M. The point 
p € M is called non-wandering with respect to (qf) if 

VT»0 3t στ: (Up nU, £0 (17.27) 
holds for an arbitrary neighborhood U, C M of p. 
W Steady states and periodic orbits consist only of non-wandering points. 
The set Q(y*) of all non-wandering points of the dynamical systems generated by (17.1) is closed, in- 
variant under {y*} and contains all periodic orbits and all w-limit sets of points from M. 
The dynamical system {y"}:cr on M generated by a smooth vector field is called a Morse-Smale system 
if the following conditions are fulfilled: 
1. The system has finitely many equilibrium points and periodic orbits and they are all hyperbolic. 
2. All stable and unstable manifolds of equilibrium points and periodic orbits are transversal to each 
other. 
3. The set of all non-wandering points consists only of equilibrium points and periodic orbits. 
Theorem of Palis and Smale: Morse-Smale systems are structurally stable. 
The converse statement of the theorem of Palis and Smale is not true: In the case of n > 3, there exist 
structurally stable systems with infinitely many periodic orbits. 
For n > 3, structurally stable systems are not typical. 
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17.2 Quantitative Description of Attractors 
17.2.1 Probability Measures on Attractors 


17.2.1.1 Invariant Measure 


1. Definition, Measure Concentrated on the Attractor 
Let {p'her be a dynamical system on (M, p). Let B be the c-algebra of Borel sets on M (12.9.1, 2., 
p. 694) and let ji: B — [0, 4-00] be a measure on B. Every mapping vf is supposed to be j( measurable. 
The measure yu is called invariant under {p'}ier if u(y *(A)) = u(A) holds for all A € B and t » 0. 
If the dynamical system {'}ier is invertible, then the property of the measure being invariant under 
the dynamical system can be expressed as u(qf(A)) — u(A) (A € B, t > 0). The measure pz is said 
to be concentrated on the Borel set AC M if u(M \ A) = 0. If A is also an attractor of {y'}rer and μ 
is an invariant measure under {y‘}, then it is concentrated at A, if (B) — 0 for every Borel set B with 
ANB=6. 
The support of a measure μ: B — [0, --oo], denoted by supp p, is the smallest closed subset of M on 
which the measure jz is concentrated. 
E A: The Bernoulli shift mapping is considered on M = [0, 1]: 

$541 — 2x; (modl). (17.282) 
In this case the map o: [0,1] — [0, 1] is defined as 

ux Jia, 0 x x x 1/2, 

p(z) = \ 2x — 1l, 1/2«zx «1. 


The definition yields that the Lebesgue measure is invariant under the Bernoulli shift mapping. If a 


(17.28b) 


oo 

number x € [0, 1) is written in dyadic form x — Y; a,-27" (an = Oor 1), then this representation can 
n=1 

be identified with # = .a,aga3.... The result of the operation 2r( mod 1) can be written as .a^ a5 a5 .. . 


with a; — aj44 , i.e., all digits ακ are shifted to the left by one position and the first digit is omitted. 
E B: The mapping V: (0, 1] — [0,1] with 
2y, 0 € y « 1/2, 
V(y) — E 
2(1—y), 1/2<y<l 


is called a tent mapping and the Lebesgue measure is an invariant measure. The homeomorphism 


(17.29) 


2 
h: [0, 1) — [0,1) with y = —arcsin /z transforms the mapping ¢ from (17.5) into (17.29). Hence, 
π 


in the case of a = 4, (17.5) has an invariant measure which is absolutely continuous. For the density 
pily) = 1 of (17.29) and p(x) of (17.5) at a = 4 it is valid that pi(y) = p(h-+(y)) |(h71)'(y)]. It follows 


1 
πνα(] - z) 


HC: If 29 is a stable periodic point of period T of the invertible discrete dynamical system {y"}, 
T-1 

then u = T Σ oia) i$ a probability measure for {y'}. Here, à, is the Dirac measure concentrated 
i=0 

at zo (see 12.9.1,2., p. 694). 

2. Natural Measure 

Let A be an attractor of {y* şer in M with domain of attraction W. For an arbitrary Borel set A C W 

and an arbitrary point zo € W define the number: 


«T, Au 
μ(Α; ας) --- Jim. ' (17.30) 


directly that p(x) = 
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Here, t(T, A, vo) is that part of the time T' 7 0 for which the orbit portion (q'(xo) 1.9 lies in the set A. 
If (A; o) — o for A-a.e.* zo from W, then let u(A) := (A; £o). Since almost all orbits with initial 
points zo € W tend to A for t > +00, pis a probability measure concentrated at A. 


17.2.1.2 Elements of Ergodic Theory 


1. Ergodic Dynamical Systems 
A dynamical system {y*}ier on (M, p) with invariant measure ju is called ergodic (one also says that the 
measure is ergodic) if either μ(Α) -- 0 ον μ(ΜΝ A) — 0 for every Borel set A with ^ *(A) = A(Vt > 0). 
If (o) is a discrete dynamical system (17.3), o: M — M is a homeomorphism and M is a compact 
metric space, then there always exists an invariant ergodic measure. 
Wl A: Suppose there is given the rotation mapping of the circle St 

3141— x; H9. (mod2-y), t—0,1,..., (17.31) 
with vy: [0, 27) — [0, 2), defined by p(x) = «+ @(mod 27). The Lebesgue measure is invariant under 


Φ Φ 
12 — ; 5 irrational, then (17.31) is ergodic; or τ 5 rational, then (17.31) is not ergodic. 


E B — systems with stable udine points or stable periodic orbits as attractors are 
ergodic with respect to the natural measure. 


Birkhoff Ergodic Theorem: Suppose that the dynamical system (v');er is ergodic with respect 
to the invariant probability measure μ. Then, for every integrable ον. he ae B, 1), the time 


average along the positive semiorbits (v' (ro) ] 99 , i.e. h(zo) — um. = a h. (o (xo)) dt for flows and 


n-1 


1 
h(a) = Jim. 2» h(y'(ao)) for discrete systems, coincide with the space average d h dw for pra.e. 


points zo € M. 
2. Physical or SBR Measure 


The statement of the ergodic theorem is useful only if the support of the measure ju is large. Let y: 
M — M be a continuous mapping, and u: B + R be an invariant measure. The measure p is called 
a SBR measure (according to Sinai, Bowen and Ruelle, see also [17.9]) if for any continuous function 
h: M — R the set of all points x9 € M, for which 


n-1 
lim = h(y'(20)) = nz (17.322) 
M 


holds, has a positive Lebesgue measure for this. It is sufficient that the sequence of measures 


EE 


"i δρ (17.32b) 


where ὃς is "m pom measure, weakly converges to u for almost all x € M, i.e., for every continuous 


function 15 dln > frau as n — +00. 
M 
W For some important attractors, such as the Hénon attractor, the existence of an SBR measure is 
proven. 
3. Mixing Dynamical Systems 
A dynamical system (ver on (M, p) with invariant probability measure j1 is called mizing if 
mon (An e *(B)) 2 u(A)u(B) holds for arbitrary Borel sets A4, B C M. For a mixing system, the 


measure of the set of all points which are at t — 0 in A and under οἱ [ου large t in B, depends only on 





*Here and in the following a.e. is an abbreviation for *almost everywhere". 
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the product u(A)u(B). 
A mixing system is also ergodic: Let {y‘} be a mixing system and A be a Borel set with (4) -- 
A (t > 0). Then (A)? = Jim no *(A) n A) 2 (A) holds and ju(A) is 0 or 1. 


Α βοῦν {φ!} οἳ (17.1) is mixing if and only if the relation 


im. flot 2) - gh) — Bi] du = 0 (17.33) 


holds for arbitrary quadratically integrable functions g, h € L?(M, B, u). Here, g and ^ denote the 
space average, which is replaced by the time average. 
W The modulo mapping (17.28a) is mixing. The rotation mapping (17.31) is not mixing with respect 


a A 
to the probability measure = 
π 


4. Autocorrelation Function 
Suppose the dynamical system (']er on M with invariant measure p is ergodic. Let h: M > R 
be an arbitrary continuous function, {'(2)}:50 be an arbitrary semiorbit and let the space average 


— 1 
h be replaced by the time average, i.e., by Jim T / h(y'(a)) dt in the time-continuous case and by 
—00 


im = oF h(y'(a)) in the time-discrete case. With respect to h the autocorrelation function along 
Τι 


the semior rbit {φί(α ο to a time point 7 7 0 is defined for a flow by 


Οκτ) Ξ lim πα. **7 (2) h (ip! (a) dt — n? (17.34a) 
^ T3o 
and for a discrete system by 
n-1 
Cr(7) = lim P «7 (a) h (gh (x)) — h?. (17.34b) 
n—0oo n 


The autocorrelation ο. is defined also for negative time, where C; (-) is considered as an even 
function on R or Z. 

Periodic or quasiperiodic orbits lead to periodic or quasiperiodic behavior of Ch. A quicker descent of 
Cx (7) for increasing 7 and arbitrary test function / refers to chaotic behavior. If C; (7) decreases for 
increasing 7 with an exponential speed, then it means mixed behavior. 

5. Power Spectrum 

'The Fourier transform of C; (7) is called a power spectrum (see also 15.3.1.2, 5., p. 786) and is denoted 


+00 
by P,(w). In the time-continuous case, under the assumption that / [Cs (7)|dr « oo, 
4J—00 
= fos Ομίτ)ε-'“Τ dr = 2 [ας ) cos(wr) dr. (17.35a) 
In the time-discrete case, if ο |Ci,(k)| < 4-oo holds, then 


Ἁι{ω) -- Ο0 ) 425° Cn(k) ) cos wk: (17.35b) 
k=l 


holds. If the absolute integrability or summability of C;,(-) does not hold, then, in the most impor- 
tant cases, Pj, can be considered as a distribution. The power spectrum corresponding to the periodic 
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motions of a dynamical system is characterized by equidistant impulses. For quasiperiodic motions, 
there occur impulses in the power spectrum, which are linear combinations with integer coefficients of 
the basic impulses of the quasiperiodic motion. A “wide-band spectrum with singular peaks” can be 
considered as an indicator of chaotic behavior. 

W A: Let o be a T-periodic orbit of (17.1), h be a test function such that the time average of h(p(t)) 
is zero, and suppose h(y(t)) has the Fourier representation 


2 
A(y(t)) = x apet with wo = 
k-—oo T 
Then with ô as the ô distribution, holds 
+0 Ads 
C&(r) 2 M Ιαν] οο(Κωοτ) and PB&(w)-—2m- Y; |αι] δω -- Κωο). 
Kee k-—oo 


E B: Suppose y is a quasiperiodic orbit of (17.1), h is a test function such that the time average is 
zero along , and let Mer be the representation (double Fourier series) 


coo 
h(y(t)) = Σ Y Ok ks i(krwi+kwa)t 


ky=—co kg=—0o 


Then, 


Too Too 
CalT) = Σ £ [akika |? cos(kiwi + ἔρωο)τ, 


ki——oo ko——oo 


too 
) = 2n A x [ou ka 0 (wo — Kia — Kou). 


ky=—00 kg=—00 


17.2.2 Entropies 
17.2.2.1 Topological Entropy 


Let (M, p) be a compact metric space and (er be a continuous dynamical system with discrete 
time on M. A distance function p, on M for arbitrary n € IN is defined by 


Palt, y) := max p (p(x), 9 (y). (17.36) 


Furthermore, let N Ke. Pn) be the largest number of points from M which have in the metric p, a distance 
at least € from each other. The topological entropy of the discrete dynamical system (17.3) or of the 


1 
mapping y is h(y) = lim lim: sup = In N(e, pn). The topological entropy is a measure for the complexity 


of the mapping. Let (Mi, p) be a further compact metric space and yı: Mi — Mi be a continuous 
mapping. If both mappings y and yy, are topologically conjugate, then their topological entropies 
coincide. In particular, the topological entropy does not depend on the metric. For arbitrary n € N, 
h(y") = nh(y) holds. If y is a homeomorphism, then h (y*) = |k| h (p) for allk € Z. Based on the 
last property the topological entropy h(i") := h (4t) is defined for a flow v! = p(t,-) of (17.1) on 
MCR”. 


17.2.2.2 Metric Entropy 


Let {%"}er be a dynamical system on M with attractor A and with an invariant probability mea- 
sure p concentrated on A. For an arbitrary £ > 0 consider the cubes Ωι(ε)..... (δηεγ(ε) of the form 
(Gm... mu): kie € v; € (k; - 1)e (i — 1,2,...,n)) with k; € Z, for which u(Q;) > 0. For arbitrary 
x from : a 'Q the semiorbit (f (: Σ} 55. 2o ÍS followed for i increasing t. In time- distances of T > 0 (7 = 1 
in discrete systems), the N cubes, in which the semiorbit is found is denoted by à4,...,iyw after each 
other. Let Ej, bethe set of all starting points in the neighborhood of A whose semiórbits at the 
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times t; = ir (i = 1,2,..., N) are always in Qi, ..., Qi, and let p(i,*- iv) — p(Ei, iy) be the 
probability that a (typical) starting point is in Ej, iy. 

The entropy gives the increment of the information on average by an experiment which shows that 
among a finite number of disjoint events which one has really happened. In the above situation this is 


Ην -- -- Σ Ῥ (νετ ΤΝ) In p(is,* iw), (17.37) 
(ssi) 
where the summation is over all symbol sequences (4i, - - - iy) with length N, which are realized by the 


orbits described above. 
The metric entropy or Kolmogorov-Sinai entropy h,, of the attractor A of (f) with respect to the in- 


Hn 
variant measure p is the quantity h, = lim lim ~=. For discrete systems, the limit as € — 0 is 
: £30 Noo TÍN 


omitted. For the topological entropy h(y) of p: A + A the inequality h, < h(y) holds. In several 
cases h(y) = sup{h,: ji invariant probability measure on A]. 

W A: Suppose A = {29} is a stable equilibrium point of (17.1) as an attractor, with the natural mea- 
sure u concentrated on xo. For these attractors h, = 0. 

HB: For the shift mapping (17.28a), h(y) = h, = In 2, where p is the invariant Lebesgue measure. 


17.2.3. Lyapunov Exponents 


1. Singular Values of a Matrix 

Let L be an arbitrary matrix of type (n,n). The singular values 0, > oz > +++ > oy of L are the 
non-negative roots of the eigenvalues a, > --- > a» > 0 of the positive semidefinite matrix LTL. The 
eigenvalues a; are enumerated according to their multiplicity. 

The singular values can be interpreted geometrically. If A’, is a sphere with center at 0 and with radius 
€ > 0, then the image L(K-) is an ellipsoid with semi-axis lengths oj¢ (¢ = 1,2,...,n) (Fig. 17.13a). 


@ (Xp) 
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Figure 17.13 


2. Definition of Lyapunov Exponents 

Let (ver be a smooth dynamical system on M C R”, which has an attractor A with an invariant 
ergodic probability measure p concentrated on A. Let o1(t, £) > --- > on(t,x) be the singular values 
of the Jacobian matrix D y'(x) of gy! at the point x for arbitrary t > 0 and x € A. Then there exists 


1 
a sequence of numbers Àj > --- > An, the Lyapunov exponents, such that 4 In a;(t,z) — AX for 


t + +00 p-a.e. in the sense of L1. According to the theorem of Oseledec, there exists j1-a.e. a sequence 
of subspaces of IR” 
R” = Ep D Ea Dco D EE, = {0}, (17.38) 


1 
such that for jj-a.e. x the quantity * In | D e? (z)v|| tends to an element As; € {A1,.-., An} uniformly 
with respect tov € EZ, N E? 


Sj+1' 
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3. Calculation of Lyapunov Exponents 
Suppose o;(t, x) are the semi-axis lengths of the adea got by deformation of the unit sphere with 


center at x by D t(x). The formula y;(x) = jim sup 7 Lu c(l, ) can be used to calculate the Lya- 


punov exponents, if additionally a reorthonor: on method, such as Householder, is used. The 
function y(t, x, v) = D q'(x)v is the solution of the variational equation with v at t — 0 associated to 
the semiorbit ^* (1) of the flow (o). Actually, {y"}rer is the flow of (17.1), so the variational equation 
is y = D f(y'(x)) y. The solution of this equation with initial v at time t = 0 can be represented as 
y(t, x,u) = 9, (t)v, where $, (t) is the normed fundamental matrix of the variational equation at t = 0, 
which is a solution of the matrix differential equation Z = D f(y"(«)) Z with initial Z(0) — E, accord- 
ing to the theorem about differentiability with respect to the initial state (see 17.1.1.1, 2., p. 857). 


1 
The number x(x, v) = Jim sup z In || D e (z)v|| describes the behavior of the orbit 5(z4-6v),0 <£ « 1 
—>00 b 


with initial x + ev with respect to the initial orbit y(x) in the direction v. If x(a, v) < 0, then the orbits 
move nearer to x for increasing ¢ in the direction v. If, on the contrary, x(«,v) > 0, then the orbits 
move away (Fig. 17.13b). 
Let A be the attractor of the dynamical system {y'}ier and y the invariant ergodic measure concen- 
trated on it. Then, the sum of all Lyapunov exponents pi-a.e. x € A is 

t 


5 à = jim 2 5 f divf(e (4) ds (17.39a) 
0 


i=1 


in the case of flows (17.1) and for a discrete system (17.3), it is 


n 


pel 
Sx = jim | 2 In| det D y(y'(x))|. (17.39b) 
i=l Κ56 


Hence, in dissipative systems x A; « 0 holds. Considering that one of the Lyapunov exponents is 
= 

equal to zero if the attractor is not an equilibrium point, the calculation of Lyapunov exponents can be 
simplified (see [17.9]). 

W A: Let be xo an equilibrium point of the flow of (17.1) and let a; be the eigenvalues of the Jacobian 
matrix at zg. With the measure concentrated on xo, the following holds for the Lyapunov exponents: 
Mi = Rea; (¢ = 1,2,...,n). 

Bl B: Let 7(xo) = {p'(x0), t € [0,T]} be a T-periodic orbit of (17.1) and let p; be the multipliers of 


1 
7(xo). With the measure concentrated on 7(xo) there is A; = T In |p;| for i 2 1,2...,n 


4. Metric Entropy and Lyapunov Exponents 
If {ther is a dynamical system on M C R” with attractor A and an ergodic probability measure p 
concentrated on A, then the inequality hy < 5 A; holds for the metric entropy h,, where in the sum 
A20 
the Lyapunov exponents are repeated according to their multiplicity. 
The equality 
h,— MA (Pesi's formula) (17.40) 
A0 
is not valid in general (see also 17.2.4.4, BI B, p. 888). If the measure ju is absolutely continuous with 
respect to the Lebesgue measure and o: M — M is a C?-diffeomorphism, then Pesin's formula is valid. 
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17.2.4 Dimensions 
17.2.4.1 Metric Dimensions 


1. Fractals 

Attractors or other invariant sets of dynamical systems can be geometrically more complicated than 
point, line or torus. Fractals, independently of dynamics, are sets which distinguish themselves by 
one or several characteristics such as fraying, porosity, complexity, and self-similarity. Since the usual 
notion of dimension used for smooth surfaces and curves cannot be applied to fractals, a generalized 
definition of the dimension is necessary. For more details see [17.2], [17.12]. 

E The interval Go = [0, 1] is divided into three subintervals with the same length and the middle open 
third is removed, so the set Gy = [0, +]U [2, 1] is obtained. Then from both subintervals of G; the open 


middle third ones are removed, which yields the set Gy = lo, i U B i U B z] U B 1]. Continuing 
this procedure, Gx is obtained from Gp- by removing the open middle thirds from the subintervals. 
So, a sequence of sets Go 2 G1 2 -:: 2 Ga 2 ++- is obtained, where every G,, consists of 2” intervals 


1 
f length —. 
of length 3 


The Cantor set C is defined as the set of all points belonging to all Gn, i.e., C = n Gn. The set C is 
n=1 


compact, uncountable, its Lebesgue measure is zero and it is perfect, i.e., C is closed and every point 
is an accumulation point. The Cantor set can be an example of a fractal. 


2. Hausdorff Dimension 
The motivation for this dimension comes from volume calculation based on Lebesgue measure. If sup- 
posing that a bounded set A C R? is covered by a finite number of spheres DB,, with radius r; € e, 


4 
so that U; B,, 5 A holds, then for A a “rough volume” is Ὁ 37 . Now, one defines the quantity 
i 


4. 
H(A) = inf © gms over all finite coverings of A by spheres with radius r; < £. If € tends to zero, 
i 


then one gets the Lebesgue outer measure \(A) of A. If A is measurable, the outer measure is equal to 
the volume vol(A). 

Suppose M is the Euclidean space IR" or, more generally, a separable metric space with metric p and 
let A C M bea subset of it. For arbitrary parameters d > 0 and £ > 0, the quantity 


1 


Hael A) = inf [Diang : A CU Bj, diamB; € e} (17.41a) 


is determined, where B; C M are arbitrary subsets with diameter diamB; = sup p(x, y). 
m,y€Bi 
The Hausdorff outer measure of dimension d of A is defined by 


Lal A) = lim pape( A) = sup pae(A) (17.41b) 
630 e>0 


and it can be either finite or infinite. The Hausdorff dimension dy(A) of the set A is then the (unique) 
critical value of the Hausdorff measure: 


oo, if A 0 for all d> 0, 
auia) { inf {d > ih =f}, (17.41¢) 
Remark: The quantities Ha (A) can be defined with coverings of spheres with radius r; < £ or, in the 
case of R”, of cubes with side length < e. 
Important Properties of the Hausdorff Dimension: 
(HD1) dg(0) — 0. 
(HD2) If AC R^, then 0 € dg(A) € n. 
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(HD3) From A C B, it follows that dy(A) < dx(B). 
(HD4) If A= U Aj, then dy(A) = sup dy(A)). 
i=l i 


105) Η A is finite or countable, then dy(A) = 0. 


(HD6) If e: M — M is Lipschitz continuous, i.e., there exists a constant L > 0 such that p(y(x), y(y)) 

< Lp(a, y)Vx,y € M, then dy(p(A)) < dy(A). If the inverse mapping yp! exists as well, and it is also 

Lipschitz continuous, then dy(A) = dy(y(A)). 

Wi For the set Q of all rational numbers dg(Q) — 0 because of (HD5). The dimension of the Cantor 
In2 

set C is dg(C) — .—. & 0.6309... 

set C is dg(C) TE 

3. Box-Counting Dimension or Capacity 

Let A be a compact set of the metric space (M, p) and let N:(.A) be the minimal number of sets of 

diameter X e, necessary to cover A. The quantity 


_ In N£(A 
dg(A) = lim sup NA) (17.42a) 
ε--θ In B 
is called the upper bor-counting dimension or upper capacity, the quantity 
In N.(A 
dp(A) = liminf ENAA) (17.42b) 


In * 
€ 


is called the lower box-counting dimension or lower capacity (then dg) of A. Ifdg(A) — dg(A) :— dg(A) 
holds, then dg(A) is called the box-counting dimension of A. In R” the box-counting dimension can be 
considered also for bounded sets which are not closed. 

For a bounded set A C R”, the number N-(A) in the above definition can also be defined in the following 
way: Let IR" be covered by a grid from n-dimensional cubes with side length e. Then, N;(A) can be 
the number of cubes of the grid having a non-empty intersecting A. 

Important Properties of the Box-Counting Dimension: 

(BD1) dg (A) € dp(A) always holds. 

(BD2) For m-dimensional surfaces F C R^ holds dg(F) — dg(F) = m. 

(BD3) dp(A) = dp(A) holds for the closure A of A, while often dz (A) « dg (A) is valid. 

(BD4) If A = UAn, then, in general, the formula dg(A) = supdg(A,) does not hold for the box- 


counting dimension. 

11 1 
WB Suppose A — (0, 1, PUE .}. Then dq(A) = 0 and dg(A) = 3° 
If A is the set of all rational points in [0, 1], then because of BD2 and BD3 dg(A) = 1 holds. On the 
other hand dg(A) — 0. 
4. Self-Similarity 
Several geometric figures, which are called self-similar, can be derived by the following procedure: An 
initial figure is replaced by a new one which is composed of p copies of the original, any of them scaled 
linearly by a factor q > 1. All figures that are k times scalings of the initial figure in the k-th step are 
handled as the in the first step. 


JAN a α A: Cantor set: p = 2, q = 3. 
— α B: Koch curve: p = 4, q = 3. The 


first three steps are shown in Fig. 17.14. 
Figure 17.14 
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AAA 


Figure 17.15 


ZZ 





W C: Sierpinski gasket: p = 3, q = 2. The 
first three steps are shown in Fig. 17.15. 
(The white triangles are always removed.) 





W D: Sierpinski carpet: p — 8, q — 3. The 
first three steps are shown in Fig. 17.16. 
(The white squares are always removed.) 














For the sets in the examples A-D: 























T 








LA. 


Figure 17.16 





17.2.4.2 Dimensions Defined by Invariant Measures 


1. Dimension of a Measure 
Let 1 be a probability measure in (M, p), concentrated on A. If x € A is an arbitrary point and B;(x) 
is a sphere with radius 6 and center at 2, then 


= In po( Bs (x 
d,(x) — lim sup ΜΡ (17.43a) 
5250 Ind 
denotes the upper and 
ο ρε Inpe(Bs(x)) 
d,(z) — lim inf a (17.43b) 


denotes the lower point-wise dimension of ui in x. If d, (x) = d(x) := d,,(a), then d,,(xr) is called the 
dimension of the measure ji in x. 
Young Theorem 1: If the relation d,(x) — a holds for j-a.e.*, x € A, then 
a=dy(") := xc ie UC: (17.44) 
The quantity dy (y) is called the Hausdorff dimension of the measure p. 
E Suppose M = R” and let A C R” be a compact sphere with Lebesgue measure A(A) > 0. The 
λ 
restriction of u to A is pA = Then 
A) 
μ(Βε(ῳ)) -ο 6” and dy(p) = n. 


2. Information Dimension 

Suppose, the attractor A of {p'her is covered by cubes Qi (€),-.., Qne) (€) of side length & as in 17.2.2.2, 
p. 879. Let jj be an invariant probability measure on A. 

The entropy of the covering Q1(€),..., Qne)(€) is 


n(e) 
-— > Pile) Inpi(e), with pe) = u(Qi(e)) (= 1,...,n(e)). (17.45) 


Ex --ο 
If the limit dr(u) = — lim lac 


the information dimension. 





exists, then this quantity has the property of a dimension and is called 





*Here and in the following a.e. is an abbreviation for “almost everywhere”. 
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Young Theorem 2: If the relation d,(x) — a holds for ji-a.e. x € A, then 
a = ἆη(μ) -- ἁπμ). (17.46) 


ΒΛ: Let the measure jt be concentrated at the equilibrium point x of {y"}. Since H-() = —1In 1 = 
0 always holds for € 7 0, so dy(js) = 0. 


Bl B: Suppose the measure p is concentrated on the limit cycle of (o*). For € » 0, H.(u) = — lne 
holds and so d;(j1) = 1. 

3. Correlation Dimension 

Let {y;}%, bea typical sequence of points of the attractor A C IR" of (o! Yer, p an invariant probability 





measure on A and let m € IN be arbitrary. For the vectors x; := (yi,-..,Yit+m) let the distance be 
defined as dist(x;, £j) := ax llYi+s — Yj+sl||; where || - || is the Euclidean vector norm. If O denotes 
sam 
1x à 0, z € 0, à 
the Heaviside function O(x) — 1150 then the expression 
m ; 1 : 
C™(e) = limsup 5 card((z;, vj): dist(z; v5) < €} 
N-++00 
ΙΝ 
= limsup — } 0 (e — dist(2;, x,)) (17.47a) 
Ν-λοο Ν ij=l 
is called the correlation integral. The quantity 
. In C""(e 
dy — lim - (17.47b) 
e>0 Ine 


(if it exists) is the correlation dimension. 
4. Generalized Dimension 


Let the attractor A of {y'}:er on M with invariant probability measure jz be covered by cubes with 
side length & as in 17.2.2.2, p. 879. For an arbitrary parameter q € IR, q Z 1, the sum 





(8) 
Hy(e) = In © pi(e)’ where pi(e) = u(Qi(e)) (17.48a) 
ici 


1—4 


is called the generalized entropy of q-th order with respect to the covering Qi(€),.--, Qn(e)(€)- 
The Rényi dimension of q-th order is 





(17.48b) 


if this limit exists. 


Special Cases of the Rényi Dimension: 


a) g=0: do = ἀσ(δαρρ μ). (17.49a) 
b) g=1: αι lim dq = di(p). (17.49b) 
c) q= 2: dz = dy. (17.49c) 


5. Lyapunov Dimension 
Let (of Her be a smooth dynamical system on M C R” with attractor A (or invariant set) and with the 
invariant ergodic probability measure jz concentrated on A. If Ay > Àz > -+ > An are the Lyapunov 


k ΚΕΙ 
exponents with respect to yz and if k is the greatest index for which X A; > 0 and > A; < 0 hold, then 


i=1 i=1 
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the value k 
LA 
di(u) =k + Al (17.50) 


is called the Lyapunov dimension of the measure p. 


If = A; 2 0, then dr(u) = n; if Ay < 0, then dz (1) = 0. 
ici 


Ledrappier Theorem: Let (o!) be a discrete system (17.3) on M C R" with a C?-function q and 
i, as above, an invariant ergodic probability measure concentrated on the attractor A of {t}. Then 
dy(u) < dz(u) holds. 

E A: Suppose the attractor A C R? of a smooth dynamical system {yt} is covered by N, squares with 
side length e. Let a; > 1 > o be the singular values of D y. Then for the dg-dimensional volume of 
the attractor ma, ~ Ne - 4? holds. Every square of side length € is transformed by y approximately 
into a parallelogram with side length σοε and oye. If the covering is made by rhombi with side length 


σ E : 
o£, then Noze ~ Ne holds. From the relation N.e?® ~ N,,-(€02)7 one gets directly 


ποιο aq (17.51) 
In 23 |λο| 


This heuristic calculation gives a hint of the origin of the formula for the Lyapunov dimension. 





WB B: Suppose the Hénon system (17.6) is given with a = 1.4 and b = 0.3. The system (17.6) has 
an attractor A (Hénon attractor) with a complicated structure for these parameter values. The nu- 
merically determined box-counting dimension is dg(A) ~ 1.26. It can be shown that there exists 
an SBR measure for the Hénon attractor A. For the Lyapunov exponents \; and A, the formula 
Ai + Ag = In| det Dy(x)| = nb = n0.3 ~ —1.204 holds. With the numerically determined value 
Ai & 0.42 one gets Az ~ —1.62. So, 

0.42 


di(u) 1 155 c: 126. (17.52) 


17.2.4.3 Local Hausdorff Dimension According to Douady and Oesterlé 


Let [fier be a smooth dynamical system on M C R* and A à compact invariant set. Suppose that 
an arbitrary to > 0 is fixed and let $ :— «f^. 


Theorem of Douady and Oesterlé: Let oci(x) > -:- > on(x) be the singular values of D (x) 
and let d € (0,n] be a number written as d = dọ + s with dọ € (0,1,...,n — 1} and s € [0,1]. If 
sup [e1(x)o2(x) . . cay (x)02,,4(x)] € 1 holds, then dg(A) « d. 

ΦΕΛ 


Special Version for Differential Equations: Let {y'}icr be the flow of (17.1), A be a compact 
invariant set and let o4(z) > --- > ap(a#) be the eigenvalues of the symmetrized Jacobian matrix 
1 

zP f(x)* -- D f(x)] at an arbitrary point x € A. Ifd € (0,n] is a number of the form d = do + s 
where dg € (0,...,n — 1} and s € [0,1], and sup[ai(x) +--+ + ado (£) + Saa+1(x)] < 0 holds, then 

ced 
dg(M) « d. The quantity 
d (z) = 0, if oa (x) « 0, 

DWT) =) sup{d: 0 € d € n, aile) +-+ agg (x) + (d — [d]) ayaj41() > 0} otherwise, 
where x € A is arbitrary and [d] is the integer part of d, is called the Douady-Oesterlé dimension at the 
point x. Under the assumptions of the Douady-Oesterlé theorem for differential equations, dg(A) € 
sup dpo(x). 
zeA 


(17.53) 
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W The Lorenz system (17.2), in the case of o — 10, b = 8/3, r = 
28, has an attractor A (Lorenz attractor) with the numerically de- 
termined dimension dy(A) * 2.06 (Fig. 17.17 is generated by 
Mathematica). With the Douady-Oesterlé theorem, for arbitrary 
b l,c0andr > 0 one gets the estimation 

dg(A) X 3— eni where (17.542) 





1 
5-73 Vo—1 





o+b+\ l(a »«( : | 2) ^ 2 (17.54b) 


17.2.4.4 Examples of Attractors 

ΒΑ: The horseshoe mapping p occurs in connection with 
Poincaré mappings containing the transversal intersections of sta- 
ble and unstable manifolds. The unit square M = [0,1] x [0,1] 
is stretched linearly in one coordinate direction and contracted in 
Figure 17.17 the other direction. Finally, this rectangle is bent at the middle 
(Fig. 17.18). Repeating this procedure 
infinitely many times, a sequence of sets 
M D (M) dD --- is produced, for which 














A= f e" (M) (17.55) 


k=0 


























is a compact set and an invariant set with 
M φ(Μ) °M) respect to y. A attracts all points of M. 
Apart from one point, A can be described 

Figure 17.18 locally as a product “line x Cantor set”. 


WB B: Leto € (0,1/2) bea parameter and M = [0,1] x [0, 1] be the unit square. The mapping e: M — 
M given by 


1 
(2x, ay), if0<a< aye (0, 1], 


elz, y) = (17.56a) 


1 1 
(2x — lray+ 2 if 5 « zr € 1, y € [0,1] 


is called the dissipative baker’s mapping. Two iterations of the baker’s mapping are shown in Fig. 17.19. 


UA 


1x 1x 1x 


The “flaky pastry structure” is recogniz- 


1 y able. 'The set 
! A=) (M) (17.56b) 
k=0 


is invariant under y and all points from 
M are attracted by A. The value of the 
Hausdorff dimension is 


In2 
πω ον 
Figure 17.19 -ma 


For the dynamical system (t) there exists an invariant measure jj on M, which is different from the 
Lebesgue measure. At the points where the derivative exists, the Jacobian matrix is D y*((a,y)) = 








(17.56c) 


k 
E M Hence, the singular values are o1(k,(x,y)) = 2*, oo(k,(x,y)) = a* and, consequently, 
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the Lyapunov exponents are \; = In2, Az = Ina (with respect to the invariant measure p). For the 
Lyapunov dimension 





In2 
da =t Sal) (17.56d) 
—Ina 
is obtained. Pesin’s formula (see 17.2.3,4., (17.40), p. 881) for the metric entropy is valid here, i.e., 
= y A= ln, (17.56e) 
λ150 


W C: Let T be the whole torus with local coordinates (O, z, y), as is shown in Fig. 17.20a. 


φ(Τ) 
D(0) © C9 Qu 


Figure 17.20 





Let a mapping v: T' — T' be defined by 


x 1 3, 2 
Bua s 38 | = i a 4 ο (k — 0,1,...) (17.57) 


Uk4i sin Ok Uk 
with parameter a € (0, 1/2). The image (T), with the intersections p(T) à D(0) and e? (T) n D(@), 
is shown in Fig. 17.20b and Fig. 17.20c. The result of infinitely many intersections is the set A = 
oo 
(f. v^ (T), which is called a solenoid. The attractor A consists of a continuum of curves in the length 
k=0 


direction, and each of them is dense in A, and unstable. The cross-section of the A transversal to these 
curves is a Cantor set. 

: - In2 . D I 
The Hausdorff dimension is dy(A) = 1— In: The set A has a neighborhood which is a domain of at- 

na 

traction. Furthermore, the attractor A is structurally stable, i.e., the qualitative properties formulated 
above do not change for C!-small perturbations of φ. 
W D: The solenoid is an example of a hyperbolic attractor. 


17.2.5 Strange Attractors and Chaos 


1. Chaotic Attractor 

Let (v! Hier be a dynamical system in the metric space (M, p). The attractor A of this system is called 
chaotic if there is a sensitive dependence on the initial condition in A. 

The property “ sensitive dependence on the initial conditions ” will be made more precise in different 
ways. It is given, e.g., if one of the two following conditions is fulfilled: 

a) All motions of (v) on A are unstable in a certain sense. 

b) The greatest Lyapunov exponent of {p"} is positive with respect to an invariant ergodic probability 
measure concentrated on A. 

W Sensitive dependence in the sense of a) occurs for the solenoid. Property b) can be found, e.g., for 
Hénon attractors. 

2. Fractals and Strange Attractors 

An attractor A of {y'}ier is called fractal if it represents neither a finite number of points or a piecewise 
differentiable curve or surface nor a set which is bounded by some closed piecewise differentiable surface. 
An attractor is called strange if it is chaotic, fractal or both. The notions chaotic, fractal and strange 
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are used for compact invariant sets analogously even if they are not attractors. A dynamical system is 
called chaotic if it has a compact invariant chaotic set. 
E The mapping 
In41 = 2%n,+Yn (mod1), Yn41 =Tn+Yn (mod 1) (17.58) 
(Anosov diffeomorphism) is considered on the unit square. The adequate phase space for this system 
is the torus T?. It is conservative, has the Lebesgue measure as invariant measure, has a countable 
number of periodic orbits whose union is dense and is mixing. Otherwise, A = T? is an invariant set 
with integer dimension 2. 
3. Systems Chaotic in the Sense of Devaney 
Let {y’}rer be a dynamical system in the metric space (M, ϱ) with a compact invariant set A. The 
system {yter (or the set A) is called chaotic in the sense of Devaney, if: 
a) (e! ier is topologically transitive on A, i.e., there is a positive semiorbit, which is dense in A. 
b) The periodic orbits of (v! );er are dense in A. 
C) (e! hier is sensitive with respect to the initial conditions in the sense of Guckenheimer on A, i.e., 
Je»0VzeAVó»0 AYEANU (x) At>0 : p(y'(x), y'(y)) Se (17.59) 
where Us(x) = {z € M: p(x, z) < ô}. 
E Consider the space of the 0-1-sequences 


So = {s = sosise..., 81€ {0,1} (@=0,1...)}. 
For two sequences s — 595152... and s' — ss/s}..., their distance is defined by 
0, ifs=s' 
s» gl) ? , 
plss) = ee) if sz s', 
where j is the smallest index for which s; Z sj. So, (X, p) is a complete metric space which is also 
compact. 
The mapping p: $ = 809185... Ε-ὸ σ(8) Ξ- 8’ = 815953... is called a Bernoulli shift. 
The Bernoulli shift is chaotic in the sense of Devaney. 


17.2.6 Chaos in One-Dimensional Mappings 

For continuous mappings of a compact interval into itself, there exist several sufficient conditions for 

the existence of chaotic invariant sets. Three examples are to be considered. 

Shinai Theorem: Let y: I — I be a continuous mapping of a compact interval I, e.g., Z = [0, 1] 

into itself. Then the system (y^) on J is chaotic in the sense of Devaney if and only if the topological 

entropy of y on J, 1.6., h(q), is positive. 

Sharkovsky Theorem: Consider the following ordering of positive integer numbers: 
βχδ»Τ»...»53:8»3:5»...»323-8» 23.-»...... - 23» 2»93»1. (17.60) 

Let o: I — I bea continuous mapping of a compact interval into itself and suppose {y*} has an n- 

periodic orbit on I. "Then (q^) also has an m-periodic orbit if n = m. 

Block, Guckenheimer and Misiuriewicz Theorem: Let o: / — 1 be a continuous mapping 

of the compact interval J into itself such that (q^) has a 2"m-periodic orbit (m » 1, odd). Then 


ie holds. 


= gn+1 





17.2.7 Reconstruction of Dynamics from Time Series 
17.2.7.1 Foundations, Reconstruction with Basic Properties 


1. Measuring Function, Time Series 
Considered is a dynamic system {pt hier with I € (Z,, R4], generated by a mapping e € Difi (U ) 
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(see 17.1.4.2, p. 874) or a vector field f € X} (U) (see 17.1.4.1. p. 873). A Cl-function h: U > R, 
the so called measuring function is needed to reconstruct the dynamics from measurements. Since in 
practice only discrete time measurements can be obtained, this will be done in time periods (kr, k — 
1,2,...} C P, where 7 7 0 is a fixed interval of time. For m € IN and & € {—1,1} the time sequence 
of order m measured with fixed time step T > 0 


{(h (p) ^ (a *9"(9)) ,..., n (pf m709"()))^.— | (17.61) 
is called the backward (« = —1) or forward (« = 1) coordinates of the orbit {y"(p)}ier with p € U 


based on the measuring function h. 


2. Immersion, Embedding, Theorem of Whitney 

Let U C R” be an open set. The C!-mapping ®: U > R" is called an immersion if the Jacobian 
matrix Dé(uw) has rank n for every u € U. The immersion $: U — R"' is called embedding if Φ is a 
homeomorphism of U into 9(U) (®(U) is considered with the subspace topology of IR"). 


Whitney’s theorem states that with respect to the open and bounded set U C IR" for every m > 2n+1 
the set of all embeddings ®: U + R™ form an open and dense subset of C!-mappings U + R™. Hence, 
for m > 2n + 1 is @ generically an embedding. 

3. Reconstruction Theorem of Takens, Theorem of Kupka and Smale 

Considering Diff l(U ) with an arbitrary natural number m > 2n + 1 the set of all pairs (y,h) € 
Diff} (U) x C1(U, R), for which the reconstruction mapping (in forward coordinates) 


pe U r+ Ga(p) = (Alp), A(p'(p)),--- h(e" (p) (17.62) 


is an embedding, is open and dense in Diff} (U ) x Cl(U,R). Hence, the property of ®,,,, to be an 
embedding is generic for m > 2n + 1. Such a number m is called embedding dimension (see theorem 
of Takens [17.13]). The theorem of Takens can be applied for differential equations from X} (U je If 
m > 2n + 1 is an arbitrary natural number, then the set of all pairs (f, h) € X1(U) x C*(U, R) for 
which the reconstruction mapping (in forward coordinates) 


p E U > Spalp) = (hp), ho (p)), -he o))) (17.63) 


(with {y'}is0 as the semi-flow belonging to f) is an embedding, constitutes an open and dense set in 
xi(U ) x C'(U, R) . Hence, the property of ®,,,, to be an embedding is generic. 

W Let the differential equation 2 — —z — f(x) be given in an interval (C1 — &,1-- €) (e » 0, fora 
sufficiently small £). Since f is continuously differentiable and f(—1) — 1 > 0 and f(1) — —1, clearly 
f € X} (U) with U — (—1,1). This also follows from the explicite solution t(x) ^ re *(t 7 0,2 € U). 
The theorem of Takens states that for m 7 3 the reconstruction function 9; is a generic embedding 
with continuously differentiable measuring function h : (—1,1) — R. For example the measuring 
function h4(r) = x for the mapping x € (—1,1) œ fa, (£) = (a, xe, xe?) is obviously an em- 
bedding in R. However with the measuring function h: (—1,1) — R the reconstruction mapping 
x E (—1,1) 9 Opa (x) — (x?,2?e ?, x?e 4) is not injective and therefore is not an embedding. 

The reconstruction theorem of Takens is based on the theorem of Kupka and Smale: The set of all 
diffeomorphisms y € Diff} (U ), such that their period points are hyperbolic and W%(p) is transver- 
sal to W“(q) for arbitrary period point, forms a set of second Baire category, that is, such diffeomor- 
phisms are typical in Difi (U ). The condition m > 2n + 1 follows from the fact that for typical 
(o, h) € Difi (U ) x C (U, R) mapping $,,; is an immersion in the neighborhood of the period points, 
therefore it can be extended into an embedding on the entire U. 

4. Dynamic in the Reconstruction Space 

The theorem of Takens implies that for a generic (y,h) € Diff}(U) x C1(U,R) the set 9(U) (re- 
construction space) with 9 — 4, is an immerse and homeomorphic image of U and the mapping 
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y = @o po @* is defined on &(U). The topologic properties of the stationary points and periodic 
orbits of the (unknown) system (v^ rez, defined on U and the system {yk} reg, defined on (U) are 
identical to those of the eigenvalues of the Jacobi matrices. Similarly the entropies and dimensions, e.g. 
correlation dimension (see 17.2.7.2, p. 891), correspond to the Lyapunov exponents associated with in- 
variant measure. The mapping v» on (u) is completely described in all points being given in the time 
series. For example select T = 1. Let point £ = (hi (p)) h (zr 71(p))) € R", k € Z4 begiven. 
Clearly z, — $(g,) , with ag = y*(p). Then Y(zp) = (D o p o P-L) (D(qr)) 9 zi, ie; v (h(g),..., 

h(i-em-1)) — (qa) A(dk+2), -- - , R(dk+m)) - From the measurements of the orbits (with I = Z4) of 
w defined on &(U) the entire dynamics of y defined on U can be obtained. 


17.2.7.2  Reconstructions with Prevalent Properties 


1. Prevalence as a Generic Metric 

Prevalence or generic metric is an extension to infinite dimension of the wellknown concept ” almost 
everywhere in Lebesgue-measure" (see 12.9.1,2., p. 694) known in finite dimensions, so it differs from 
the corresponding notions for the sets of the second Baire category. A Borel set S in a Banach space 
B is prevalent (see [17.13]) if there is a finite Borel measure p with support K (see 17.2.1.1,1., p. 876), 
such that u(S + x) = a(S) = (K) for all x € B. 

WA: Every Borel set of a finite dimensional vector space with measure zero complement is prevalent. 
WB: Tho union and intersection of finitely many prevalent sets are also prevalent. 

MC: Let C*(U, R), U C R* be the Banach space of all scalar valued functions from C^(U, IR) whose 
partial derivatives up to order k are continuous on U. If U € R" is open and connected, then the 
prevalent subsets of C*(U, R) , are dense in this space. 


2. Reconstruction Theorems of Sauer, Yorke and Casdagli 

Let (329 bea continuous dynamics generated by the vector field f € X B (see 17.1.4.1, p. 873) and let 
A be a compact subset in U with fractal dimension do(A) — d. Furthermore let m > 2d be an integer 
and r > 0 arbitrary. The process {y"}:s0 restricted to A might have only finitely many stationary 
points, there is no orbit with period 7 or 27, and only finitely many orbits with periods 37, 47, ...,mr 
where the multiplyers of these periodic orbits (except by 1) are all different. The set of all measuring 
functions h : U + Ris a prevalent set C! (U, R) if the reconstruction function $; n,- 


p € U e dps (p) — (h (e (p)) ,h (79 (p) ,... (p) (17.64) 
satisfies the following conditions: 
a) Bfn is injective on A; 
(W), whith W = G N R? x 


b) fhar is an immersion on every subset Uc A, such that U = 
< d. (Theorem of Sauer, Yorke, Cas- 


{0,..-,0} -,0},G c R^ open, V: G — R" is a Chmapping and k 
n= YIN. 


dagli see [17.13].) 


3. Estimation of Correlation Dimension 
Given the dynamic system [t );er mit P € (Z4, R4), generated either by o € Difi (U) or f € X} (U). 
Let {y'}icr have an attractor A in U with invariant probability measure j. Let h: U — R bea 
measuring function, m € IN the order parameter, 7 — 1 the time stepsize and for i = 1, 2,... 

Ti = (Yi, Viris Uitm) € ἘΠ (17.65) 
with y; = h(y'(p)) being a forward coordinate of the m -- 1 order time series of the orbit (ler with 
p € U. The distance of vectors x; and v; is defined as dist(r;, vj) = ax [yos — Yj+s|- Let N >m 
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denote a natural number and £ > 0 a real number, then the expression 
1 
C™(y) = limsup — Nà g (number of ordered pairs (zi, 2;): 4j € (1,..., NJ, dist(ziz;) « e) (17.66) 
N-oo 


In C" (e 
is called (discrete) correlation integral (with respect to m and &). If the d(m) — lim “ee exists, 
e ne 

then it is an estimation of the correlation dimension dg. The Takens theorem implies that h € C! (U, R) 
is generic for m > 2n, and the theorem of Sauer, Yorke, Casdagli implies that it is prevalent for m+1 > 
do(A) with backward coordinates. 

E The Lorenz system (17.2) (see 17.1.1.1,2., p.858) belongs to X1 (U), where U can be selected as 
U :— ((z,y,z) € R? : Ea? - y? - (z — o — rY] « ο) (c » 0, sufficiently large) . Clearly the Lorenz 
attractor A (with ø = 10, b — 8/3, r = 28) is in U. The theorem of Douady-Oesterlé (see 17.2.4.3, 
p. 886) gives the upperbound dg (A) € 2.421. Numerical integration with the Box-Counting-Method 
gives dg (A) & 2.06. Estimation of the correlation dimension in natural measure using the embedding 
method with time series in backward coordinates (r & 0.12) gives the value dy & 2.03 (Grassberger, 
(17.12]) for the Lorenz attractor . 


17.3 Bifurcation Theory and Routes to Chaos 


17.3.1 Bifurcations in Morse-Smale Systems 

Let {yt her be a dynamical system generated by a differential equation or by a mapping on M C R”, 
which additionally depends on a parameter e € V C RR. Every change of the topological structure 
of the phase portrait of the dynamical system for small changes of the parameter is called bifurcation. 
The parameter € = 0 € V is called the bifurcation value if there exist parameter values e € V in every 
neighborhood of 0 such that the dynamical systems (f) and (qb) are not topologically equivalent 
or conjugated on M. The smallest dimension of the parameter space for which a bifurcation can be 
observed is called the codimension of the bifurcation. 

One distinguishes between local bifurcations, which occur in the neighborhood of a single orbit of the 
dynamical system, and global bifurcations, which affect a large part of the phase space. 


17.3.1.1 Local Bifurcations in Neighborhoods of Steady States 


1. Center Manifold Theorem 
Consider a parameter-dependent differential equation 
=f (té) or 2 = fi(ti,...,2n,€1,---,€1) (¢=1,2,...,n) (17.67) 


with f: M x V > R”, where M C R” and V C R? are open sets and f is supposed to be r times con- 
tinuously differentiable. Equation (17.67) can be interpreted as a parameter-free differential equation 
t = f(x,£), = 0 in the phase space M x V. From the Picard-Lindelöf theorem and the theorem on 
differentiability with respect to the initial values (see 17.1.1.1, 2., p. 857) it follows that (17.67) has a 
locally unique solution y(-, p,€) with initial point p at time t = 0 for arbitrary p € M and € € V, which 
is r times continuously differentiable with respect to p and ε. Suppose all solutions exist on the whole 
of R. 

Furthermore, it is supposed that the system (17.67) has the equilibrium point x = 0 at € = 0, i.e., 


afi 
Ox 


Furthermore, suppose, D; f (0,0) has exactly m eigenvalues with negative a part andk =n—s—m 
eigenvalues with positive real part. 

According to the center manifold theorem for differential equations (theorem of Shoshitaishvili, see 
[17.12]), the differential equation (17.67), for € with a sufficiently small norm ||e|| in the neighborhood 





f(0,0) — 0 holds. Let À;,..., À; be the eigenvalues of D,/(0,0) — [34 -(0, ο) with ReA; — 0. 


i,j—l 
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of 0, is topologically equivalent to the system 

t= F(x,£) = Ax +g(x,£€) ὑ--υ, ᾱ--2 (17.68) 
with z € Ró, y € R" and z € R*, where A is a matrix of type (s, s) with eigenvalues A,,..., A, and g 
represents a C’-function with g(0,0) = 0 and D,g(0,0) — 0. 
It follows from representation (17.68) that the bifurcations of (17.67) in a neighborhood of 0 are 
uniquely described by the differential equation 

& = F(a, e). (17.69) 
Equation Q7. 69) represents the reduced differential equation to the local center manifold Wg, = {x,y, z: 
y = 0,z = 0} of (17.68). The reduced differential equation (17.69) can often be transformed into 
a relatively simple form, e.g., with polynomials on the right-hand side, by a non-linear parameter- 
dependent coordinate transformation so that the topological structure of its phase portrait does not 
change close to the considered equilibrium point. This form is a so-called normal form. A normal form 
cannot be determined uniquely; in general, a bifurcation can be described equivalently by different 
normal forms. 
2. Saddle-Node Bifurcation and Transcritical Bifurcation 
Suppose (17.67) is given with | — 1, where f is continuously differentiable at least twice and DD, f (0, 0) 
has the eigenvalue A; — 0 and n — 1 eigenvalues A; with ReA; 4 0. According to the center manifold 
theorem, in this case, all bifurcations (17.67) near 0 are described by a one-dimensional reduced differ- 
OF 
Ox 

or 

F(0,0) # 0, 3 0 0) Z 0 and the right-hand side of (17.69) is expanded according to the Taylor 


ential equation (17.69). Obviously, here F(0,0) — 

2 
8r 
formula, then n —M can be transformed by coordinate transformation (see [17.6]) into 
the normal form 


aids (17.70) 


(0,0) = 0. If, additionally, it is supposed that 


Pp op 
(ων πα. 0)» o) ort’ =a-—a?4+--- (ων ae 3 (0,0) « ο) where a = a(e) is a differentiable 
function with a(0) = 0 and the points indicate higher-order terms. For a < 0, (17.70) has two equi- 
librium points close to « = 0, among which one is stable, the other is unstable. For a = 0, these 
equilibrium points fuse into « = 0, which is unstable. For a > 0, (17.70) has no equilibrium point near 
to 0 (Fig. 17.21b). 

The multidimensional case results in a saddle-node bifurcation in a neighborhood of 0 in (17.67). This 
bifurcation is represented in Fig. 17.22 for n — 2 and \y = 0,A_ < 0. The representation of the 
saddle-node bifurcation in the extended phase space is shown in Fig. 17.21a. For sufficiently smooth 
vector fields (17.67), the saddle-node bifurcations are generic. 


ee a‘ 


a<0 a=0 a>0 





Figure 17.21 


F 
If among the conditions which yield a saddle-node bifurcation for F, the condition σα, 0) # 0is 


2 


F 
and (0,0) Z 0, then one gets from (17.69) the truncated 
ve 





iE 
replaced by the conditions XO 0) 2 0 and 
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| 0 
a<0 a=0 a>0 





Y 


Figure 17.22 


normal form (without higher-order terms) ¢ = ax — x? of a transcritical bifurcation. For n = 2 and 
λα « 0, the transcritical bifurcation together with the bifurcation diagram is shown in Fig. 17.23. 
Saddle-node and transcritical bifurcations have codimension 1-bifurcations. 





m E 


Ξ0 α»0 
Figure 17.23 


3. Hopf Bifurcation 
Consider (17.67) with n > 2,1 = 1 andr > 4. Suppose that f(0,¢) = 0 is valid for all & with |e| € £o 
(£o > 0 sufficiently small). Suppose the Jacobian matrix D,f(0,0) has the eigenvalues \y = Ay = iw 
with w 4 0 and n — 2 eigenvalues A; with ReA; z 0. According to the center manifold theorem, the 
bifurcation is described by a two-dimensional reduced differential equation (17.69) of the form 
i = a(e)x — w(e)y + gi(x,y,£€), ý = w(e)x + a(e)y + go(x,y, €) (17.71) 
where a, w, gı and g are differentiable functions and w(0) = w and also a(0) = 0 hold. By a non-linear 
complex coordinate transformation and by the introduction of polar coordinates (r, 0), (17.71) can be 
written in the normal form 
? — a(&)r -- a(e)r? ----, à — w(e) 4 b(e)r? 4 --- (17.72) 
where dots denote the terms of higher order. The Taylor expansion of the coefficient functions of (17.72) 
yields the truncated normal form 


* — a'(0)er - a(0)r?, Ù= w(0) + w'(0)e + b(0)r?. (17.73) 


The theorem of Andronov and Hopf guarantees that (17.73) describes the bifurcations of (17.72) in a 
neighborhood of the equilibrium point for € = 0. 





The following cases occur for (17.73) under the assumption o'(0) » 0: 


1. a(0) <0 (Fig. 17.24a). : 2. α(0) » 0 (Fig. 17.24b) : 
à) € 7 0: Stable limit cycle and a) € <0: Unstable limit cycle. 
unstable equilibrium point. 
b) e£ — 0: Cycle and equilibrium point fuse b) £ — 0: Cycle and equilibrium point fuse 
into a stable equilibrium point. into an unstable equilibrium point. 


c)e « 0: Allorbits close to (0,0) tend asin b) — c)&e » 0: Spiral type unstable 
fort + +00 spirally to the equilibri- equilibrium point as in b). 
um point (0, 0). 
The interpretation of the above cases for the initial system (17.67) shows the bifurcation of a limit cycle 
of a compound equilibrium point (compound focus point of multiplicity 1), which is called a Hopf bifur- 
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α) ε»ῦ e<0 b) £«0 £20 


Figure 17.24 


cation (or Andronov-Hopf bifurcation). The case a(0) « 0 is also called supercritical, the case a(0) > 0 
subcritical (supposing that a/(0) > 0). The case n = 3, Ay = Ag = i, A3 < 0,a’(0) > 0 and a(0) < 0 is 
shown in Fig. 17.25. 


e>0 £x0 


Figure 17.25 
Hopf bifurcations are generic and have codimension 1. The cases above illustrate the fact that a super- 
critical Hopf bifurcation under the above assumptions can be recognized by the stability of a focus: 


Suppose the eigenvalues A; (c) and As(£) of the Jacobian matrix on the right-hand side of (17.67) at 0 
are pure imaginary for « = 0, and for the other eigenvalues A; ReA; 4 0 holds. Suppose furthermore 


that i ReAj(€)j-=0 > 0 and let 0 be an asymptotically stable focus for (17.67) at ¢ = 0. Then there is 
a supercritical Hopf bifurcation in (17.67) at ¢ = 0. 
Wi The van der Pol differential equation % + e(x? — 1)4 + x = 0 with parameter £ can be written as a 
planar differential equation 

i-y, g--e(z-1)y-z. (17.74) 
For e = 0, (17.74) becomes the harmonic oscillator equation and it has only periodic solutions and 


an equilibrium point, which is stable but not asymptotically stable. With the transformation u — 
Vex, v= Vey fore > 0 (17.74) is transformed into the planar differential equation 


ü—v ὑ--ι--(ω--ε)υ. (17.75) [ 


For the eigenvalues of the Jacobian matrix at the equilibrium point (0,0) of (17.75): 


Je? ] d 1 
λιρ(ε) - zF VTT land so A1,2(0) = +i and T ReAi(€)je=0 = 2 0. 


As shown in the example of 17.1.2.3, 1., p. 863, (0,0) is an asymptotically stable equilibrium point of 
(17.75) for € — 0. There is a supercritical Hopf bifurcation for € — 0, and for small € 7 0, (0,0) is an 
unstable focus surrounded by a limit cycle whose amplitude is increasing as € increases. 








4. Bifurcations in Two-Parameter Differential Equations 

1. Cusp Bifurcation Suppose the differential equation (17.67) is given with r 7 4 and | — 2. Let 

the Jacobian matrix D, f(0,0) have the eigenvalue Ay — 0 and n — 1 eigenvalues A; with ReA; z 0 and 
F 2p 

suppose that for the reduced differential equation (17.69) F(0,0) — τα 0) = 7 (0, 0) = 0 und 
X “Ὁ 
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F 
l3 := an 4 (0,0) # 0. Then the Taylor expansion of F close to (0, 0) leads to the truncated normal form 
(without higher-order terms see, [17.1]) 


$ — 03 oor ^ sign laa? (17.76) 
with the parameters a; and az. The set (01,09, x): o4 -- osx 4- sign 33? — 0) represents a surface in 
extended phase space and this surface is called a cusp (Fig. 17. 26a). 

In the following, it is supposed l3 < 0. The non-hyperbolic equilibrium points of (17.76) are defined 
by the system a, + agv — 2° = 0, ag — 3a? = 0 and thus they lie on the curves 54 and S5, which are 
determined by the set {(a1, a2): 27a? — 4a} = 0} and form a cusp (Fig. 17.26b). If (a1, a2) = (0,0) 
then the equilibrium point 0 of (17.76) is stable. The phase portrait of (17.67) in a neighborhood of 0, 
e.g., for n — 2, ls « 0 and 4, — 0 is shown in Fig. 17.26c for A» « 0 (triple node) and in Fig. 17.26d 
for Az > 0 (triple saddle) (see [17.6]). 

At transition from (a1, a2) = (0,0) into the interior of domain 1 (Fig. 17.26b) the compound node- 
type non-hyperbolic equilibrium point 0 of (17.67) splits into three hyperbolic equilibrium points (two 
stable nodes and a saddle) (supercritical pitchfork bifurcation). 

In the case of the two-dimensional phase space of (17.67) the phase portraits are shown in Fig. 17.26c,e. 
When the parameter pair of 5; \ {(0,0)} (i = 1,2) traverse from 1 into 2 then a double saddle node- 
type equilibrium point is formed which finally vanishes. A stable hyperbolic equilibrium point remains. 


i = E JV 
b) m 9 3 





e) 


Figure 17.26 


KK OK 
CX OK CX 


Figure 17.27 





2. Bogdanov-Takens Bifurcation Suppose, for (17.67), n > 2, l = 2, r > 2 hold and the matrix 
D,f (0,0) has two eigenvalues 4; — Ag = 0 and n — 2 eigenvalues Ag with Red; 4 0. Let the reduced 
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two-dimensional differential equation (17.69) be topologically equivalent to the planar system 


$—y, ý= +T +T? — xy. (17.77) 
Then there is a saddle-node bifurcation on the curve S1 = {(a1, @2): a3 — 4aı = 0}. At the transition 
on the curve $5 — {(a1, a2): a1 = 0,a2 < 0} from the domain a; < 0 into the domain a; > 0 a stable 
limit cycle arises by a Hopf bifurcation and on the curve $3 = {(a1,a2): a, = —ka3+---} (k > 0, 
constant) there exists a separatrix loop for the original system (Fig. 17.27), which bifurcates into a 
stable limit cycle in domain 3 (see [17.6], [17.9]). 

This bifurcation is of a global nature and one says that a single periodic orbit arises from the homoclinic 
orbit of a saddle or a separatrix loop vanishes. 

3. Generalized Hopf Bifurcation Suppose that the assumptions of the Hopf bifurcation with 
r > 6are fulfilled for (17.67), and the two-dimensional reduced differential equation after a coordinate 
transformation into polar coordinates has the normal form # = eyr + €gr?— rr? +---, )—1-4--- The 
bifurcation diagram (Fig. 17.28) of this system contains the line Sı = {(€1,€2): ¢1 = 0, €2 £ 0}, whose 
points represent a Hopf bifurcation (see [17.4], [17.6]). There exist two periodic orbits in domain 3, 
among which one is stable, the other one is unstable. On the curve S2 = { (£1, £2): €3+4e2 > 0, £1 < 0}, 
these non-hyperbolic cycles fuse into a compound cycle which disappears in domain 2. 


D 6 j, 6 6 


Figure 17.28 


5. Breaking Symmetry 

Some differential equations (17.67) have symmetries in the following sense: There exists a linear trans- 
formation T' (or a group of transformations) such that f(T'v,c) — T f (v,£) holds for all x € M and 
€ € V. An orbit * of (17.67) is called symmetric with respect to T if Ty = 4. 

One talks about a symmetry breaking bifurcation at ¢ = 0, e.g., in (17.67) (for | — 1), if there is a 
stable equilibrium point or a stable limit cycle for € < 0, which is always symmetric with respect to T, 
and for ¢ = 0 two further stable steady states or limit cycles arise, which are no-longer symmetric with 
respect to T'. 

Wl For system (17.67) with f(x, £) = &x—2? the transformation T defined as T: x — —zc is asymmetry, 
since f(—a,e) = —f(x,e) (a € R,e € R). Fore « 0 the point z; — 0 is a stable equilibrium point. 
For € > 0, besides zı = 0, there exist the two other equilibrium points $3 = +,/e; both are non- 
symmetric. 


17.3.1.2 Local Bifurcations in a Neighborhood of a Periodic Orbit 


1. Center Manifold Theorem for Mappings 
Let ^y be periodic orbit of (17.67) for € — 0 with multipliers p1, ..., Pn—1, Pn = 1. A bifurcation close to 
η is possible, if when changing £, at least one of the multipliers lies on the complex unit circle. The use 
of a surface transversal to y leads to the parameter-dependent Poincaré mapping 

qz —3 P(m,e). (17.78) 
Then, with open sets E C R" and V C Rllet P: ExV — R?-! bea C"-mapping where the mapping 
P:ExV —R'!xR!with P(r,e) — (P(x, e), ) should be a C"-diffeomorphism. Furthermore, 
let P(0,0) = 0 and suppose the Jacobian matrix D; P(0, 0) has s eigenvalues o1, ..., ps with |pi| = 1, 
m eigenvalues ps41,..., Ps¢m With |p;| < 1 and k — n — s — m — 1 eigenvalues ps4m+1;. -- , Pn-1 
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with |p;| > 1. Then, according to the the center manifold theorem for mappings (see [17.4]), close to 
(0,0) € E x V, the mapping P is topologically conjugate to the mapping 

(2, y, 2, €) — (Fr, €), A*y, A*2, €) (17.79) 
near (0,0) € R"^! x R! with F(r,£) — A*r + g(x, ). Here g is a C"-differentiable mapping satisfying 
the relations g(0,0) = 0 and D,g(0,0) = 0. The matrices A^, A* and A" are of type (s, s), (m, rn) and 
(k, k), respectively. 
It follows from (17.79) that bifurcations of (17.78) close to (0,0) are described only by the reduced 
mapping 








zı F(zx,8) (17.80) 
on the local center manifold Wi, — ((x,y.z): y — 0, z — 0). 
F(,0)4 F(a) F(a) 
» » » 
X X X 
a) 
b) a<0 α-θ o0 





Figure 17.29 


2. Bifurcation of Double Semistable Periodic Orbits 

Let the system (17.67) be given with n > 2, r > 3 and/ = 1. Suppose, at ¢ = 0, the system (17.67) has 
periodic orbit y with multipliers p; = +1, |p;| #1 (i = 2,3,...,n—1) and p, = 1. According to the 
center manifold theorem for mappings, the bifurcations of the Poincaré mapping (17.78) are described 





2 F 
by the one-dimensional reduced mapping (17.80) with A* — 1. If ? z (0,0) # 0 and ^: (0,0) Z 0 is 
T 
supposed, then it leads to the normal forms 
z 7 OF 
z—3F(rxo)-octrctz for p 00-0] or (17.81a) 
r 
: OF 
αι--λα--α--α for gaz 0)«0]. (17.81b) 
a 


The iterations from (17.81a) close to 0 and the corresponding phase portraits are represented in 
Fig. 17.29a and in Fig. 17.29b for different a (see [17.6]). Close to x — 0 there are fora < 0a 
stable and an unstable equilibrium point, which fuse for a = 0 into the unstable steady state x = 0. 
For a > 0 there exists no equilibrium point close to r — 0. The bifurcation described by (17.81a) in 
(17.80) is called a subcritical saddle node bifurcation for mappings. 

In the case of the differential equation (17.67), the properties of the mapping (17.81a) describe the 
bifurcation of a double semistable periodic orbit: For a < 0 there exists a stable periodic orbit ^; and an 
unstable periodic orbit 72, which fuse for a = 0 into a semistable orbit y, which disappears for a > 0 
(Fig. 17.30a,b). 

3. Period Doubling or Flip Bifurcation 

Let system (17.67) be given with n > 2, r > 4 and/ = 1. Considered is a periodic orbit 7 of (17.67) at 
E = 0 with the multipliers p, = —1, |p;| A 1 (¢ = 2,...,n — 1), and py = 1. The bifurcation behavior 
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a) 471 b) 9 


Figure 17.30 
of the Poincaré mapping in the neighborhood of 0 is described by the one-dimensional mapping (17.80) 
with A* — —1, if supposing the normal form 
xr F(v,o) — (-1-- o)z 4- οὗ. (17.82) 
The steady state x — 0 of (17.82) is stable for small a > 0 and unstable for a < 0. The second iterated 





mapping F has for a < 0 two further stable fixed points besides x = 0 for 213 -- +V/—a + offal), 
which are not fixed points of F. Consequently, they must be points of period 2 of (17.82). 


In general, for a C^-mapping (17.80) there is a two-periodic orbit at ¢ = 0, if the following conditions 
are fulfilled (see [17.2]): 




















F p? 
F0,0)=0,  eoo=-1, So) =o, 
op pr on uu 
πο κο, LL (60)-0, 570,0) 40. 
_ 9 OF E : : . 
Since à (0,0) = +1 holds (because of δε» 0) = —1), the conditions for a pitchfork bifurcation are 
T T 


formulated for the mapping £?. 
For the differential equation (17.67) the properties of the mapping (17.82) imply that at œ = 0 a stable 
periodic orbit ^, splits from y with approximately a double period (period doubling), where ^ loses its 
stability (Fig. 17.30c). 
E The logistic mapping Ya: [0,1] > [0, 1] is given for 0 < a € 4 by ga(x) — ox(1 — z), i.e., by the 
discrete dynamical system 

Tiy  απι(1 -- η). (17.84) 
The mapping has the following bifurcation behavior (see [17.10]): For 0 < a € 1 system (17.84) has 
the equilibrium point 0 with domain of attraction [0,1]. For 1 < œ < 3, (17.84) has the unstable 
equilibrium point 0 and the stable equilibrium point 1 — 1/a, where this last one has the domain of 
attraction (0,1). For a, = 3 the equilibrium point 1—1/a is unstable and leads to a stable two-periodic 
orbit. 
At the value ag = 1 + V6, the two-periodic orbit is also unstable and leads to a stable 2?-periodic or- 
bit. The period doubling continues, and stable 24-periodic orbits arise at a = ag. Numerical evidence 
shows that ag 4 Ag. © 3.570... as q — +00. 
For a = Qo, there is an attractor F (the Feigenbaum attractor), which has a structure similar to the 
Cantor set. There are points arbitrarily close to the attractor which are not iterated towards the at- 
tractor, but towards an unstable periodic orbit. The attractor F has dense orbits and the Hausdorff 
dimension is dg (F) & 0.538.... On the other hand, the dependence on initial conditions is not sen- 
sitive. In the domain aœ% < a < 4, there exists a parameter set A with positive Lebesgue measure 
such that system (17.84) has a chaotic attractor of positive Lebesgue measure for a € A. The set A is 
interspersed with windows in which period doublings occur. 
The bifurcation behavior of the logistic mapping can also be found in a class of unimodal mappings, 
i.e., of mappings of the interval J into itself, which has a single maximum in J. Although the param- 
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eter values o;, for which period doubling occurs, are different from each other for different unimodal 
mappings, the rate of convergence by which these parameters tend to a, is equal: y — o, & Có-^, 
where ó — 4.6692... . is the Feigenbaum constant (C depends on the concrete mapping). The Hausdorff 
dimension is the same for all attractors F at a = a9: dy(F) £2 0.538... . 


4. Creation of a Torus 
Consider system (17.67) with n 7 3,r > 6 and! = 1. Suppose that for all e close to 0 system (17.67) has 


. 2 
a periodic orbit ye. Let the multipliers of yo be p1,2 — e*'* with V ¢ fo, 7 = πλ. pj (j — 3,...,n—1) 


with |p;| A 1 and p, — 1. 

According to the center manifold theorem, in this case there exists a two-dimensional reduced C$- 

mapping 

zı F(a, ¢) (17.85) 

with F'(0,¢) = 0 for € close to 0. 

If the Jacobian matrix D; F(0, 2) has the conjugate complex eigenvalues p(£) and p(z) with |p(0)| — 1 
d 

for all € near 0, if d :— T [ρίε]ικ-ο > 0 holds and p(0) is not a q-th root of 1 for q = 1,2,3,4, 
de 

then (17.85) can be transformed by a smooth ¢ dependent coordinate transformation into the form 

rH F(x,€) = F,(x,e) + O(|x||®) (O Landau symbol), where F, is given in polar coordinates by 


r |p()lr - ae)? 
E— . 17.86 
(5 (i + w(e) + b(e)r? ( ) 
Here a, w and bare differentiable functions. Suppose a(0) < 0 holds. Then, the equilibrium point r = 0 
of (17.86) is asymptotically stable for all € « 0 and unstable for ¢ > 0. Furthermore, for € > 0 there 


: 7 | de πο. : . 
exists the circle r = O , which is invariant under the mapping (17.86) and asymptotically stable 
a 
(Fig. 17.31a). 


a) εΞ0 





Figure 17.31 


The Neimark-Sacker Theorem (see [17.10], [17.1]) states that the bifurcation behavior of (17.86) 
is similar to that of F (supercritical Hopf bifurcation for mappings). 
W In mapping (17.85), given by 


gz 1 /(1+e)r +y +r? -— 2y? 
-F 1 
y V2 X-x - (14 e)y t x? — 23/" 


there is a supercritical Hopf bifurcation at ¢ = 


With respect to the differential equation (17.67), the existence of a closed invariant curve of mapping 
(17.85) means that the periodic orbit 7 is unstable for a(0) < 0 and for ¢ > 0 a torus arises which is 
invariant with respect to (17.67) (Fig. 17.31b). 
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17.3.1.3 Global Bifurcation 
Besides the periodic creation orbit which arises if a separatrix loop disappears, (17.67) can have further 
global bifurcations. Two of them are shown in [17.12] by examples. 
1. Emergence of a Periodic Orbit due to the Disappearance of a Saddle-Node 
W The parameter-dependent system 

i-—zcz(l—a?—3?)--y(14- x o), j — —x(12- x o) - y(1— 22 — ?) 
has in polar coordinates x = r cos V, y = r sin Ü the following form: 

$—r(1—72), 9 2 —(133- a 4 r cos). (17.87) 
Obviously, the circle r — 1isinvariant under (17.87) for an arbitrary parameter o;, and all orbits (except 
the equilibrium point (0, 0)) tend to this circle for t — --oo. For o « 0 there are a saddle and a stable 
node on the circle, which fuse into a compound saddle-node type equilibrium point at a = 0. For a > 0, 
there is no equilibrium point on the circle, which is a periodic orbit (Fig. 17.32). 


X2 €) €) 


Figure 17.32 





2. Disappearance of a Saddle-Saddle Separatrix in the Plane 
W Consider the parameter-dependent planar differential equation 

t =a +2ry, j 14a? —3. (17.88) 
For a = 0, equation (17.88) has the two saddles (0, 1) and (0, —1) and the y-axis as invariant sets. The 
heteroclinic orbit is part of this invariant set. For small |a| 4 0, the saddle-points are retained while 
the heteroclinic orbit disappears (Fig. 17.33). 


117 


a=0 α»0 


Figure 17.33 





17.3.2 Transitions to Chaos 

Often a strange attractor does not arise suddenly but as the result of a sequence of bifurcations, from 
which the typical ones are represented in Section 17.3.1. The most important ways to create strange 
attractors or strange invariant sets are described in the following. 

17.3.2.1 Cascade of Period Doublings 

Analogously to the logistic equation (17.84), a cascade of period doublings can also occur in time- 
continuous systems in the following way. Suppose system (17.67) has the stable periodic orbit y“ for 
E < &j. For € = g a period doubling occurs near X, at which the periodic orbit y% loses its stability 


for € > £. A periodic orbit 49 with approximately double period splits from it. At € = €2, there isa 
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new period doubling, where 49 loses its stability and a stable orbit o with an approximately double 


period arises. For important classes of systems (17.67) this procedure of period doubling continues, so 
a sequence of parameter values (2;) arises. 
Numerical calculations for certain differential equations (17.67), e.g., for hydrodynamical differential 


equations such as the Lorenz system, show the existence of the limit 


; Ej+a 7 Ej 
lim —— ——— — 6, whereó (17.89) 
jotoo Ejy2 — £j41 
is again the Feigenbaum constant. 
For e, = lim ej, the cycle with infinite period loses its stability, and a strange attractor appears. 
joo 


The geometric background for this strange attractor in (17.67) by a cascade of period doubling is shown 
in Fig. 17.34a. The Poincaré section shows approximately a baker mapping, which suggests that a 
Cantor set-like structure is created. 


F(a) 


O X. 


a) b) 


Figure 17.34 


17.3.2.2 Intermittency 

Consider a stable periodic orbit of (17.67), which loses its stability for € = 0 when exactly one of 
the multipliers, for € < 0 inside the unit circle takes the value +1. According to the center manifold 
theorem, the corresponding saddle-node bifurcation of the Poincaré mapping can be described by a 





one-dimensional mapping in the normal form x > F (z,a) — a o z 4 a? 4 ---. Herea is a parameter 
depending on £, i.e., a = a(e) with a(0) = 0. The graph of F(., a) for positive a is represented in 
Fig. 17.34b. 
As can be seen in Fig. 17.34b, the iterates of f^ (-,@) stay for a relatively long time in the tunnel zone 
{οι α 0. For equation (17.67), this means that the corresponding orbits stay relatively long in the 
neighborhood of the original periodic orbit. During this time, the behavior of (17.68) is approximately 
periodic (laminar phase). After getting through the tunnel zone the considered orbit escapes, which 
results in irregular motion (turbulent phase). After a certain time the orbit is recovered and a new 
laminar phase begins. A strange attractor arises in this situation if the periodic orbit vanishes and 
its stability goes over to the chaotic set. The saddle-node bifurcation is only one of the typical local 
bifurcations playing a role in the intermittence scenario. Two further ones are period doubling and the 
creation of a torus. 


17.3.2.3 Global Homoclinic Bifurcations 


1. Smale's Theorem 

Let the invariant manifolds of the Poincaré mapping of the differential equation (17.67) in R? near the 
periodic orbit y be as in Fig. 17.11b, p. 873. The transversal homoclinic points PJ (zo) correspond 
to a homoclinic orbit of (17.67) to y. The existence of such a homoclinic orbit in (17.67) leads to 
a sensitive dependence on initial conditions. In connection with the considered Poincaré mapping, 
horseshoe mappings, introduced by Smale, can be constructed. This leads to the following statements: 





a) Inevery neighborhood of a transversal homoclinic point of the Poincaré mapping (17.80) there exists 
a periodic point of this mapping (Smale’s theorem). Hence, in every neighborhood of a transversal 
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homoclinic point there exists an invariant set of P" (m € IN), A, which is of Cantor type. The restriction 
of P" to A is topologically conjugate to a Bernoulli shift, i.e., to a mixing system. 

b) The invariant set of the differential equation (17.67) close to the homoclinic orbit is like a product of 
a Cantor set with the unit circle. If this invariant set is attracting, then it represents a strange attractor 
of (17.67). 

2. Shilnikov's Theorem 

Consider the differential equation (17.67) in RË with scalar parameter e. Suppose that the system 
(17.67) has a saddle-node type hyperbolic steady state 0 at € = 0, which exists so long as |e| remains 
small. Suppose, that the Jacobian matrix D; f (0,0) has the eigenvalue A3 > 0 and a pair of conjugate 
complex eigenvalues Àj5 — a t iw with a < 0. Suppose, additionally, that (17.67) has a separatrix 
loop *o for € — 0, i.e., a homoclinic orbit which tends to 0 for t + —oo and t + +00 (Fig. 17.35a). 





Then, in a neighborhood of a separatrix loop (17.67) has the following phase portrait : 

a) Let As +a < 0. If the separatrix loop breaks at € Z 0 according to the variant denoted by A in 
(Fig. 17.352), then there is exactly one periodic orbit of (17.67) for € — 0. If the separatrix loop breaks 
at € Z 0 according to the variant denoted by B in (Fig. 17.35a), then there is no periodic orbit. 

b) Let A3 +a > 0. Then there exist countably many saddle-type periodic orbits at ¢ = 0 (respectively, 
for small e|) close to the separatrix loop ^ (respectively, close to the destructed loop yo). The Poincaré 
mapping with respect to a transversal to the yo plane generates a countable set of horseshoe mappings 
at ¢ = 0, from which there remain finitely many for small |e| 4 0. 


w'(p,) 





b) Wp) 





Figure 17.35 
3. Melnikov's Method 
Consider the planar differential equation 
i = f(x) + g(t, 2), (17.90) 
where ¢ is asmall parameter. For ¢ = 0, let (17.90) be a Hamiltonian system (see 17.1.4.3, 2., p. 875), 


ie, for f = (fi, fo) fi = M" and fo = e hold, where H: U c R? — R is supposed to be a C?- 


function. Suppose the α vector field g: R x U > R? is twice continuously differentiable, 
and T-periodic with respect to the first argument. Furthermore, let f and g be bounded on bounded 
sets. Suppose that for ¢ = 0 there exists a homoclinic orbit with respect to the saddle point 0, and 
the Poincaré section $7,, of (17.90) in the phase space { (x1, £2, t)} for t = to looks as in Fig. 17.35b. 
The Poincaré mapping P24): i, — 74, for small |e|, has a saddle point pe close to x = 0 with the 
invariant manifolds W*(pz) and W"(p,). If the homoclinic orbit of the unperturbed system is given by 
y(t — to), then the distance between the manifold W*(p.) and W"(p.), measured along the line passing 
through v(0) and perpendicular to f(y(0)), can be calculated by the formula 


—.. M(to) 
do) = TOT 


Here, M(-) is the Melnikov function which is defined by 


t O(e). (17.91a) 


M(to) = [| f(e(t — t) ^ at e(t — £9) dt. (17.910) 


—oo 
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(For a = (a1,a2) and b = (bi, 03), ^ means a ^ b = ayb2 — agby.) If the Melnikov function M has a 
simple root at to, i.e., M (to) — 0 and M'(tg) Z 0 hold, then the manifolds W*(p.) and W" (p.) intersect 
each other transversally for sufficiently small € 7 0. If M has no root, then W*(pz) O W” (p) — 0, i.e., 
there is no homoclinic point. 

Remark: Suppose the unperturbed system (17.90) has a heteroclinic orbit given by y(t — t9), running 
from a saddle point 0; in a saddle 05. Let pl and p2 be the saddle points of the Poincaré mapping P, 
for small |e]. If M, calculated as above, has a simple root at to, then W*(p1) and W” (p2) intersect each 
other transversally for small e > 0. 

W Consider the periodically perturbed pendulum equation ț + sin x = e sin wt, i.e., the system 


$ — y, y — —sinz 4 &sinct, in which € is a small parameter and w is a further parameter. The 
unperturbed system # = y, y = —sinz is a Hamiltonian system with H (x,y) = rl — cosx. It has 


(among others) a pair of heteroclinic orbits through (—7, 0) and (7,0) (in the cylindrical phase space 








1 
S! x R these are homoclinic orbits) given by (8) -- (42 arctan(sinh t), +2) (t € R). The 
Sno 


_ 2rsinwto 
" cosh(mw/2) 
at to = 0, the Poincaré mapping of the perturbed system has transversal homoclinic points for small 
e>0. 


17.3.2.4 Destruction ofa Torus 


1. From Torus to Chaos 


1. Hopf-Landau Model of Turbulence The problem of transition from regular (laminar) behav- 
ior to irregular (turbulent) behavior is especially interesting for systems with distributed parameters, 
which are described, e.g., by partial differential equations. From this viewpoint, chaos can be inter- 
preted as behavior irregular in time but ordered in space. 


direct calculation of the Melnikov function yields M (to) = 4 . Since M has a simple root 





On the other hand, turbulence is the behavior of the system, that is irregular in time and in space. The 
Hopf-Landau model explains the arising of turbulence by an infinite cascade of Hopf bifurcations: For 
€ = & a steady state bifurcates in a limit cycle, which becomes unstable for £5 > £ and leads to a 
torus T?. At the k-th bifurcation of this type a k-dimensional torus arises, generated by non-closed 
orbits which wind on it. The Hopf-Landau model does not lead in general to an attractor which is 
characterized by sensitive dependence on the initial conditions and mixing. 
2. Ruelle-Takens-Newhouse Scenario Suppose that in system (17.67) n > 4 and | — 1 hold. 
Suppose also that changing the parameter ¢, the bifurcation sequence “equilibrium point — periodic 
orbit — torus T? — torus T?" is achieved by three consecutive Hopf bifurcations. 
Let the quasiperiodic flow on T? be structurally unstable. Then, certain small perturbation of (17.67) 
can lead to the destruction of T? and to the creation of a structurally stable strange attractor. 
3. Theorem of Afraimovich and Shilnikov on the Loss of Smoothness and the 
Destruction of the Torus T? Let the sufficiently smooth system (17.67) be given with n > 3 
and / = 2. Suppose that for the parameter value £o, the system (17.67) has an attracting smooth torus 
T? (£9) spanned by a stable periodic orbit γ., a saddle-type periodic orbit ^;, and its unstable manifold 
W"(qu) (resonance torus). 
The invariant manifolds of the equilibrium points of the Poincaré mapping computed with respect to 
a surface transversal to the torus in the longitudinal direction, are represented in Fig. 17.36a. The 
multiplier p of the orbit ys, which is the nearest to the unit circle, is assumed to be real and simple. 
Furthermore, let e(-) : [0,1] — V be an arbitrary continuous curve in parameter space, for which 
e(0) = £o and for which system (17.67) has no invariant resonance torus for ¢ = &(1). Then the 
following statements are true: 
a) There exists a value s, € (0, 1) for which T?(<(s,)) loses its smoothness. Here, either the multiplier 
p(s) is complex or the unstable manifold W” (yu) loses its smoothness close to Ys. 
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b) There exists a further parameter value s,,. € (5+, 1) such that system (17.67) has no resonance torus 
for s € (s,,, 1]. The torus is destroyed in the following way: 

œ) The periodic orbit ys loses its stability for e = e(s,,). A local bifurcation arises as period doubling 
or the formation of a torus. 

B) The periodic orbits γι and y, coincide for ¢ = e(s,.) (saddle-node bifurcation) and so they vanish. 
^) The stable and unstable manifolds of +, intersect each other non-transversally for € = €(S4) (see 
the bifurcation diagram in Fig. 17.36c). The points of the beak-shaped curve 5; correspond to the 
fused 7, and γι (saddle-node bifurcation). The tip C1 of the beak-shaped curve is on a curve So, which 
corresponds to a splitting of the torus. 





rN 
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y 
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Figure 17.36 


The parameter points where the smoothness is lost, are on the curve 55, while the points on $3 charac- 
terize the dissolving of a T? torus. The parameter points for which the stable and unstable manifolds of 
γα intersect each other non-transversally, are on the curve $4. Let Pp be an arbitrary point in the beaked 
shaped tip of the beak such that for this parameter value a resonance torus 7? arises. The transition 
from P to P, corresponds to the case a) of the theorem. If the multiplier p becomes —1 on 52, then 
there is a period doubling. A cascade of further period doublings can lead to a strange attractor. Ifa 
pair of complex conjugate multipliers p;,9 arises on the unit circle passing through 5S5, then it can result 
in the splitting of a further torus, for which the Afraimovich-Shilnikov theorem can be used again. 
The transition from P) to P, represents the case 3) of the theorem: The torus loses its smoothness, and 
on passing through on 5}, there is a saddle-node bifurcation. The torus is destroyed, and a transition 
to chaos through intermittence can happen. The transition from Po to P3, finally, corresponds to the 
case t): After the loss of smoothness, a non-robust homoclinic curve forms on passing through on S4. 
The stable cycle ys remains and a hyperbolic set arises which is not attracting for the present. If ys 
vanishes, then a strange attractor arises from this set. 


2. Mappings on the Unit Circle and Rotation Number 


1. Equivalent and Lifted Mappings The properties of the invariant curves of the Poincaré map- 
ping play an important role in the loss of smoothness and destruction of a torus. Ifthe Poincaré mapping 
is represented in polar coordinates, then, under certain assumptions, one gets decoupled mappings of 
the angular variables as informative auxiliary mappings on the unit circle. These are invertible in the 
case of smooth invariant curves (Fig. 17.36a) and in the case of non-smooth curves (Fig. 17.36b) 
they are not invertible. A mapping F: R > R with F(Q@ + 1) = F(O) +1 forall O € R, which 
generates the dynamical system 

Ony = F (On), (17.92) 
is called equivariant. For every such mapping, an associated mapping of the unit circle f: S! — δὶ 
can be assigned where S! = R \ Z = {O mod 1,0 € R}. Here f(x) := F(@) if the relation x — [6] 
holds for the equivalence class [O]. F is called a lifted mapping of f. Obviously, this construction is not 
unique. In contrast to (17.92) 


T= f(x) (17.93) 
is a dynamical system on $1. 


E Fortwo parameters w and K let the mapping F(- τω, K) be defined by F (c;w, K) = o+w-— K sino 
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for all r € R. The corresponding dynamical system 
On41 = On +w — K sin on (17.94) 
can be transformed by the transformation on — 270, into the system 


K 
Ony =n +9- T sin 27O, (17.95) 
T 

K 
where Q = = With F(0; 0, K) 2 O4-0— a sin 270 an equivariant mapping arises, which generates 

π 2π 

the canonical form of the circle mapping. 
2. Rotation Number The orbit 4(0) — (F"(O)) of (17.92) is a q-periodic orbit of (17.93) in S? if 
and only if it isa E cycle of (17.92), i.e., if there exists an integer p such that On4g = On +p, (n € Z) 
q 


holds. The mapping f: S! — S! is called orientation preserving if there exists a corresponding lifted 
mapping £^, which is monotone increasing. If F from (17.92) is a monotone increasing homeomorphism, 
"n 





then there exists the limit | lim for every x € R, and this limit does not depend on x. Hence, 
> 


ο 





the expression p(F) :— lim can be defined. If f: 5! — S! is a homeomorphism and F and 


In|-oo 
F are two lifted mappings of f, then p(F) = p(F) + k holds, where k is an integer. Based on this last 
property, the rotation number p(f) of an orientation-preserving homeomorphism f: S! — $+ can be 
defined as p( f) = p(F) mod 1, where F is an arbitrary lifted mapping of f. 
If f: St — St in (17.93) is an orientation-preserving homeomorphism, then the rotation number has 
the following properties (see [17.4]): 


a) If (17.93) has a q-periodic orbit, then there exists an integer p such that p(f) — P holds. 
q 
b) If p(f) = 0, then (17.93) has an equilibrium point. 
ce) If p(f) = P? , where p Æ 0 is an integer and q is a natural number (p and q are co-primes), then 
q 


(17.93) has a q-periodic orbit. 
d) p(f) is irrational if and only if (17.93) has neither a periodic orbit nor an equilibrium point. 
Theorem of Denjoy: If f: S! — Sl isan orientation-preserving C?-diffeomorphism and the rotation 
number a — p( f) is irrational, then f is topologically conjugate to a pure rotation whose lifted mapping 
is F(x) =z +a. 
3. Differential Equations on the Torus T? 
Let 

01 = fı(91,02), 05 — f2(91, O2) (17.96) 
be a planar differential equation, where f; and f2 are differentiable and 1-periodic functions in both 
arguments. In this case (17.96) defines a flow, which can also be interpreted as a flow on the torus 
qT? — S! x S! with respect to O, and Os. If f1(61,O05) » 0 for all (1, 05), then (17.96) has no 
equilibrium points and it is equivalent to the scalar first-order differential equation 

dO; —f5(61,02) 


dO, fi(@1, 2)’ (17.97) 
fa 


With the relations O; — t, O9 — x and f = F 
Jı 


(17.97) can be written as a non-autonomous differential 


equation 


i= f(t, x) (17.98) 
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whose right-hand side has a period of 1 with respect to t and 2. 
Let (+, £o) be the solution of (17.98) with initial state zo at time t — 0. So, a mapping y'(-) = y(1,-) 
can be defined for (17.98), which can be considered as the lifted mapping of a mapping f: $! — $1. 


B Let w;,w2 E€ R be constants and Ôi = w, Ô — ws a differential equation on the torus, which 

] ; ; ; P ω ω 

is equivalent to the scalar differential equation 3 — 22 for w, # 0. Thus, g(t, zo) = mere zo and 
Wy Wy 


w 
g(a) = — a. 
ωι 


4. Canonical Form of a Circle Mapping 


1. Canonical Form The mapping F from (17.95) is an orientation-preserving diffeomorphism for 





OF 
0 € K < 1, because 08 7 1— Kcos2-0 » 0 holds. For K — 1, F is no-longer a diffeomorphism, 


but it is still a homeomorphism, while for K > 1, the mapping is not invertible, and hence no-longer a 
homeomorphism. In the parameter domain 0 € K < 1, the rotation number p(9, K) := p(F(*, Q, K)) 
is defined for F(-, Q, K). Let K € (0,1) be fixed. Then p(-, K) has the following properties on [0,1]: 
a) The function p(:, K) is not decreasing, it is continuous, but it is not differentiable. 


b) For every rational number Le (0, 1) there exists an interval p/q, whose interior is not empty and 
q 
for which ρ(Ω, K) = © holds for all 2 € Ip/q- 
q 


C) For every irrational number a € (0, 1) there exists exactly one 9 with p(Q, K) = a. 

2. Devil’s Staircase and Arnold Tongues For every K € (0,1), p(-, A’) is a Cantor function. The 
graph of p(-, A’), which is represented in Fig. 17.37b, is called the devil’s staircase. The bifurcation 
diagram of (17.95) is represented in Fig. 17.37a. At every rational number on the Q-axis, a beak- 
shaped region (Arnold tongue) with a non-empty interior starts, where the rotation number is constant 
and equal to the rational number. 


3 2 3 1 


l 08Γ ~ 
0.6 κ΄ 
0.4} jt 
02r τ 
Q 
b 0204 06 08 τα 


Figure 17.37 








le 
Nie 
Ql 


a) 


The reason for the formation of the tongue is a synchronization of the frequencies (frequency locking). 
a) For 0 < K < 1, these regions are not overlapping. At every irrational number of the Q-axis, a 
continuous curve starts which always reaches the line K = 1. In the first Arnold tongue with p = 0, 
the dynamical system (17.95) has equilibrium points. If K is fixed and €) increases, then two of these 
equilibrium points fuse on the boundary of the first Arnold tongue and vanish at the same time. Asa 
result of such a saddle-node bifurcation, a dense orbit arises on S*. Similar phenomena can be observed 
when leaving other Arnold tongues. 

b) For K > 1 the theory of the rotation numbers is no-longer applicable. The dynamics become 
more complicated, and the transition to chaos takes place. Here, similarly to the case of Feigenbaum 
constants, further constants arise, which are equal for certain classes of mappings to which also the 
standard circle mapping belongs. One of them is described below. 
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E 
3. Golden Mean, Fibonacci Numbers The irrational number is called the golden mean 
: ; : : ; -1 1 
and it has a simple continued fraction representation = — ,— = [1;1,1,...] (see 1.1.14, 
2 l-c-—— 
+o 
3., p. 4). By successive evaluation of the continued fraction a sequence {r,,} of rational numbers is got, 
. -1 : Fn 
which converges to . The numbers r, can be represented in the form rn = —— , where £, are 
n+l 


Fibonacci numbers (see 5.4.1.5, p. 375), which are determined by the iteration Fa+1 = Fa + Fn- (n = 
1,2,---) with initial values Fp = 0 and F1 — 1. Now, let €,; be the parameter value of (17.95), for which 


v5- 


1 
P(Qeo, 1) = 2 and let Q, be the closest value to Q,5, for which p(Q5,1) — r4, holds. Numerical 
Ωμ — Osa 


nl Qn 





= —2.8336.... 


calculation gives the limit lim 
N- 


18 Optimization 


18.1 Linear Programming 


18.1.1 Formulation ofthe Problem and Geometrical 
Representation 


18.1.1.1 The Form of a Linear Programming Problem 
1. TheSubject 


of linear programming is the minimization or maximization of a linear objective function (OF) of finitely 
many variables subject to a finite number of constraints (CT), which are given as linear equations or 
inequalities. 

Many practical problems can be directly formulated as a linear programming problem, or they can be 
modeled approximately by a linear programming problem. 

2. General Form 





























A linear programming problem has the following general form: 
OF: f(x) 9 «xo 4: + CrEr + Crp Ur ga +++ + Cn Ln = Max! (18. 1a) 
CT: Gyiti re AU, T αι τει d ainn € br, 
GsiTi Tees AgrXp T Asr410r41 ΣΎ απ πι < bs, 
ας 141 +++ + Astirtr + Aspirpitrga +++ + stimin = bet, (18.1b) 
Gm, 11 qe Am, rir T Am r+r+1 d Am nin = bm, 
By 2 Ups cat 2 U; Tr41;--- Zn free. 


In a more compact vector notation this problem becomes: 





T T 
OF: f(x)=cl x'4+c? x? = max! (18.2a) CT: Anx! + Appx? < bt, 
2 
Αχικὶ + Axx’ = b’, (18.2b) 
. I x! 7 0, x? free. 
Here, the following notations are used: 
Cy Cr+1 Tı Urq. 
C2 Cr+2 πο Ur+2 
c= C= ; x= | 1k =] oe s (18.2c) 
ος i dy Üa 
11 012 777 Qr Girl Q2 77^ O1 
21 ο "ΙΙ O2, O2, 7-1 Q2,n42 °° * G2,n 
Aq = é ᾿ > Aa» = i i ᾿ : > (18.2d) 
Qs 052 °t Asr Asr+1 Gsrt2°°* Asn 
ο ο 77^ Qs+1,r ο Astirt2 *** Astin 
1521 As+2,2 *** As+2,r 1521 As+2,r4+2 °° * As+2,n 
A» = 3 ᾿ ᾿ i ; Δ» = = , i ᾿ * (18.2e) 
Am,1 Gm2 '' Am, Am,r+1 Gm,r-2 ''' Amn 


3. Constraints 

with the inequality sign “ >” will have the above form if they are multiplied by (—1). 
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4. Minimum Problem 
A minimum problem f(x) = min! becomes an equivalent maximum problem by multiplying the ob- 
jective function by (—1) 

— f(x) = max! (18.3) 


5. Integer Programming 

Sometimes certain variables are restricted to be only integers. This discrete problem is not discussed 
here. 

6. Formulation with only Non-Negative Variables and Slack Variables 

In applying certain solution methods, only non-negative variables are considered, and constraints (18.1b), 


18.2b) given in equality form. 
( )8 q y OF: f(x) = c1£1 +-+: + Cnn = max! (18.4a) 
Every free variable x, must be decomposed into the 








difference of two non-negative variables 2, = vl — CT: ann cbe ar. — bs 

xz. The inequalities become equalities by adding : : : 
non-negative variables; they are called slack vari- Gmit1 amnia = 55, (18.4b) 
ables. That is, the problem is considered in the 

form as given in (18.4a,b), where n is the increased 21 2 0,...,2, 7 0. 

number of variables. In vector form: = > 

OF: f(x) 2 cTx = max! (18.5a) CT: Ax=b, x>0O. (18.5b) 


The relation m < n can be supposed, otherwise the system of equations contains linearly dependent 
or contradictory equations. 

7. Feasible Set 

The set of all vectors x satisfying constraints (18.2b) is called the feasible set of the original problem. 
If the free variables are rewritten as above, and every inequality of the form “<” into an equation as in 
(18.4a) and (18.4b), then the set of all non-negative vectors x > 0 satisfying the constraints is called 
the feasible set M: 


M —(xce R":x2 0, Ax—- b]. (18.6a) 
A point x* € M with the property 
f(x*) > f(x) for every x € M (18.6b) 


is called the maximum point or the solution point of the linear programming problem. Obviously, the 
components of x not belonging to slack variables form the solution of the original problem. 
18.1.1.2 Examples and Graphical Solutions 


1. Example of the Production of Two Products 
Suppose primary materials F4, R2, and Rg are needed to produce two products E; and E3. Scheme 








18.1 shows how many units of primary materials are needed to produce each unit of the products Ei 
and E», and there are given also the available amount of the primary materials. 

Selling one unit of the products Ej or E» results 

in 20 or 60 units of profit, respectively (PR). Scheme 18.1 | F3 / E; | H3 /E; | Ra / Ej 
Determine a production program which yields E 12 8 0 
maximum profit, if at least 10 units must be pro- 1 

duced from product E. E» 6 12 10 
Denoting by xı and xz the number of units pro- Amount 630 620 350 











duced from F and Es, the problem is: 


OF: f(x) = 202, + 6022 = max! 





CT: 12%, + 6x2 < 630, 
82, + 12%. < 620, 

1011 € 350, 

ry > 10. 
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Introducing the slack variables 3, v4, £5, xe, one gets: 











OF:  f(x)- 20-4 605 -- 0- z3 4- 0- x4 d- 0- 5 -- 0- 1g = max! 
CT: 12z1-F 6zx2-- T3 = 630, 
82, -- 1222 Ἔα. — 620, 

101. + Gd = 350, 

—2 + 25 = —10. 

2. Properties of a Linear Programming Problem x 
2 


On the basis of this example, some properties of the linear pro- 
gramming problem can be demonstrated by graphical representa- 
tion. Here the slack variables are not considered; only the original 35 
two variables are used. 

a) A line a1£1 +a2x2 = b divides the x1, £2 plane into two half-planes. 25 
The points (71, x2) satisfying the inequality ay71+a2a2 < bare in one 
of these half-planes. The graphical representation of this set of points 
in a Cartesian coordinate system can be made by a line, and the half- 
plane containing the solutions of the inequalities is denoted by an ar- 
row. The set of feasible solutions M, i.e., the set of points satisfying 
all inequalities is the intersection of these half-planes (Fig. 18.1). Figure 18.1 

In this example the points of M form a polygonal domain. It may happen that M is unbounded or 
empty. If more then two boundary lines go through a vertex of the polygon, this vertex is called a 
degenerate vertex (Fig. 18.2). 


0| ων τς t> 0 χι 


Figure 18.2 


0 40 X 





b) Every point in the x1, £2 plane satisfying the equality f(x) = 2021 + 6022 = cp is on one line, on 
the level line associated to the value cg. With different choices of co, a family of parallel lines is defined, 
on each of which the value of the objective function is constant. Geometrically, those points are the 
solutions of the programming problem, which belong to the feasible set M and also to the level line 
20a + 6022 = co with maximal value of co. In this example, the solution point is (21,72) = (25,35) 
on the line 202; + 6022 = 2600. The level lines are represented in Fig. 18.3, where the arrows point 
in the direction of increasing values of the objective function. 

Obviously, if the feasible set M is bounded, then there is at least one vertex such that the objective 
function takes its maximum. If the feasible set M is unbounded, it is possible that the objective function 
is unbounded, as well. 


18.1.2 Basic Notions of Linear Programming, Normal Form 
Now, the problem (18.5a,b) is considered with the feasible set M . 
18.1.2.1 Extreme Points and Basis 


1. Definition of the Extreme Point 
A point x € M is called an extreme point or vertex of M , if for all xj, x5 € M with x; z xy: 
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x #AX,+(1-A)x, 0«A«1, (18.7) 
i.e., x is not on any line segment connecting two different points of M. 
2. Theorem about Extreme Points 
The point x € M is an extreme point of M if the columns of matrix A associated to the positive 
components of x are linearly independent. 
If the rank of A is m, then the maximal number of independent columns in A is m. So, an extreme 
point can have at most m positive components. The other components, at least n — m, are equal to 
zero. In the usual case, there are exactly m positive components. If the number of positive components 
is less then m, it is called a degenerate extreme point. 





Ῥ 47 (42,0) 


Figure 18.3 Figure 18.4 
3. Basis 


To every extreme point m linearly independent column vectors of the matrix A can be assigned, the 
columns belonging to the positive components. This system of linearly independent column vectors is 
called the basis of the extreme point. Usually, exactly one basis belongs to every extreme point. However 
: . TL nga 
several bases can be assigned to a degenerate extreme point. There are at most (7) possibilities to 
choose m linearly independent vectors from n columns of A. Consequently, the number of different 


bases, and therefore the number of different extreme points is ie Jj If M is not empty, then M has at 


least one extreme point. CT: zı + πο z342 1, 
£ <2, 
è E κ Aq, —omaxl T2 S 4, 

W OF: /(x) — 2a) + 3x2 + 473 = max! e +: Dae < 2, (18.8) 
2zi — 3x9 4 223 «X 2. 





The feasible set M determined by the constraints is represented in Fig. 18.4. Introduction of slack 
variables 4, x5, xg, 17 leads to: 


CT: πι Ἑ ta + 23 - T4 zm 
22 + T5 EN 

-ᾱι + 2x3 + T6 = 2, 

20, — 3x9 + 223 +247 =2. 


The extreme point P, = (0, 1,0) of the polyhedron corresponds to the point x = (21, 00, 3, x4, 15, t, 
£7) = (0, 1,0,0, 1, 2, 5) of the extended system. The columns 2, 5, 6 and 7 of A form the corresponding 
basis. The degenerated extreme point P; corresponds to (1, 0,0,0,2,3,0). A basis of this extreme point 
contains the columns 1, 5,6 and one of the columns 2, 4 or 7. 
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Remark: Here, the first inequality was a “>” inequality and a4 was not added but subtracted. Fre- 
quently these types of additional variables both with a negative sign and a corresponding b; > 0 are 
called surplus variables, rather than slack variables. As will be seen in 18.1.3.3, p. 916, the occurrence 
of surplus variables requires additional effort in the solution procedure. 

4. Extreme Point with a Maximal Value of the Objective Function 

Theorem: If M is not empty, and the objective function f(x) = c?x is bounded from above on M, 
then there is at least one extreme point of M where it has its maximum. 

A linear programming problem can be solved by determining at least one of the extreme points with 
maximum value of the objective function. Usually, the number of extreme points of M is very large 
in practical problems, so a method is needed by which the solution can be found in a reasonable time. 
Such a method is the simpler method which is also called the simplex algorithm or simplex procedure. 


18.1.2.2 Normal Form of the Linear Programming Problem 


1. Normal Form and Basic Solution 
The linear programming problem (18.4a,b) can always be transformed to the following form with a 
suitable renumbering of the variables: 





OF: f(x) = ety + +++ + Cp—-m¥n—m + Co = max! (18.9a) 
CT: G11Ti Tct OnacmÜaom d Xa-mal = bh, 
Am 121 +++ + Gmjn-mTn-m + In = bm, (18.9b) 
τν ὃς πι n mb yz 0. 


The last m columns of the coefficient matrix are obviously independent, and they form a basis. The 
basic solution (x1,32,... Vn m; Vn-m4ti- n) — (0,...,0,0,,..., 04,) can be determined directly 
from the system of equations, but if b 7 0 does not hold, it is not a feasible solution. 

If b 7 0, then (18.9a,b) is called a normal form or canonical form of the linear programming problem. 
In this case, the basic solution is a feasible solution, as well, i.e., x > 0, and it is an extreme point of M. 
The variables 21, ..., x4 44, are called non-basic variables and x, 4444, .. ., 4 are called basic variables. 
The objective function has the value co at this extreme point, since the non-basic variables are equal 
to zero. 

2. Determination of the Normal Form 

If an extreme point of M is known, then a normal form of the linear programming problem (18.5a,b) 
can be obtained in the following way. A basis is chosen from the columns of A corresponding to the 
extreme point. Usually, these columns are determined by the positive components of the extreme point. 
Suppose the basic variables are collected into the vector xg and the non-basic variables are in xy. The 
columns associated to the basis form the basis matrix Ag, the other columns form the matrix Ay. 
Then, 


Ax — AyXy o ApXg — b. (18.10) 
The matrix Ag is non-singular and it has an inverse A, the so-called basis inverse. Multiplying 


(18.10) by Ag! and changing the objective function according to the non-basic variables results in the 
canonical form of the linear programming problem: 


OF: f(x) =cyxn +00, (18.1 1a) 
CT: AglAyxy-Xg — Agb with xy>0, xp>0. (18.11b) 





Remark: If the original system (18.1b) has only constraints of type “<” and simultaneously b 2 0, 
then the extended system (18.4b) contains no surplus variables (sce 18.1.2.1, p. 911). In this case a 
normal form is immediately known. Selecting all slack variables as basic variables xg the result is 
Ap — Land xg — b and xy — 0 is a feasible extreme point. 
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W In the above example x — (0, 1,0,0, 1,2, 5) is an extreme point. Consequently: 
1000 11-1 
1100 RAPIT 00 0 
Ag-| 0010]|' Ag- , Aw-|-12 ο]! (18122) 
-3001 Ce 22 0 
πώ νο. ας 3001 ΝΡ 
πο 25 16 T7 X1 X3 X4 
11-41 
sii i f 
Aš An= |-1 2 ο), 9 (18.129) 
5.5 —3 E 
5 
Ti X3 T4 
P+ fo + X4&— σα =], 
-ᾱι — απ T4 Ts Ξ1, 
-ᾱῃ + 2x3 + 26 =2, 3 (18.13) 
5x1 + 5x3 — 3x4 + £7 = 5. 





From f(x) = 2x1 + 3x2 + 4x3 the transformed objective function 
f(x) = -t1 + £3 + 3x4 +3 (18.14) 
is obtained, if the triple of the first constraint is subtracted. 


18.1.3 Simplex Method 
18.1.3.1 Simplex Tableau 


The simplex method is used to produce a sequence of extreme points of the feasible set with increasing 
values of the objective function. The transition to the new extreme point is performed starting from the 
normal form corresponding to the given extreme point, and arriving at the normal form corresponding 
to the new extreme point. In order to get a clear arrangement, and easier numerical performance, the 
normal form (18.9a,b) is represented in the simplex tableau (Scheme 18.2a, 18.2b): 




















Scheme 18.2a Scheme 18.2b 
Ti ttt Xwoam or briefly XN 
Tn—m4il|011 ^^^ Oln—-m by Xp|An| b 
: f ; c o 
Tn Gm *'* Gm,n—m bm 
C1 +t Cn-m |— C0 








The k-th row of the tableau corresponds to the constraint 


Ln—m+k + Gk1L1 + +++ + Qkn-m8n-m = bk. (18.15a) 
The objective function is 
Ci31 d 7] T Cn mUn-m — f(X) — co. (18.15b) 


From this simplex tableau, the extreme point (Xy, X5) — (0, b) can befound. The value ofthe objective 
function at this point is f(x) = co. To put down —c into the right below vertex of the tableau is 
advantageous for carrying out the simplex method. In every tableau always exactly one of the following 
three cases can be found: 

a)c; €0,j — L...,n — m: The tableau is optimal. The point (xy, X5) — (0, b) is a maximal point. 
If all the c; are positive, then this vertex is the only maximal point. 

b) There exists at least one j such that cj; 0 and ajj < 0, 4 — 1,...,m: The linear programming 
problem has no solution, since the objective function is not bounded on the feasible set; for increasing 


18.1 Linear Programming 915 





values of x; it increases without a bound. 

c) For every j with cj > 0 there exists at least one i with aj; > 0: It is possible to move from the 
extreme point x to a neighboring extreme point x with f(x) > f(x). In the case of a non-degenerate 
extreme point x, the “>” sign always holds. 


18.1.3.2 Transition to the New Simplex Tableau 


1. Non-Degenerate Case 
If a tableau is not in final form (case c)), then a new tableau (Scheme 18.3) is determined. A basic 
variable a, and a non-basic variable x, are interchanged by the following calculations: 


1 
a) dng = —. (18.16a) 
apq 
b) õpi = apj ρα, {χα By = Dp» Ging. (18.16b) 
C) üig — —dig * pg, iz p, €q = —Cq* pg. (18.166) 
ἆ) ἄμτ- αι Γαμ τν {προ 
b; =D + bp Gig, ὑπέρ, čj = cj + apj -Č § #4, —Co = —Co + bp - Gq. (18.164) 


The element Gpq is called the pivot element , the p-th row is the pivot row, and the q-th column is the 
pivot column . For the choice of a pivot element the following two requirements are to be considered: 
a) Cp > co should hold; 


b) the new tableau must also correspond to a feasible solution, i.e., b > 0 must hold. 
Then, (žy, g) — (0, b) is a new extreme point at which the value of the objective function f(X) = & 


is not smaller than it was previously. These conditions are satisfied if the pivot element is chosen in the 
following way: 

a) To increase the value of the objective function, a column with c, > 0 can be chosen for a pivot 
column; 

b) to get a feasible solution, the pivot row must be chosen as 


b — min {5}. (18.17) 


Ώρα 1<i<m | Gig 
Qig>0 
If the extreme points of the feasible set are not degenerate, then the simplex method terminates in a 
finite number of steps (case a) or case b)). 
Wb The normal form in 18.1.2, p. 911ff can be written in a simplex tableau (Scheme 18.4a). This 


tableau is not optimal, since the objective function has a positive coefficient in the third column. The 
third column is assigned as the pivot column (the second column could also be taken under considera- 


























tion). The quotients b;/ajq are calculated with every positive element of the pivot column (there is only 
one of them). The quotients are denoted behind the last column. The smallest quotient determines the 
pivot row. 
Scheme 18.3 Scheme 18.4a Scheme 18.4b 
XN X1 X3 T4 πι T3 T5 
Šg Anl b T2 1 1-1] 1 z2 0 0 1| 2 
ὅ |- ἄν 25 —1-1 1) 1 1:1 t4 |—1—1 1f 1 
2169, —1 2 0 2 z$|-1 2 0 2 2 
v7 5 5-3] 5 t7| 2 2 3] 8 
—1 1 3 -3 2 4-3 |-6 














If it is not unique, then the extreme point corresponding to the new tableau is degenerate. After per- 
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forming the steps of (18.16a)-(18.16d) the tableau in Scheme 18.4b is obtained. This tableau deter- 
mines the extreme point (0, 2, 0, 1,0, 2, 8), which corresponds to the point P; in Fig. 18.4. Since this 
new tableau is still not optimal, zg and x3 are interchanged (Scheme 18.4c). The extreme point of the 
third tableau corresponds to the point P5 in Fig. 18.4. After an additional change an optimal tableau 
is obtained (Scheme 18.4d) with the maximal point x* — (2, 2, 2,5,0,0, 0), which corresponds to the 
































point P5, and the objective function has a maximal value here: f(x*) — 18. 
Scheme 18.4c Scheme 18.4d Scheme 18.5 
ay Te T5 X7 Te T5 X1 Tn 
πο. 0 0 1 2 Z2 UNE E 2 Vp dp c 0a bi 
3 1 1 5 . ] f 
TA4|—- = 1 2 T. = = 5 
n A 2 ae 2 2 3 Um Am, `° Amn bm 
1 1 1 1 1 
en es 1 2 2 S OF, c CG 0 
23 5 5 0 £3 6 32 2 x 1 — - 
T 3. —]1-'3 6 6:3 OF* aji = Qjn| >. b; = —g(0, b 
| oe |e ee a ue ee 9,9) 
4—2 -3 |-10 : ; 
—=-=-7 |-18 
3 3 








2. Degenerate Case 

If the next pivot element cannot be chosen uniquely in a simplex tableau, then the new tableau rep- 
resents a degenerate extreme point. A degenerate extreme point can be interpreted geometrically as 
the coincident vertices of the convex polyhedron of the feasible solutions. There are several bases for 
such a vertex. In this case, it can therefore happen that some steps are performed without reaching a 
new extreme point. It is also possible that one gets a tableau that has occurred already before, so an 
infinite cycle may occur. 

In the case of a degenerate extreme point, one possibility is to perturb the constants b; by adding εἶ 
(with a suitable et > 0) such that the resulting extreme points are no longer degenerate. The solution 
can be got from the solution of the perturbed problem, if ¢ = 0 is substituted. 

If the pivot column is chosen “randomly” in the non-uniquely determined case, then the occurrence of 
an infinite cycle is unlikely in practical cases. 


18.1.3.3 Determination of an Initial Simplex Tableau 
1. Secondary Program, Artificial Variables 


If there are equalities among the original constraints (18.1b) or inequalities with negative b;, then it is 
not easy to find a feasible solution to start the simplex method. In this case, one starts with a secondary 
program to produce a feasible solution, which can be a starting point for a simplex procedure for the 
original problem. The system Ax = b is modified by multiplying some of the equations with (—1) in 
order to satisfy the condition b > 0. Now, an artificial variable yy 2 0 (k — 1,2,..., m) is added to 
every left-hand side of Ax — b with b — 0, and the secondary program is considered: 








OF*: g(x,y) = —y1 — +++ — Ym = max! (18.18a) 
Qiii c: G1aXs Tg = hh, 
OTE apan einat πο (18.18b) 


Ty, Ug Z0; νετ νι πι 3 0. 
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For this problem, the variables yj, ... , Ym are basic variables, and one can start the first simplex tableau 
(Scheme 18.5). The last row of the tableau contains the sums of the coefficients of the non-basic 
variables , and these sums are the coefficients of the new secondary objective function OF*. Obviously, 
g(x,y) < 0 always. If g(x*,y*) = 0 for a maximal point (x*, y*) of the secondary problem, then 
obviously y* = 0, and consequently x* is a solution of Ax — b. If g(x*, y*) « 0, then Ax = b does not 
have any solution. z 

2. Solution of the Secondary Program 

Our goal is to eliminate the artificial variables from the basis. A scheme is prepared not only for the 
secondary program separately. The original tableau is completed by columns of the artificial variables 
and the row of the secondary objective function. The secondary objective function now contains the 
sums of the corresponding coefficients from the rows corresponding to the equalities, as shown below. 
If an artificial variable becomes a non-basic variable, its column can be omitted, since it will be never 
chosen again as a basis variable. If a maximal point (x*, y*) is determined, then two cases are distin- 
guished: ΙΙ 

1. g(x*, y*) < 0: The system Ax = b has no solution, the linear programming problem does not have 
any feasible solution. 

2. g(x*, y*) — 0: If there are no artificial variables among the basic variables, this tableau is an ini- 
tial tableau for the original problem. Otherwise all artificial variables among the basic variables are 
removed by additional steps of the simplex method. 

By introducing the artificial variables, the size of the problem can be increased considerably. It is not 
necessary to introduce artificial variables for every equation. If the system of constraints before in- 
troducing the slack and surplus variables (see Remark in 18.1.2, 3., p. 913) has the form Aix 7 bj, 
A3x = by, A3x € bs with by, by, bs > 0, then artificial variables must be introduced only for the first 
two systems. For the third system the slack variables can be chosen as basic variables. 

W In the example of 18.1.2, p. 912, only the first equation requires an artificial variable: 





OF: g(xy)- =y = max! 
CT*: Tı + T2 + T3 — T4 +Y = 1, 
T2 + T5 E 
-ῇ + 223 + 26 = 72, 
921 -- 380 -Ἴ- 214 + a7 = 2. 
The tableau (Scheme 18.6b) is optimal with g(x*, y*) = 0. After omitting the second column the 
first tableau of the original problem is obtained. B 




















Scheme 18.6a Scheme 18.6b 

πα. 2X3 T4 πι Yı v3 T4 | 1 
ση 1 11-1 i 19 1 1 1-1) 1 
25 0100 2 2:1 25 —1-1-1 1, 1 
vw |-1 02 0) 2 26 -1 0 2 0} 2 
Y 2-3 2 0] 2 2τ 5.3 5-3 5 
OF 2.34 01/0 OF |-1-3 1 3 [-3 
OF*} 1 1 1-1/1 OF* 0-1 0 0) 0 














18.1.3.4 Revised Simplex Method 
1. Revised Simplex Tableau 


Suppose the linear programming problem is given in normal form: 


OF: f(&) 9 ex +--+ + Cn-mEn-m + co = max! (18.19a) 
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CT: 04101 στο Oim-min-m + Tn-m+1 = ĥi, 











Om,T1 +-+ + Amn—mEn—m + tn = Bm, (18.19b) 
2340,28 7 0. 
Obviously, the coefficient vectors a, m4; (¢=1,...,n) are the i-th unit vectors. 
In order to change into another normal form and therefore to reach another extreme point, it is suffi- 
cient to multiply the system of equations (18.19b) by the corresponding basis inverse. (Recall the fact 
that if Ag denotes a new basis, then the coordinates of a vector x can be expressed in this new basis 
as Ag!'x. If the inverse of the new basis is known, then any column as well as the objective function 
from the very first tableau can be got by simple multiplication.) The simplex method can be modified 
so that only the basis inverse is determined in every step instead of a new tableau. From every tableau 
only those elements are calculated which are Scheme 18.7 
required to find the new pivot element. If 
the number of variables is considerably larger 
than the number of constraints (n > 3m), z? αι πα +? Gin | by | rm 
then the revised simplex method requires 
considerably less computing cost and there- 
fore has better accuracy. T 
The general form of a revised simplex tableau 
is shown in Scheme 18.7. 
The quantities of the scheme have the following meaning: 


1 στ Ln—m Un—m+1 CC Tn Tq 





EE 


Üm,n—m4l ''' Qm; bm Tm 





Cy ++" Ca—m Cu—m4l ^77 Cn το | Cq 














xy,..., xv}: Actual basic variables (in the first step the same as z5 45441: : t4). 
C1,...,Cs : Coefficients of the objective function (the coefficients associated to the basic variables 
are zeros). 


b1,...,6m : Right-hand side of the actual normal form. 
co : Value of the objective function at the extreme point (zP,... zB) — (by, . . bs). 


Gin—m4l "^C Aln - x 
At= ἃ . . Actual basis inverse, where the columns of A* are the columns of 
` En-m+1;-- - , Zn corresponding to the actual normal form; 


Am, n-m+1 ``’ Amn 


r= (r1,..-, 1m)": Actual pivot column. 


2. Revised Simplex Step 


a) The tableau is not optimal when at least one of the coefficients c; (j — 1,2,...,n) is positiv. A 
pivot column q is chosen for a c, 7 0. 

b) One calculates the pivot column r by multiplying the q-th column of the original coefficient matrix 
(18.19b) by A* and introduces the new vector as the last vector of the tableau. 

The pivot row k is determined in the same way as in the simplex algorithm (18.17). 

c) The new tableau is calculated by the pivoting step (18.16a-d), where aj, is formally replaced by r; 
and the indices are restricted for n — m -- 1X j € n. The column F is omitted. r, becomes a basic 
variable. For j = 1,...,n — m, the results are 6j — cj 4- aT6, where € — (C; a1, . .., 65), and aj is 
the j-th column of the coefficient matrix of (18.19b). 


W Consider the normal form of the example in 18.1.2, p. 912. One wants to bring 24 into the basis. 
The corresponding pivot column r = ay is placed into the last column of the tableau (Scheme 18.84) 
(initially A* is the unit matrix). 

For j = 1,3, 4 one gets čj = cj — 3@2;: (c1, C3, C4) = (2, 4, 0). 

The determined extreme point x — (0, 2,0, 1, 0, 2, 8) corresponds to the point P; in Fig. 18.4, p. 912. 
The next pivot column can be chosen for j — 3 — q. 
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Scheme 18.8a Scheme 18.8b 
11 23 X4 [X9 X5 X6 X7 1. £1 £3 T4 19 T5 v6 v7 x3 
29 1000|1|-1 T2 1 10 0/2) 0 
αρ 0100/1|1 1:1 X4 0 1001-1 
Te 0010/2] 0 Tg 0010/2|2 
στ 0 0 0 1|5 l-83 X7 0 301|8| 32 
—113/[|000 0/|-3| 3 2 4—3| 0—3 0 0|-6| 4 
The vector r is determined by 
1100 1 0 
ΡΝ ΜΕΡΗ 
0301 5 2 


and it is placed into the very last column of the second tableau (Scheme 18.8b). One proceedes 
as above analogously to the method shown in 18.1.3.2, p. 915. If one wants to return to the original 
method, then the matrix of the original columns of the non-basic variables must be multiplied by A* 
and only these columns will be kept. 


18.1.3.5 Duality in Linear Programming 


1. Correspondence 


To any linear programming problem (primal problem) an other unique linear programming problem 
can be assigned (dual problem): 


Primal problem Dual problem 
OF: f(x)-cTx,-- cIx, — max! (18.20a) ΟΕ”: g(u)- bTu,--bTu, — min! (18.214) 
CT: ΑγιΧι T A13X2 Eb; CT*: Alu, T AJ,» 20, 


AoiX, + Ao2X9 = bo, Ev 1 
x; 20, X» free. (18.20b) 








T 
Γ A2202 = Co, 
u; > 0, Uy free. (18.21b) 
The coefficients of the objective function of one of the problems form the right-hand side vector of the 
constraints of the other problem. Every free variable corresponds to an equation, and every variable 
with restricted sign corresponds to an inequality of the other problem. 
2. Duality Theorems 
a) If both problems have feasible solutions, i.e., M 7^ 0, M* 4 0 (where M and M* denote the feasible 
sets of the primal and dual problems respectively), then 
f(x)zg(u) forall x e M, ue M*, (18.22a) 
and both problems have optimal solutions. 
b) The points x € M and u € M* are optimal solutions for the corresponding problem, if and only if 
f(x) = gu). (18.22b) 


c) If f(x) has no upper bound on M or g(u) has no lower bound on M*, then M* — Ø or M = Í, i.e., 
the dual problem has no feasible solution. 


d) The points x € M and u € M* are optimal points of the corresponding problems if and only if: 


uy (Aix) + A12X2 — bı) =0 and xi (AT + AJ us —c,)=0. (18.22c) 
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Using these last equations, a solution x of the primal problem can be found from a non-degenerate 
optimal solution u of the dual problem by solving the following linear system of equations: 


AÀ34X, -- À33X5 — b; — 0, (18.23a) 
(AiiXi + Aj 2X9 = bi)i Ξ0 for Ui > 0, (18.23b) 
z; 20. for (Aj,u, - A210; — οι), 70. (18.23c) 


The dual problem also can be solved by the simplex method. 


3. Application of the Dual Problem 

Working with the dual problem may have some advantages in the following cases: 

a) If it is simple to find a normal form for the dual problem, one switches from the primal problem to 
the dual. 

b) If the primal problem has a large number of constraints compared to the number of variables, then 
the revised simplex method can be used for the dual problem. 


W Consider the original problem of the example of 18.1.2, p. 912. 





Primal problem Dual problem 
OF: f(x) = 221 + 3x9 + 473 = max! ΟΕ”: g(u)-— —u; 4 2ua 4- 2u3 4- 2u4 — min! 
CT: -ο - Όρο z3X-1l, CT*: -ωι — ug + 2u, > 2, 
21 < 2, —u, + ue — 3u4 > 3, 
—1i +243 < 2, —u, + 2ug + 2u4 2 4, 
211 -- 910 + 243 32, ty, tg, Ug, Ua > 0. 
11,29,23 2 0. 


If the dual problem is solved by the simplex method after introducing the slack variables, then the 
optimal solution u* = (u1, u2, u3, u4) = (0,7,2/3,4/3) with g(u) = 18 is got. A solution x* of the 
primal problem can be got by solving the system (Ax — b); — 0 for u; 7 0,i.e., x9 — 2, —74 + 2x3 = 2, 
24 — 313 4- 23 — 2, therefore: x* — (2,2,2) with f(x) = 18. 


18.1.4 Special Linear Programming Problems 


18.1.4.1 Transportation Problem 


1. Modeling 

A certain product, produced by m producers Æ1, E2, . . . , Em in quantities a1, a2, . . . , am, is to be trans- 
ported to n consumers V4, Vo, ... , V, with demands 51, b, . .. , 54. Transportation cost of a unit product 
of producer E; to consumer V; is cij. The amount of the product transported from E; to V; is £i; units. 
An optimal transportation plan is to be determined with minimum total transportation cost. The 
system is supposed to be balanced, i.e., supply equals demand: 


Sa; = bj. (18.24) 
i j=l 


The matrix of costs C and the distribution matrix X are constructed: 


E: 233 
c- Cup c s V EA Xilccc Tia Y 01 
τ : ,(18.25a) X= : i fou (18.25b) 
Cm, ''* Cmn Em Um °° * Tmn Am 
V, ess 5: bye b, 


If condition (18.24) is not fulfilled, then two cases are distinguished: 

a) If Sa; > bj, then a fictitious consumer V,,41 is introduced with demand b,41 = a; — 5; 5; and 
with transportation costs Ci n41 = 0. 

b) If 55a; « X25; then introduce a fictitious producer Em+1 is introduced with capacity @m41 = 
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Z bj — Sa; and with transportation costs Gn41 — 0. 
In order to determine an optimal program, the following programming problem should be solved: 


OF: f(X) = S00 ejxij = min! (18.26a) 


i=1 j= 


~ 


CT: Yiaj=a @=1,...,.m), Yoay=b; G=l,...,n), αὐ 5 0). 
i=l 


j=1 


8.26b) 


The minimum of the problem occurs at a vertex of the feasible set. There are m + n — 1 linearly 
independent constraints among the m + n original constraints, so, in the non-degenerate case, the 
solution contains m +n — 1 positive components x;;. To determine an optimal solution the following 


algorithm is used, which is called the transportation algorithm. 
2. Determination of a Basic Feasible Solution 
With the Northwest corner rule an initial basic feasible solution can be determined: 
a) Choose 21, = min(ay, by). 
b) If a4 € bi, the first row of X is omitted. 
If ai > bı, the first column of X is omitted. 
If a, = 5i, either the first row or the first remaining column of X is omitted. 


(18.272) 
(18.27b) 
(18.216) 
(18.274) 








If there are only one row but several columns, then one column is cancelled. The same applies for the 


rows. 


c) a1 is replaced by ay — xy, and by, by b; — xy and the procedure in repeated in the left upper vertex 


of the reduced distribution matrix X. 


The variables obtained in step a) are the basic variables, all the others are non-basic variables with zero 


values. 
m E: M: 
0.9 4 T FA X11 31,9 21.3 T14 Qj — 9 
Ce í 8211 | Ez, X= 4321 Φρα 23 724 ) az = 10. 
9263 Es 431 132 %33 34 a3 = 3 
V: γι V3 V Va X b, = 4b, = 6 b3 =5 bg =7 


The determination of an initial extreme point with the Northwest corner rule gives 


first step second step further steps 
4 9 5 45 — 50 4 5 — 0 
x=( 10 , x-( j 10 , x-( m 109 40. 
| Ja "m ο... 
4657 0657 0 pt 
0 1 103 
0 


There are alternative methods to find an initial basic solution which also takes the transportation costs 
into consideration (see, e.g., the Vogel approximation method in [18.13]) and they usually result in a 


better initial solution. 


3. Solution of the Transportation Problem with the Simplex Method 


If the usual simplex tableau is prepared for this problem, then it results in a huge tableau ((m-+n) x (m- 
n)) with a large number of zeros: In each column, only two elements are equal to 1. So, areduced tableau 
is constructed, and the following steps correspond to the simplex steps working only with the non-zero 
elements of the theoretical simplex tableau. The matrix of the cost data contains the coefficients of 
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the objective function. The basic variables are exchanged for non-basic variables iteratively, while the 
corresponding elements of the cost matrix are modified in each step. The procedure is explained by an 
example. 
a) Determination of the modified cost matrix C from C by 

Gy —ejtp dq (6—15..;ym,j-1l..-m) (18.284) 
with the conditions 

Cjj — 0 for (i,j) if mij is an actual basic variable. (18.28b) 
The elements of C belonging to basic variables are marked and p; — 0 is substituted. The other 
quantities p; and q;, also called potentials or simplex multiplicators, are determined so that the sum of 
Pi, qj and the marked costs c;; should be 0: 














(5) (3) 2 T pi —0 0 0 05 
C= 8 23 Q0 @) [|p-1 = 6-|40 00 ]|. (18.280) 
9 2 6 (3) / p —--1 342/30 
qı = —5 Q = —3 93 = —2 q = —2 
b) The value 
Gq = main iy} (18.28d) 


must be determined. If ¢,, > 0, then the given distribution X is optimal; otherwise 2p, is chosen as a 
new basic variable. In our example: &j; — 632 — —2. 


c) In C, Č and the costs associated to the basic variables are marked. If C contains rows or columns 
with at most one marked element, then these rows or columns will be omitted. This procedure is 
repeated with the remaining matrix, until no further cancellation is possible. 





(18.28e) 
(0) 
d) The elements z;; associated to the remaining marked elements č; form a cycle. The new basic 
variable čą is to be set to a positive value ô. The other variables 2;; associated to the marked elements 


Čij are determined by the constraints. In practice, ô is subtracted and added from or to every second 
element of the cycle. To keep the variables non-negative, the amount ô must be chosen as 


Ô = Trs = min{ Tij: Žij = Tij — Ô}, (18.28f) 
where x; will be the non-basic variable. In the example 6 = min{1, 3} = 1. 
p» 
4.5 9 


x» 46 5 T7 


Then, this procedure is repeated with X — X. 


(5) (3) 2 T Ρι 5-0 (0) (0) (-2) 3 
ο- δ 2 (1) (1) po=3  —. C-| 6 2 (0) (0) |, (1828h) 
9 (2) 6 (3) p =1 5 (0 3 (0) 


qı = —5 Q = —3 q3 = —4 q4 = —4 
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4 432 

x= 4 5—6 € 5-46 zš X gen , Jf(X)- a9. (18.281) 

1 
l4ó — 2-δ 

The next matrix C does not contain any negative element. So, X is an optimal solution. 


18.1.4.2 Assignment Problem 

The representation is made by an example. 

E ~ shipping contracts should be given to n shipping companies so that each company receives exactly 
one contract. The assignment has to be determined which minimizes the total costs, if the i-th company 
charges c;; for the j-th contract. 

An assignment problem is a special transportation problem with m =n and a; = b; = 1 for all i, j: 


OF: f(x) = Y cri; ^ min! (18.294) 
i=1 j=1 
CT: Soa =1 (§=1,...,n), So ay =1 G=1,...,n), zij € {0, 1}. (18.29b) 
ja i-l 


Every feasible distribution matrix contains exactly one 1 in every row and every column, all other 
elements are equal to zero. In a general transportation problem of this dimension, however, a non- 
degenerate basic solution would have 2n — 1 positive variables. Thus, basic feasible solutions to the 
assignment problem are highly degenerate, with n — 1 basic variables equal to zero. Starting with 
a feasible distribution matrix X, the assignment problem can be solved by the general transportation 
algorithm. It is time consuming to do so. However, because of the highly degenerate nature of the basic 
feasible solutions, the assignment problem can be solved with the highly efficient Hungarian method 
(see [18.9]). 


18.1.4.3 Distribution Problem 

The problem is represented by an example. 

W 7 products Ej, F2,..., Em should be produced in quantities a1, a2,...,@m. Every product can be 
produced on any of n machines M1, Mo,..., M,. The production of a unit of product E; on machine 
Mi; needs processing time bj; and cost c;;. 'The time capacity of machine Mj is bj. Denote the quantity 
produced by machine M; from product F; by xij. The total production costs should be minimized. 
This distribution problem has the following general model: 


OF: /f(x)— Y cri; ^ min! (18.304) 


i=1 j=1 


m n 
CT: Mug =a; @=1,...;m), Msg €b; (j—L...,n) «ij 20 forall ij. (18.30b) 
j=l ici 
The distribution problem is a generalization of the transportation problem and it can be solved by 
the simplex method. If all b;; — 1, then the more effective transportation algorithm can be used (see 
18.1.4.1, p. 921) after introducing a fictitious product Ej,,4 (see 18.1.4.1, p. 920). 


18.1.4.4 Travelling Salesman 

Suppose there are n places O;, O2,...,On. The travelling time from O; to O; is cj. Here, cj; A cj is 
possible. 

One wants to determine the shortest route such that the traveller passes through every place exactly 
once, and returns to the starting point. 

Similarly to the assignment problem, exactly one element is chosen in every row and column of the time 
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matrix C so that the sum of the chosen elements is minimal. The difficulty of the numerical solution of 
this problem is the restriction that the marked elements cj; should be arranged in order of the following 
form: 

Cinta Cindia e Gimin With dy di for kd and i444 — dj. (18.31) 
The travelling salesman problem can be solved by the branch and bound methods. 


18.1.4.5 Scheduling Problem 


n different products are processed on m different machines in a product-dependent order. At any time 
only one product can be processed on a machine. The processing time of each product on each machine 
is assumed to be known. Waiting times, when a given product is not in process, and machine idle times 
are also possible. 

An optimal scheduling of the processing jobs is determined where the objective function is selected 
as the time when all jobs are finished, or the total waiting time of jobs, or total machine idle time. 
Sometimes the sum of the finishing times for all jobs is chosen as the objective function when no waiting 
time or idle time is allowed. 


18.2 Non-linear Optimization 
18.2.1 Formulation of the Problem, Theoretical Basis 
18.2.1.1 Formulation of the Problem 


1. Non-linear Optimization Problem 
A non-linear optimization problem has the general form 


f(x) = min! subjectto xc" with (18.32a) 

gi(x) € 0, ie€I-(L..,mh hj;(x)20, j€J-Í[Ll...r) (18.32b) 
where at least one of the functions f, gi, h; is non-linear. The set of feasible solutions is denoted by 

M-ÍxcR^: g(x €0, iel, hj(x)=0, j € J}. (18.33) 


The problem is to determine the minimum points. 
2. Minimum Points 
A point x* € M is called the global minimum point if f(x*) < f(x) holds for every x € M. If this 
relation holds for only the points x of a neighborhood U of x*, then x* is called a local minimum point. 
Since the equality constraints h;(x) = 0 can be expressed by two inequalities, 
—h;(x) <0, hj(x) <0, (18.34) 
it can be supposed that the set J is empty, J = Ø. 
18.2.1.2 Optimality Conditions 
1. Special Directions 
a) The Cone of the Feasible Directions at x € M is defined by 
Z(x) - (de R^: Ja>0: x+adeEM, 0<a<a}, κΧΕΜ, (18.35) 
where the directions are denoted by d. If d € Z(x), then every point of the ray x + ad belongs ο ΛΙ 
for sufficient small values of a. 
b) A Descent Direction at a point x is a vector d € R” for which there exists an @ > 0 such that 
f(x+ad) < f(x) Va € (0,a). (18.36) 
There exists no feasible descent direction at a minimum point. 
If f is differentiable, then d is a descent direction when Vf(x)Td < 0. Here, V denotes the nabla 
operator, so V f(x) represents the gradient of the scalar-valued function f at x. 
2. Necessary Optimality Conditions 
If f is differentiable and x* is a local minimum point, then 


Vf(x')Td»0 forevery de Z(x). (18.37a) 
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In particular, if x* is an interior point of M, then 
V f(x") = 9. (18.37b) 


3. Lagrange Function and Saddle Point 
Optimality conditions (18.37a,b) should be transformed into a more practical form including the con- 
straints. The so-called Lagrange function or Lagrangian is constructed: 


L(x, u) = f(x) + J ug) = f(x) + ug), xe R, ue RY, (18.38) 
ici 
according to the Lagrange multiplier method (see 6.2.5.6, p. 456) for problems with equality constraints. 
A point (x*, u*) € R" x RT is called a saddle point of L, if 
L(x*,u) < L(x*,u*) < L(x,u") for every x € R”, u € RT. (18.39) 


4. Global Kuhn-Tucker Conditions 

A point x* € R” satisfies the global Kuhn- Tucker conditions if there is an u* € RY, i.e., u* > 0 such 
that (x*, u*) is a saddle point of L. 

For the proof of the Kuhn-Tucker conditions see 12.5.6, p. 683. 

5. Sufficient Optimality Condition 

If (x*, u*) € R" x RT is a saddle point of L, then x* is a global minimum point of (18.32a,b). 

If the functions f and g; are differentiable, then local optimality conditions can be deduced. 

6. Local Kuhn-Tucker Conditions 

A point x* € M satisfies the local Kuhn-Tucker conditions if there are numbers u; 7 0, i € Jo(x*) such 
that 


-Vf(x)- M) uVg(x), where (18.404) 
i€Io(x*) 
Ip(x) = {i € {1,...,m} : φι(κ) -- 0} (18.40b) 


is the index set of the active constraints at x. The point x* is also called a Kuhn-Tucker stationary 
point. 


This means geometrically that a point x* € 
M satisfies the local Kuhn-Tucker condi- 
tions, if the negative gradient —V f(x*) lies 
in the cone spanned by the gradients V g;(x*) 
i € lo(x*) of the constraints active at x* 
(Fig. 18.5). 

The following equivalent formulation for 
(18.40a,b) is also often used: x* € R” satis- 
fies the local Kuhn-Tucker conditions, if there 
isau* € R7 such that 





g(x") € 0, (18.414) 
ugi(x*) =0, i=1,...,m, (18.41b) level lines 
m f(x) = const 
Vf(x*) + 3 uVgi(x*) = 0. (18.41c) 
i=l 


Figure 18.5 


7. Necessary Optimality Conditions and Kuhn-Tucker Conditions 

Ifx* € M isa local minimum point of (18.32a,b) and the feasible set satisfies the regularity condition at 
x*: Jd € R" such that Vgi(x*)"d « 0 for every i € Io(x*), then x* satisfies the local Kuhn-Tucker 
conditions. 





926 18. Optimization 





18.2.1.3 Duality in Optimization 
1. Dual Problem 


With the associated Lagrangian (18.38) the maximum problem is formed, the so-called dual of (18.32a,b): 


L(x,u) = max! subject to (x,u)€ M* with (18.42a) 
M* = {(x,u) © R" x RP: L(x,u) = min L(z,u)}. (18.42b) 
zcR^* 


2. Duality Theorems 

If x, € M and (x5, u5) € M*, then 

a) L(x»,u5) < f(x). 

b) If L(x3, u5) — f (xi), then x, is a minimum point of (18.32a,b) and (x5, ug) is a maximum point of 
(18.42a,b). 


18.2.2 Special Non-linear Optimization Problems 
18.2.2.1 Convex Optimization 


1. Convex Problem 
The optimization problem 

f(x) = min! subjectto g(x) <0 (i=1,... m) (18.43) 
is called a convex problem if the functions f and g; are convex. In particular, f and g; can be linear 
functions. The following statements are valid for convex problems: 
a) Every local minimum of f over M is also a global minimum. 
b) If M is not empty and bounded, then there exists at least one solution of (18.43). 
c) If f is strictly convex, then there is at most one solution of (18.43). 
1. Optimality Conditions 
a) If f has continuous partial derivatives, then x* € M is a solution of (18.43), if 

(x —x*)™Vf(x*) >0 forevery xe M. (18.44) 
b) The Slater condition is a regularity condition for the feasible set M. It is satisfied if there exists an 
x € M such that g;(x) « 0 for every non-affine linear functions g;. 
c) If the Slater condition is satisfied, then x* is a minimum point of (18.43) if and only if there exists a 
u* > 0 such that (x*, u*) is a saddle point of the Lagrangian. Moreover, if functions f and g; are dif- 
ferentiable, then x* is a solution of (18.43) if and only if x* satisfies the local Kuhn- Tucker conditions. 
d) The dual problem (18.42a,b) can be formulated easily for a convex optimization problem with dif- 
ferentiable functions f and g;: 

L(x,u) = max!, subjectto (x,u)c M* with (18.45a) 

M* = {(x,u) © R" x RP : VxL(x,u) = 0}. (18.45b) 

The gradient of L is calculated here only with respect to x. 
e) For convex optimization problems, the strong duality theorem also holds: 
If M satisfies the Slater condition and if x* € M is a solution of (18.43), then there exists a u* € RT, 
such that (x*, u*) is a solution of the dual problem (18.45a,b), and 

f(x") = min f(x) = max L(x,u)- L(x*,u*). (18.46) 

xeM M* 


(x,u)e 


18.2.2.2 Quadratic Optimization 


1. Formulation of the Problem 
Quadratic optimization problems have the form 
f(x) = x Ox 1- p'x = min!, subjectto xe MCR" with (18.47a) 


M=M,: M={xeR": Ax<b, x>0}. (18.47b) 
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Here, C is a symmetric (n, n) matrix, p € R^, A is an (m, n) matrix, and b € R”. 
'The feasible set M can be written alternatively in the following way: 
M=Mr: M-íx: Ax-b, x20] (18.484) 
M =M: M={x : Ax< b}. (18.48b) 
2. Lagrangian and Kuhn-Tucker Conditions 
The Lagrangian to the problem (18.47a,b) is 





L(x,u) = x"Cx + p™x + u! (Ax — b). (18.49) 

By introducing the notation 
OL i OL 

t= BOETA u and Y=- ^*tb (18.50) 

the Kuhn-Tucker conditions are as follows: 
Case I: Case II: Case III: 

a) Ax+y=b, a) Ax=b, a) Ax+y=b, (18.51a) 
b) 2€x-v- ATu= =p; b) 2Cx-v+ ATu = —p, b) 2Cx+ ATu- —p, (18.51b) 
ϱ κ2θ,ν20,Υ20,420, 9 κ20,ν20, c) u>0,y>0, (18.516) 
Ὁ) κἲν «ντα --0. d) x™v=0. d) γτα--0 (18.51d) 


3. Convexity 

The function f(x) is convex (strictly convex) if and only if the matrix C is positive semidefinite (posi- 
tive definite). Every result on convex optimization problems can be used for quadratic problems with 
a positive semidefinite matrix C; in particular, the Slater condition always holds, so it is necessary 
and sufficient for the optimality of a point x* that there exists a point (x*,y, u,v), which satisfies the 
corresponding system of local Kuhn-Tucker conditions. B 

4. Dual Problem 

If C is positive definite, then the dual problem (18.45a,b) of (18.47a,b) can be expressed explicitly: 


L(x,u) = max!, subjectto (x,u)c M*, where (18.52a) 
1 
M* - (&u) e Rx R? : x - -;C "(A'u« p)). (18.52b) 
1 
If the expression x — -3€ (Au + p) is substituted into the dual objective function L(x, u), then 


the equivalent problem is 


1 1 T 1 
φ(α) -- - ΑΟ Αα — (5407p + b) u— 19ο Ῥ — max, u>0. (18.53) 
Hence: If x* € M is a solution of (18.47a,b), then (18.53) has a solution u* 7 0, and 
f) = ία". (18.54) 
Problem (18.53) can be replaced by an equivalent formulation: 
y(u) = uEu + hu = min!, subjectto u>0 where (18.55a) 


1 1 
E- 10€ ^ and h= 5AC p+ b. (18.55b) 
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18.2.8 Solution Methods for Quadratic Optimization Problems 
18.2.3.1 Wolfe's Method 


1. Formulation of the Problem and Solution Principle 
'The method of Wolfe is to solve quadratic problems of the special form: 

f(x) = x™Cx + p'x — min!, subjectto Ax=b, x>0. (18.56) 
C is supposed to be positive definite. The basic idea is the determination of a solution (x*, u*, v*) of 
the corresponding system of Kuhn-Tucker conditions, associated to problem (18.56): 


Ax — b, (18.572) 

2€x — v t ATu — —p, (18.57b) 
x20, v20; (18.57c) 

xTv — 0. (18.58) 


Relations (18.57a,b,c) represent a linear equation system with m -- n equations and 2n 4- rn variables. 
Because of relation (18.58), either x; = 0 or v; = 0 (¢ = 1,2,...,n) must hold. Therefore, every solution 
of (18.57a,b,c), (18.58) contains at most m+n non-zero components. Hence, it must be a basic solution 
of (18.57a,b,c). 

2. Solution Process 

First, a feasible basic solution (vertex) X of the system Ax = b is determined. The indices belonging 
to the basis variables of X form the set Ig. In order to find a solution of system (18.57a,b,c), which also 
satisfies (18.58), the problem is formulated as 


-—y=min!, (we R); (18.59) 
Ax — b, (18.604) 
2Cx —v + ATu — uq -— —p with q=2Cx+p, (18.60b) 
x20,v20, 20; (18.60c) 
xTv — 0. (18.61) 

If (x, v, u, μ) is a solution of this problem also satisfying (18.57a,b,c) and (18.58), then pz = 0. 
The vector (x, v, u, p) — (X, 0,0, 1) is a known feasible solution of the system (18.60a,b,c), and it 


satisfies the relation (18.61), too. A basis associated to this basic solution is formed from the columns 
of the coefficient matrix 
| A000 | I denotes the unit matrix, 0 the zero matrix and 0 


2C -I AT E is the zero vector of the corresponding dimension, (18.62) 


in the following way: 

a) m columns belonging to x; with i € Ip, 

b) n — m columns belonging to v; with i € Ig, 

c) all m columns belonging to ui, 

d) the last column, but then a suitable column determined in b) or c) will be dropped. 

If q = 0, then the interchange according to d) is not possible. Then x is already a solution. 

Now, a first simplex tableau can be constructed. The minimization of the objective function is per- 
formed by the simplex method with an additional rule that guarantees that the relation xTv = 0 is 
satisfied: 

The variables x; and v; (i — 1,2,...,n) must not be simultaneously basic variables. 

In the case of a positive definite C, considering this additional rule the simplex method provides a so- 
lution of problem (18.59), (18.60a,b,c), (18.61) satisfying 4. = 0. For a positive semi-definite matrix C, 
because of the restricted pivot choice, it may happen that although jz > 0, no more exchange-step can 
be made without violating the additional rules. In this case js cannot be reduced any further. 
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M f(x) = 2} + 403 — 10x) — 3222 =min! with αι +2%2.+%3=7, 2a1+a2+%4=8. 


1000 -10 
ποιο (7 | [040 0 _ { -32 
αρα be €-là 9$ 9|* P5| 0 

0000 0 


In this case C is positive semi-definite. A feasible basic solution of Ax = b is X — (0,0, 7, 8). q= 


2CX + p = (—10,—32, 0,0)". The choices for the basis vectors are: a) columns 3 and 4 of G 


0 
b) columns 1 and 2 of ( i à c) the columns of ( av) and d) column ( 3 


us i instead of the first 


0 : Se 
column of ( I The basis matrix is formed from 
































these columns, and the basis inverse is calculated Scheme 18.9 
(see 18.1, p. 909). Multiplying matrix (18.62) and a ὃν m a d 
{ο .. 

the vectors -p by the basis inverse, the first x3 1 9 0 0 0 7 
simplex tableau (Scheme 18.9) is obtained. "e 2 1 t ? η g 
Only x; can be interchanged with v» in this tableau 09 64 B. rm 32 12 54 0 
according to the complementary constraints. Af- 10 10 10 10 
ter a few steps, we get the solution x* = m 0 0 0. = t 0 0 
(2, 5/2,0, 3/2)" is obtained. The last two equations ug 0 0 0 0-1 0 
of 2Cx— v-- ATu— uq Pare: v3 = Uy, V4 = Ug. u 2 0 E. 1 2 1 
Therefore, by eliminating u; and uz the πα... 10 10 10 10 
of the problem can be diced: 9 i 1 9 i 
18.2.3.2 Hildreth-d'Esopo Method 10 10 10 101. 
1. Principle 
The strictly convex optimization problem 

f(x) =x"Cx + px = min!, Ax<b (18.63) 
has the dual problem (see 1., p. 926) 

v(u)- uTEu--h'u-— min! u>0 with (18.644) 

1 1 
E- 14€ A, h= JAC p +b. (18.64b) 
Matrix E is positive definite and it has positive diagonal elements ej; ^ 0, (i — 1,2,...,m). The 
variables x and u satisfy the following relation: 
1 
x= -3€ (Αα +p). (18.65) 


2. Solution by Iteration 

The dual problem (18.64a), which contains only the condition u > 0, can be solved by the following 
simple iteration method: 

a) Substitute u! > 0, (e.g., ut = 0), k = 1. 

b) Calculate uF* for i — 1,2,..., m according to 


we --2(E eju m ig 3 εις JL (18.664) ut max (0, ως} . . (18.66b) 
€i ^ii 


j=i+1 
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c) Repeat step b) with k -- 1 instead of k until a stopping rule is satisfied, e.g., 
€ » 0. 

Under the assumption that there is an x such that Ax < b, the sequence {7)(u*)} converges to the 
minimum value Umin and sequence {x*} given by (18.65) converges to the solution x* of the original 
problem. The sequence {u*} is not always convergent. 





(ut) — w(u*)| « ε, 


18.2.4 Numerical Search Procedures 

By using non-linear optimization procedures acceptable approximate solutions can be found with rea- 
sonable computing costs for several types of optimization problems. They are based on the principle 
of comparison of function values. 


18.2.4.1 One-Dimensional Search 


Several optimization methods contain the subproblem of finding the minimum of a real function f(x) 
for x € [a,b]. It is often sufficient to find an approximation zT of the minimum point z*. 

1. Formulation of the Problem 

A function f(x), x € R, is called unimodal in [a,b] if it has exactly one local minimum point on every 
closed subinterval J C [a,b]. Let f be a unimodal function on [a, b] and z* the global minimum point. 
Then an interval [c, d] C [a, b] should be found with z* € [c, d] such that d — c « €, € > 0. 

2. Uniform Search 


IE . b—a 
A positive integer n is chosen such that ô = 
n 


1* T and the values f(2*) for së = a + kô (k = 





1,...,n) are calculated. If f(x) is the smallest value among these function values, then the minimum 
point x* is in the interval [r — ô, x + 6]. The number of required function values for the given accuracy 
can be estimated by 
2(b—a 
n> ca 1. (18.67) 
ε 

3. Golden Section Method, Fibonacci Method 
The interval [a,b] = [a1, b1] will be reduced step by step so that the new subinterval always contains 
the minimum point z*. The points A1, jj are determined in the interval [a1, b1] as 


àı = a + (1— T)(bı — a1), pi a1 7T(b— a1) with (18.68a. 
1 
T= αν — 1) z 0.618. (18.68b 
This corresponds to the golden section. Two cases are distinguished: 
a) If f(A1) < f(m), then az = a1, b2 = pı and u3 = A, are substituted. (18.69a. 
b) I£. f(À1) 2 f(ui) , then a — Ay, be = by and A — ji are substituted. (18.69b 


If by — ag > e, then the procedure is repeated with the interval [a2, 05], where one value is already 
known, f (Az) in case a) and f (y) in case b), from the first step. To determine an interval [an, bn}, which 
contains the minimum point z*, altogether n function values are calculated. From the requirement 

€ > b, — a, — 7-1 (b — a1) (18.70 
the necessary number of steps n can be estimated. 
By using the golden section method, at most one more function value should be determined compared to 
the Fibonacci method. Instead of subdividing the interval according to the golden section, the interva 
is subdivided according to the Fibonacci numbers (see 5.4.1.5, p. 375, and 17.3.2.4, 4., p. 908). 





18.2.4.2 Minimum Search in n-Dimensional Euclidean Vector Space 


The search for an approximation of the minimum point x* of the problem f(x) = min!, x € R”, can 
be reduced to the solution of a sequence of one-dimensional optimization problems. 
One takes 


a) x=x!, k=1, where x! is an appropriate initial approximation of x*. (18.71a) 
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b) The one-dimensional problems 


plar) = f(t... a Ti + Op, St ..., 28) = min! with œ ER (18.71b) 


are solved for r= — 1,2,...,n. If à, is an exact or approximating minimum point of the r-th problem, 
then t+! = xk + a, are al 
c) If two consecutive approximations are close enough to each other, i.e., with some vector norm, 


[x —x"|| «e or. |f) — f(x) « es, (18.719) 


then x**! is an approximation of x*. Otherwise step b) is repeated with k + 1 instead of k. The one- 
dimensional problem in b) can be solved, by using the methods given in 18.2.4.1, p. 930. 


18.2.5 Methods for Unconstrained Problems 
The general optimization problem 

f(x) = min! fot xc RR" (18.72) 
is considered with a continuously differentiable function f. Each method described in this section con- 
structs, in general, an infinite sequence of points (x^) € IR", whose accumulation point is a stationary 
point. The sequence of points will be determined starting with a point x! € IR" and according to the 


formula 
x 








=x*+a,d* (k=1,2,...), (18.73) 
i.e., first a direction d^ € R” is determined at x* and the step size a, € R indicates how far x*+! 
from x* in the direction d*. Such a method is called a descent method, if 

J(xF9) « f(x*) (k-12,...). (18.74) 


The equality V f(x) = 0, where V is the nabla operator (see 13.2.6.1, p. 715), characterizes a stationary 
point and can be used as a stopping rule for the iteration method. 


18.2.5.1 Method of Steepest Descent 
Starting from an actual point x", the direction d* in which the function has its steepest descent is 


d* =—-Vf(x*) (18.75a) and consequently x*t! = x* — a,V f(x*). (18.75b) 


A schematic representation of the steepest descent method with level lines f(x) — f(x!) is shown in 
Fig. 18.6. 


1 The step size ax is determined by a line search, i.e., 
f(x)=f(x ) ap is the solution of the one-dimensional problem: 
f(x*-- ad^) 2 min!, a> 0. (18.76) 
This problem can be solved by the methods given 
in 18.2.4, p. 930. 

The steepest descent method (18.75b) converges 
. relatively slowly. For every accumulation point x* 
level lines — of the sequence (x^), V f(x) — 0. In the case of a 
quadratic objective function, i.e., f(x) 2 xT Ox 4- 





Vf(c) 


Figure 18.6 px, the method has the special form: 
ἀνα’ 
xH = x" + αμα’ (18.77a) with d* = —(2Cx* +p) and ak = m. (18.77b) 


18.2.5.2 Application of the Newton Method 


Suppose that at the actual approximation point x* the function f is approximated by a quadratic 
function: 


q(x) — f(x*) -- (x - x)" V f(x) - 2x x5) HG )(x — x4). (18.78) 
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Here H(x*) is the Hessian matrix, i.e., the matrix of second partial derivatives of f at the point x*. If 
H(x*) is positive definite, then q(x) has an absolute minimum at x**! with Vq(x**!) — 0, therefore 
one gets the Newton method: 

x" lxk-H'(x9)vf(x') (k-12,.), ie, (18.792) 

dë = -H !(x*Vf(x*) and a, in (18.73). (18.79b) 

'The Newton method converges fast but it has the following disadvantages: 
a) The matrix H(x^) must be positive definite. 
b) The method converges only for sufficiently good initial points. 
c) The step size can not influenced. 
d) The method is not a descent method. 


e) The computational cost of computing the inverse of H^! (x^) is fairly high. 
Some of these disadvantages can be reduced by the following version of the damped Newton method (see 
also 19.2.2.2, p. 962): 

x" = x* a, Hl (x*)V f(x") (k-152,..). (18.80) 
The relaxation factor a, can be determined, for example, by the principle given earlier (see 18.2.5.1, 
p. 931). 
18.2.5.3 Conjugate Gradient Methods 


Two vectors d!, d? € R” are called conjugate vectors with respect to a symmetric, positive definite 
matrix C, if 

d' "cg? — o. (18.81 
Ifd',d?,...,d” are pairwise conjugate vectors with respect to a matrix C, then the convex quadratic 
problem g(x) = x?Cx + p?x, x € R”, can be solved in n steps if a sequence x**4 = x* + a,d* 
starting from x? is constructed, where ag is the optimal step size. Under the assumption that f(x 

1 

is approximately quadratic in the neighborhood of x*, i.e., C & zH), the method developed for 


quadratic objective functions can also be applied for more general functions f(x), without the explici 
use of the matrix H(x*). 
The conjugate gradient method has the following steps: 


a) x! c R^, d' 2 -Vf(x)), (18.82 
where x! is an appropriate initial approximation for x*. 


b) x*'*tz-zx*.4 apd" (k=1,...,n) with ak > 0 so that f(x" + ad*) will be minimized. (18.83a 





d+! = -V f(x") + udă (k =1,...,n— 1) with (18.83b 
k+1yT k+1 
Uk = Vile) Vile) and d" — —V f(x"), (18.83c 
V f(x*) V f(x") 
c) Repeating steps b) with x"*! and d"*! instead of x! and d!. 


18.2.5.4 Method of Davidon, Fletcher and Powell (DFP) 
With the DFP method, a sequence of points starting from x! € RR" is determined according to the 
formula 

xhtl — x* _ a, M,Vf(x*) (k= 1,2,...). (18.84) 
Here, Mj is a symmetric, positive definite matrix. The idea of the method is a stepwise approximation 
of the inverse Hessian matrix by matrices M; in the case when f(x) is a quadratic function. Starting 
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with a symmetric, positive definite matrix Mj, e.g., Mı = I (I is the unit matrix), the matrix Mx is 
determined from Mj. , by adding a correction matrix of rank two 











yhysT (Μι ιν) (Μι ιν) 
Mi =Ma + E νη (18.85) 
with vë = x" — x*-! and w* — V f(x*) — V f(x*-!) (k — 2,3,...). The step size aj is obtained from 
f(x * — aM,Vf(x*) — min!, o7 0. (18.86) 


If f(x) is a quadratic function, then the DFP method becomes the conjugate gradient method with 
1-1. 


18.2.6 Evolution Strategies 
18.2.6.1 Evolution Principles 


Evolution strategies are examples of stochastic optimization processes imitating natural evolution. 
They are based on the principles of mutation, recombination and selection. 

1. Mutation 

From a parent point x p a offspring (descendant) x9 = x p+dis formed by applying a random variation 
d. The components of d are (0, 02) normally distributed random variables Z(0, 02) determined newly 
at every mutation: 


di Ζ(0,σ1) Ζ(0,1) σι 
d; Z(0,0% Ζ(0, 1): σ. 

ᾱ-- ? — m 2) — ( J 2 (18.87) 
dn Z(0, on) Z(0, 1) - ex 


With a normally distributed d small changes have high probabilities while large changes occur very 
rarely. The changes are controlled by the standard deviation o;. 

2. Recombination 

From the population of j; parents offspring can be obtained by mixing the information from two or more 
parents, which are randomly selected. The recombination can follow two types of changes. 

At intermediate recombination a offspring becomes as weighted average of o randomly chosen parents: 














e e 
Xo =} aiXp,, Soa =1, 2<o<p. (18.88) 
ici ici 
At a discrete recombination of o parents the i-th component of a offspring xo is determined by the i-th 
component of a randomly chosen parent: 


dob. elle. $91, (18.89) 


3. Selection 

By using mutation and recombination, a set of offspring is formed randomly. In a subsequent selection 
process the objective function f(x) serves as a measure to compare the fitness of the individuals. The 
fittest individuals are selected for the next generation. At certain strategies only the offspring take part 
in the selection. Other strategies consider also the parents (see also [18.12]). 


18.2.6.2 Evolution Algorithms 


Every evolution strategy is based on the following algorithm: 

a) Determination of an appropriate starting population consisting of jj individuals. These are the 
first generation of parents. Xp = (X&,..., X5) 

b) In the k-th step the creation of A offspring X § = {x8 Los xb.) by mutation and recombination 


of parents of the actual generation X$ = (x &,..., X5. 
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c) Application of selection to get the best p individuals for the next parent generation X$! = 


IER orap J 

d) Repeating steps b) and c) until stopping rule is satisfied. It can be fulfilling an optimal criterium of 
theoptimization problem, or to reach a given number of generations, or exceeding a given computer 
time, etc. 


18.2.6.3 Classification of Evolution Strategies 


Every evolution strategy is characterized by a sequence of parameters. Essential parameters are the size 
of the population ju, the number of the offspring A, the number of parents o taking part in recombination 
and rules of making mutation, recombination and selection. To distinguish different types of strategies 
a special notation is commonly used. For the strategies using only mutation in producing offspring 
the (u + A), or (u, A) strategy notation is used. Strategies (u + A) and (u, A) differ from each other in 
the type of selection. At strategy (ji À) the selection of the new generation is made only among the 
offspring, while at strategy (u + ) the parents are also involved. 

For strategies using recombination the number 9 of the parents, which are involved, is seen in the 
notation (4/0 -- A)- and (gu o, A)-strategy. 


18.2.6.4 Generating Random Numbers 


For the numerical evaluation of evolution procedures uniformly and normally distributed random vari- 
ables are needed. Values of uniformly distributed variables can be got by the methods given in subchap- 
ter 16.3.5.2, p. 843. Normally distributed random variables can be produced from uniform variables in 
the following way: 

Box-Muller Method: If G; and G» are uniformly distributed random numbers in the interval [0, 1], 
then the following two equations give two statistically independent normally distributed (0, 0?) random 
numbers Z(0, 6?) and Z»(0, o?): 


Z,(0,07) 2 o4 —21n G4 cos(21G3) and  Z3(0,0?) 2 e —21n Gi sin(21G3) . (18.90) 


18.2.6.5 Application of Evolution Strategies 


In the practice optimization problems have usually high complexity. Here the conventional optimiza- 
tion processes described in 18.2.5, p. 931 are often not appropriate. Evolution strategies belong to the 
differentiation-free solution methods, which are based on comparisons of the values of the objective 
function. They have simple conditions on the structure of the objective function. The objective func- 
tion does not need to be differentiable or continuous. So the evolution strategies are appropriate for a 
wide spectrum of optimization problems. 

The application of evolution strategies is not restricted to unconstrained continuous optimization prob- 
lems. Optimization problems with constraints can also be handled, where the constraints are enforced 
by penalty terms in the objective function (see Penalty and Barrier Methods in 18.2.8, p. 940). 
Another field of application is the discrete optimization, where some or all components of x can take 
their values from a discrete set. One possible mutation mechanism is to replace the value of a discrete 
component by one of its neighboring values with the same probability. 


18.2.6.6 (1+ 1)-Mutation-Selection Strategy 
This method is similar to the gradient method discussed in 18.2.5, p. 931 with the difference that the 


direction d* is a normally distributed random vector. The population consists of a single individual 
which produces one offspring at every generation. 


1. Mutation Step 
In generation Kk a offspring is obtained from a parent by adding a normally distributed random vector: 


xb — x5 t ad*. (18.91) 
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The factor a is a parameter by which the speed of the convergence can be affected. a is considered as 
the step size of the mutation. 


2. Selection Step 
The new parent of the next generation (Kk 4- 1) is selected by comparing the objective function values 
of both individuals, i.e., from the parent with the formula: 


κα s i£ f(x5) « f(x). (18.92) 


xb otherwise. 


The procedure stops if no better offspring arrives over a given number of generations. The step size a 
can be increased if the mutation results mostly in improved offspring. At small improvements the value 
of a should be decreased. 


3. Step Size Control 

The choice of the mutation step size a is of important influence to the convergence properties of the 
evolution method. While large step sizes are recommended in order to have fast convergence, small step 
size is required in the close neighborhood of the optimum or in regions of fast changing or oscillating 
of the objective function. The optimal step size depends on the problem. Too small steps lead to 
stagnation, too large steps may result in overshooting of the evolution process. 

1. 1/5-Success rule: The ratio of the number of successful mutations and the total number of 
mutations in the last step defines the rate of success q. If q 7 1/5, then the step size can be increased. 
For smaller success rate, a should be decreased: 


1 
cO qeu. 
Ok — 11 1 with c=0,8...0,85. (18.93) 


e “Ok, q> 5 
2. Mutative Step Size Determination: The rule of 1/5 is a rough choice, and considering any 
concrete problem it is not always satisfactory. In an extended model the step size a and the standard 
deviations o;, i = 1,2...,n are in correlation. Here o and g; are multiplied with equal probability by 
one of the factors c, 1, 1/c, where c — 1,1...1,5. Further information see [18.12]. 


18.2.6.7 Population Strategies 


The (1+1) strategy presented in the preceding paragraph reflects the principles of the natural evolu- 
tion only in a very simplified form. With the extension to population models further properties of the 
evolution process can be considered. A large number of individuals in an evolution process assures that 
different regions of the solution space will be searched. 

1. (w+A)-Evolution Strategy 

The (u + à) strategy is a generalization of the (1 + 1) strategy. From the u parents of the cur- 
rent generation XE = {xb x} XP, } a set of A parents is chosen randomly with equal probability. 





Repeated choices are allowed und even necessary in the case if y < à. By mutation, À offspring 
X6 — (xBb,.....x5,) are produced. From the selection set X6 U XE the best u individuals are 
chosen to take over into the next generation. 

Since the parents are also taken in the selection, the quality of the population from a generation to 
the next one cannot be worse. The (u + A) strategy has the property that it keeps an already found 
local optimum, since large mutation steps, which are required to leave the optimal point, have very 
small probability. It means, that an individual can have an infinite life. This behavior can be avoided 
by adding penalty terms to the objective function values of parents that increase from generation to 
generation. In this way the aging of individuals can be simulated. 

2. (w,A)-Evolution Strategy 


In contrary to the (ju + A) strategy the selection step is now made among the A offspring, to choose the 
u individuals for the next generation, i.e., in this strategy the parents do not survive. Therefore \ > ju 
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must hold. The values of the objective function for the offspring can be larger than that for the parents. 
This procedure can depart from local optima. 

Selection Pressure: The ratio of the individuals taking part in the selection and the size of the pop- 
ulation defines the selection pressure S: 


λ 
BTA for (p 4- A)-strategy , 
8- κ᾿ with 1X S «oo. (18.94) 
= for (p, A)-strategy 
H 


If the selection pressure is close to 1, then the selection step has almost no impact. A large number of 
offspring \ > ji results in a strong selection pressure, since from the set of the present individuals only 
few will survive into the next generation. 

3. (uw/oa+A)- and (w/e, A)-Evolution Strategies with Recombination 

With the concept of recombination some relations are built between the individuals of a population, so 
the information of several parents are mixed in a offspring. 

In order to get a offspring, o parents are chosen from the set of parents X 5 with the same probability. 
It is assumed that for every member of the A offspring a separate choice of o parents is taken. 'The 
offspring is a discrete or an intermediate recombination of the chosen parents. The offspring produced 
in this way will be mutated and enters the selection process. 

In the previously described (u + A) and (u, À) strategies each individual is the result of a series of mu- 
tations applied to one individual of the first generation of parents. So, a wider evolution step is possible 
only through many generations. Wider evolution steps are possible with recombination. Especially, 
when the parents have large distances among each other, the offspring are formed with completely new 
properties. 

4. Evolution Strategies with more Populations 

The principles of evolution can be expanded formally to the dimension of populations. Instead of indi- 
viduals now populations are in competition. So, the evolution process has two steps. It is shown in the 
expanded notation: [u2/02 ® A2(u1/01 t à1)]. 

From a set of u2 parent populations a set of Àz offspring populations is created by recombination of o2 
populations that are chosen randomly for each offspring population. In these A2 offspring populations 
the optimization is performed using a (4/01 + A1) or (441/01, M) strategy. After a given number of 
generations the best populations are chosen based on an appropriate criterium. The comparison of 
populations can be done by considering the values of the objective function of the best individual or by 
the population mean. 


18.2.7 Gradient Method for Problems with Inequality 














Type Constraints 
If the problem 
f(x) = min! subject to the constraints g(x) <0 (i=1,...,m) (18.95) 
has to be solved by an iteration method of the type 
xP xk αμα" (k =1,2,...) (18.96) 


then two additional rules must be considered because of the bounded feasible set: 

1. The direction d* must be a feasible descent direction at x*. 

2. The step size a, must be determined so that x^^! is in M. 

The different methods based on the formula (18.96) differ from each other only in the construction of 
the direction d”. To ensure the feasibility of the sequence {x} C M, al, and a% are determined in the 
following way: 


o4, from f(x*+ad*) = min!, a>0 


18.2 Non-linear Optimization 937 





a! = max{a € R: x* +ad* € M}. (18.97) 
Then 

a, = min{ay, ay }. (18.98) 
If there is no feasible descent direction d* in a certain step k, then x* is a stationary point. 
18.2.7.1 Method of Feasible Directions 
1. Direction Search Program 


A feasible descent direction d* at point x* can be determined by the solution of the following optimiza- 
tion problem: 


o = min!, (18.99) 
Vg(x*) d < o, ic I(x), (18.100a) Vf(x*)'d«c,(18100b)  lld|| x1.  (18.100c) 


Ifo < 0 for the result d = d* of this direction search program, then (18.100a) ensures feasibility and 
(18.100b) ensures the descending property of d*. The feasible set for the direction search program is 
bounded by the normalizing condition (18.100c). If 7 = 0, then x* is a stationary point, since there is 
no feasible descent direction at x*. 

A direction search program, defined by (18.100a,b,c), can result in a zig-zag behavior of the sequence 
x^, which can be avoided if the index set Ip (x*) is replaced by the index set 


I(x") = {i € {1,... m}: —er < gi(x*) < 0}, k20 (18.101) 


which are the so-called €% active constraints in x*. Thus, local directions of descent are excluded 
which are going from x* and lead closer to the boundaries of M consisting of the e, active constraints 


(Fig. 18.7). 
81(x)=0 


i» 


8(x)=0 





Figure 18.7 
If o — 0 is a solution of (18.100a,b,c) after these modifications, then x* is a stationary point only if 


Io(x*) = Iz,(x*). Otherwise £j must be decreased and the direction search program must be repeated. 


2. Special Case of Linear Constraints 
If the functions g;(x) are linear, ie., gi(x) = 


a; T x — bi, then a simpler direction search method 

τ i can be established: 
Vx) Lag Vix’) Ls c — Vf(x*)' d 2 min! with (18.102) 
VAN τν aid «0, ie yx") or i € L, (x*), (18.1032) 
[|d]] € 1. (18.103b) 
a) b) The effect of the choice of different norms ||d|| = 


max{|d;|} < 1 or ||d|| = Vd7d < 1 is shown in 
Figure 18.8 Fig. 18.8a,b. 


In a certain sense, the best choice is the norm ||d|| — ||d||; — dd, since by the direction search 


program the direction d* is obtained, which forms the smallest angle with —V f (x^). In this case the 
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direction search program is not linear and requires higher computational costs. With the choice ||d|| = 
\|d||.. = max{|d;|} < 1 a system of linear constraints -1 € d; € 1, (4 — 1,...,n) is obtained, so the 
direction search program can be solved, e.g., by the simplex method. 

In order to ensure that the method of feasible directions for a quadratic optimization problem f(x) = 
x'Cx + p’x = min! with Ax < b results in a solution in finitely many steps, the direction search 
program is completed by the following conjugate requirements: If e 4 — o, , holds in a step, i.e., x* 
is an "interior" point, then the condition 


T 
d^! Cd-0 (18.104) 
is added to the direction search program. Furthermore the corresponding conditions are kept from the 
previous steps. Ifin a later step ap = a% then the condition (18.104) is removed. 


α f(x) =27 + 402 — 102, — 32%. = min! g(x) = —21 <0, go(x) = —a2 < 0, 
ga(x) — 214-2253 — 7 € 0, ga(x) — 211 4- 33 — 8 < 0. 


Step 1: Starting with x! — (3,0)*, V f(x!) — (—4, —32)7, Iy(x!) — (2). 
—4d, — 32d3 = min! 
—dz <0, ||dlJoo <1 


- ανν (αὶ) with C — (o "a 


Direction search program: { } = d=(1,1)". 


Minimizing constant: aj, = 


3d* Qa* 9. 8 
—g;(x* 1 2 

Maximal feasible step size: af = min { -40 for i such that a;d* > ο 4 = s. at = 3 => 

. [5 2 2 2 (5 =) 
ai = min {| =,=} = = c. 
: B^. oq" X39 

8. 8047 

Step 2 σῷ) -(--.-Ἕ). Ια) - {4} 


8 80 : F 

= re> αγ ! 1 152 

Direction search program: 3i 3 da = min! = g= (ο 1) =, oH 
2d, + dz <0, |\d|lo <1 


4 
Q = 3) x? = (3, 2). 


Step3: Vf(x*) - (-4,-16)", Io = (x*) = {3, 4}. 
Direction search program: 
—4d, — 16d = min! 3 
dy + 2d, <0, 2d, - d» € 0, ||d||;; <1 
1 T 5 T, 
(755) Q0 =1, of =3 = oF =1, xt = (2,2) ; / 
The next direction search program results ino = 0. Here 0 1 2 9 4 Χι 
the minimum point is x* — x* (Fig. 18.9). Figure 18.9 





18.2.7.2 Gradient Projection Method 


1. Formulation of the Problem and Solution Principle 

Suppose the convex optimization problem 

f(x) =min! with aj7x < bj, (18.105) 
fori = 1,...,m is given. A feasible descent direction d^ at the point x* € M is determined in the 
following way: 

If — V f (x^) is a feasible direction, then d* = —V f(x") is selected. Otherwise x^ is on the boundary 
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of M and —V f(x*) points outward from M. The vector — V f (x^) is projected by a linear mapping 
P; into a linear submanifold of the boundary of M defined by a subset of active constraints of x^. 
Fig. 18.10a shows a projection into an edge, Fig. 18.10b shows a projection into a face. Supposing 
non-degeneracy, i.e., if the vectors a;, i € Jo(x) are linearly independent for every x € R”, such a 
projection is given by 


d* = —P,Vf(x*) = — (1- AT(AkAsT) Ag) V f(x). (18.106) 


Here, A; consists of all vectors a;", whose corresponding constraints form the sub-manifold, into which 
—V f (xy) should be projected. 





Figure 18.10 


2. Algorithm 

The gradient projection method consists of the following steps, starting with x! € M and substituting 
k — 1 and proceeding in accordance to the following scheme: 

I: If — V f (x^) is a feasible direction, then d* — —V f (x^) is substituted, and continued with III. 
Otherwise A, is constructed from the vectors a; with i € Jo(x*) and continued with IT. 

II: d* — — (I- A,T(ALA;T) Az) V f (x^) is sunstituted. If d* 7 0, then continued with III. If 


d'—O0andu- —(A,A,7) AV f(x^) > 0, then x* is a minimum point. The local Kuhn-Tucker 


conditions —Vf(x*)= © wa;= Αρ ατο obviously satisfied. 

i€Io(x*) 
If u Z 0, then an i with u; « 0 is chosen, the i-th row from A, is deleted and II is repeated. 
III: Calculation of ap and x+! = x" + apd" and returning to I with k = k + 1. 


3. Remarks on the Algorithm 
If — V f (x*) is not feasible, then this vector is mapped onto the sub-manifold of the smallest dimension 


which contains x*. If d* = 0, then —V f(x*) is perpendicular to this sub-manifold. If u > 0 does 
not hold, then the dimension of the sub-manifold is increased by one by omitting one of the active 
constraints, so maybe d* 4 0 can occur (Fig. 18.10b) (with projection onto a (lateral) face). Since Ay 
is often obtained from A, by adding or erasing a row, the calculation of (ALAR!) can be simplified 
by the use of (Ak 1A s aT). 

W Solution of the problem of the previous example on p. 938. 

Step 1: x! = (3,0)7, 

I: Vf (x!) = (—4,-32)", —WV f(x!) is feasible, d+ = (4, 32)". 


aad : : : 1 5 16 84T 
III: The step size is determined as in the previous example: o1 — —, x^ — (= 5) 5 
Step 2: 
P 18  964T : 3 
I Vf) = (-=.-=) (not feasible), T(x?) = {4}, A2 = (21). 
5 


„l -2 :-( oy 
1t P=:( 7). @=(-g.e) κο 
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Ill: ag = 
Step 3: 


I: Vf (x3) = (—4, —16)? (not feasible), Jo(x*) — (3,4), A; = E i 


, = 8:2)". 


ο0!| σι 


28 8\T 
S-i) uz<0: Ag (13). 


- CH) 
515) © 
ΠΠ: as τρ 


) 
Step 4: 
I: Vf(x*) = (—6, —12)7 (not feasible), Io(x*) = {3}, Aa = Ag. 
I: P, 2 P3, d 2 (0,07, u262 0. 


It follows that x* is a minimum point. 


18.2.8 Penalty Function and Barrier Methods 

The basic principle of these methods is that a constrained optimization problem is transformed into 
a sequence of optimization problems without constraints by modifying the objective function. The 
modified problem can be solved, e.g., by the methods given in Section 18.2.5. With an appropriate 
construction of the modified objective functions, every accumulation point of the sequence of the solu- 
tion points of this modified problem is a solution of the original problem. 


18.2.8.1 Penalty Function Method 
The problem 


π:.Ῥ.-(ῃ 0). d-(.9^-( 


1/4 - 

i Ps=z(4, 7),a 
- 

x! = (2,5 
E 2 


f(x) = min! subject to gi(x) < 0 (i = 1,2,...,m) (18.107 
is replaced by the sequence of unconstrained problems 
A(x, pe) = f(x) + pS (x) = min! with x€ R”, pk >0 (k =1,2,...). (18.108 
Here, pp is a positive parameter, and for S(x) 
_f=0 xeM, 
ϑία) -{Ξῃ x d M, (18.109 


holds, i.e., leaving the feasible set M is punished with a "penalty" term py S(x). The problem (18.108 
is solved with a sequence of penalty parameters pp tending to oo. Then 


jim A(x, pe) = f(x), xe M. (18.110 
—oo 


If x* is the solution of the k-th penalty problem, then: 
A(x", pp) > H(t pra), F) > f), (18.111 

and every accumulation point x* of the sequence (x^) is a solution of (18.107). Ifx* € M, then x* i 

a solution of the original problem. 

For instance, the following functions are appropriate realizations of S(x): 


15 





Θ(κ) -- πιακ"{0, 91(X),---,9m(X)} (r =1,2,...) or (18.112a 
S(x) = 57 max" {0, g:(x)} τ io (18.112b 
i=l 





If functions f(x) and g;(x) are differentiable, then in the case r > 1, the penalty function H(x, pp 
is also differentiable on the boundary of M, so analytic solutions can be used to solve the auxiliary 
problem (18.108). 

Fig. 18.11 shows a representation of the penalty function method. 
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Hop) 








f(x) 





Figure 18.11 Figure 18.12 


E f(x) = x? + x} = min! for 7; +22 >1, H(x, py) = £? + £3 + pp max?{0, 1 — 24 — T2}. 
The necessary optimality condition is: 

_ (2z — 2p max{0, 1 — xı — ολλ _ /0 
VH(X, Pk) = n —2p,max(0,1— 214—293) / | NO: 
The gradient of H is evaluated here with respect to x. By subtracting the equations we have 1 — πο. 


Pk Νο get the 
1+ 2p, 





The equation 2x, — 2p, max{0, 1 — 21,) — 0 has a unique solution z£ — 25 — 





1 
solution rj — 25 — lim BE. os 3 by letting k — oo. 


k—oo 1 -- 2px 
18.2.8.2 Barrier Method 


A sequence of modified problems is considered in the form 

A(x, a) = f(x) + qyB(x) 2 min!, qx» 0. (18.113) 
The term q B(x) prevents the solution leaving the feasible set M, since the objective function increases 
unboundedly on approaching the boundary of M. The regularity condition 

M? 2 (x € M : gi(x) «0 (i—1,2,...,m)) ZÜ and M?» —M (18.114) 
must be satisfied, i.e., the interior of M must be non-empty and it is possible to get to any boundary 
point by approaching it from the interior, i.e., the closure of M? is M. 
The function B(x) is defined to be continuous on M. It increases to oo at the boundary of M. The 


modified problem (18.113) is solved by a sequence of barrier parameters gz tending to zero. For the 
solution x* of the k-th problem (18.113) 


FEŻ) < F(x), (18.115) 
holds and every accumulation point x* of the sequence {x*} is a solution of (18.107). 
Fig. 18.12 shows a representation of the barrier method. 
The functions, e.g., 
B(x) = —S2-In(-gi(x)), xe M°® or (18.116a) 
i=l 
BOs) = 3 — 
x)=) oF 
i=1 [-9i (x)] 


are appropriate realizations of B(x). 


(r=1,2,...), xeM? (18.116b) 
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E f(x) = r? + 23 = min! subject to z1 4- zo 2 1, H(x,qy) — a2 4 22 4 qk(— ln(z1 + z2 — 1)), 








231 — qk — — — 
tı — z9—1 : s 
212-29 2 1, VH (x, gy) = al = (3) , +22 > 1. The gradient of H is given 
2359 — qg —— ———— 
2 σι φαν--ἲ 
with respect to x. 
1 αν 
Subtracting the equations results in zı = 49, 23j — qu t ——— = 0, à > =, = αἲ- Tr Ee = 
2πι-- 1 2 2 4 
1 1 Ἱ "m 
0, zi» gui ak 2^ gtg% F +00, gp > 0: δν τσ 


The solutions of problems (18.108) and (18.113) at the k-th step do not depend on the solutions of the 
previous steps. The application of higher penalty or smaller barrier parameters often leads to conver- 
gence problems with numerical solutions of (18.108) and (18.113), e.g., in the method of (18.2.4), in 
particular, if no good initial approximation is available. Using the result of the k-th problem as the 
initial solution for the (k 4- 1)-th problem the convergence behavior can be improved. 


18.2.9 Cutting Plane Methods 


1. Formulation of the Problem and Principle of Solution 
Let consider the problem 


f(x)—-c'x-minl, ceR” (18.117) 
over the bounded region M C R" given by convex functions g;(x) (i = 1,2,...,m) in the form gi(x) < 
0. A problem with a non-linear but convex objective function f(x) is transformed into this form, if 

f(x)—zxna €0, «πμ ER (18.118) 
is considered as a further constraint and 

F(E) = £n = min! forall X= (X, £n41) € R**1 (18.119) 


is solved with g;(x) — g;(x) < 0. 
The basic idea of the method is the iterative linear approximation of M by a convex polyhedron in the 
neighborhood of the minimum point x*, and therefore 
the original program is reduced to a sequence of linear 
programming problems. 
First, a polyhedron is determined: 

P,={xeR": ax <b;, i=1,...,s}. (18.120) 
From the linear program 

f(x) =min! with xe P (18.121) 
an optimal extreme point xt of P} is determined with re- 
spect to f(x). If x! € M holds, then the optimal solu- 
Figure 18.13 tion of the original problem is found. Otherwise, a hy- 

perplane is determined: 





Hi; —Í[x: aj,47x 2 b,a, aj 4T! 7 0,44), which separates the point x! from M, so the new polyhe- 
dron contains 

P,- (xe D: ax E54). (18.122) 
Fig. 18.13 shows a schematic representation of the cutting plane method. 
2. Kelley Method 
The different methods differ from each other in the choice of the separating planes Hj. In the case of 
the Kelley method Hj is chosen in the following way: A jg is chosen so that 
gj, (x) ^ max(gi(x*) (i 5 1,...,m)). (18.123) 
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At the point x — x^, the function gj,(x) has the tangent plane 

E T 
T(x) = g(x") + (x — x") Vj, (x). (18.124) 
The hyperplane H, = {x € R^ : T(x) — 0] separates the point x* from all points x with gj, (x) € 0. 
So, for the (k + 1)-th linear program, T(x) € 0 is added as a further constraint. Every accumulation 
point x* of the sequence (x^) is a minimum point of the original problem. 
In practical applications this method shows a low speed of convergence. Furthermore, the number of 
constraints is always increasing. 


18.3 Discrete Dynamic Programming 


18.3.1 Discrete Dynamic Decision Models 

A wide class of optimization problems can be solved by the methods of dynamic programming. The 
problem is considered as a process proceeding naturally or formally in time, and it is controlled by 
time-dependent decisions. If the process can be decomposed into a finite or countably infinite number 
of steps, then it is called discrete dynamic programming, otherwise it is a continuous dynamic program- 
ming. In this section, only n-stage discrete processes are discussed. 

18.3.1.1 n-Stage Decision Processes 

An n-stage process P starts at stage 0 with an initial state x, = Xp and proceeds through the interme- 
diate states X;, X9, ..., X, , into a final state x,, — x, E€ Xe C R”. The state vectors x; are in the 
state space X; C RR". To drive a state x; , into the state x;, a decision u; is required. All possible 
decision vectors u; in the state x; ; form the decision space U;(x; ;) C R^. From x; ; the consecutive 
state x; can be obtained by the transformation (Fig. 18.14) 









































Xx;—gj(X;,uj, j-l1(l)n. (18.125) 
xm x x. XE Xn = Xe 
ce o eodd) Lu mou) L9. — 95 aix su) στον 

u,€ U,(xy) uE U,(x,) une Ux, i) 


Figure 18.14 
18.3.1.2 Dynamic Programming Problem 


Our goal is to determine a policy (u,,...,u,) which drives the process from the initial state x, into 
the state x, considering all constraints so that it minimizes an objective function or cost function 
f(fi(Xo, U1), +--+ fn(Xn-1,Un)). The functions f;(x; ,, u;) are called stage costs. The standard form 
of the dynamic programming problem is 


OF: — f(fi(xo1)..... fa, 1, u,)) — min! (18.1262) 


CT: X; — gj(Xj 1, Wy), j = 1(1)n, 
Xo = Xa, Xn = Xe E€ Xe, Xj E Xj C R”, j= tw. (18.126b) 
u; € Uj(x;.,) C R", j =1(1)n. 
The first type of constraints x; are called dynamic and the others xo, u; are called static. Similarly to 
(18.126a), a maximum problem can also be considered. A policy (u,,...,u,,) satisfying all constraints 


is called feasible. The methods of dynamic programming can be applied if the objective function satis- 
fies certain additional requirements (see 18.3.3, p. 944). 
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18.3.2 Examples of Discrete Decision Models 
18.3.2.1 Purchasing Problem 


In the j-th period of a time interval which can be divided into n periods, a workshop needs v; units 
of a certain primary material. The available amount of this material at the beginning of period j is 
denoted by c; 4, in particular, rg — x, is given. The amounts u; are to be determined for a unit price 
cj, which should be purchased by the workshop at the end of each period. The given storage capacity 
K must not be exceeded, i.e., vj-1 + uj < K. A purchase policy (u,..., Un) should be determined, 
which minimizes the total cost. This problem leads to the following dynamic programming problem: 


OF: f(u... Un) = Σ filu) = X cju; — min! (18.127a) 
4 = 
CT: Tj = Xjadcduj—u, j =1(1)n, 
t=%, Isa; <k, j — M1)n, (18.127b) 
U;(zj-1) 2 (uj: max(0,vj— zja]b X uj € K — xj i), j — M)n. 


In (18.1275) it is ensured that demands are satisfied and the storage capacity is not exceeded. If there is 

also a storage cost / per unit per period, then intermediate cost in the j-th period is (vj. 4- uj — v;/2)l, 
and the modified cost function is 
n 

F (x0, U1, . a1, un) — Do (cpus + (5-1 + uy — 05/2) - 1). (18.128) 


j= 


18.3.2.2 Knapsack Problem 


One has to select some of the items A;,...,A, with weights w1,...,W, and with values c1,..., Cp so 
that the total weight does not exceed a given bound W, and the total value is maximal. This problem 
does not depend on time. It will be reformulated in the following way: At every stage a decision uj 
about the selection of item A; is made. Here, u; = 1 holds if A; is selected, otherwise u; = 0. The 
capacity still available at the beginning of a stage is denoted by z;. ;, so the following dynamic problem 
arises: 


OF:  f(u...,u4) 2 Y cju; — max! (18.1292) 
j=l 
CT: tj = Tj—1 — WjUj, j — 1(1)n, 
ας W, 0zxz; XW, j= πι 
r € (0,1), falls xj; 17 > μι (18.129b) 
=0, falls zj- < wj, J =m 


18.3.3 Bellman Functional Equations 
18.3.3.1 Properties of the Cost Function 


In order to state the Bellman functional equations, the cost function must satisfy two requirements: 
1. Separability 


The function f(fi(xo, uj), ..., fa(X 1, n)) is called separable, if it can be given by binary functions 
Hiy,..., H4 and by functions F,,..., F, in the following way: 


F(filXo, U1), pas Aaa = Fi (fı (x0; u1), ο. fax, 4, u4)); 
Γι(7ι(ξο; αι), tt) fh (X4, u4)) =H; (fi Xo; 1); Pa f(x, Uy), κα Fn(Xn—1;Un))) ; 





Fr- ΠΚ (x Xp—2, Un— ws f(x. Xp—1;U, Un)) = Ηπ ι (fn— 1(X X2; Us 1) Fal fn(Xn—1; Un))) » 
Επ ης. Xn- U u,)) = f(x Xn-1: U, un). 
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2. Minimum Interchangeability 
A function H (f (a), F(b)) is called minimum interchangeable, if: 
(in. , H (fa), F(b)) — min H (Fla), min Duy (18.131) 
This property is satisfied, for example, if H is monotone increasing with respect to its second argument 
for every a € A, i.e., if for every a € A, 
H (f(a), F(b))) € H (f(a), F(bj)) for F(bi) € F(b»). (18.132) 


Now, for the cost function of the dynamic programming problem, the separability of f and the minimum 
interchangeability of all functions H;, j — 1(1)n — 1 are required. The following often occurring type 
of cost function satisfies both requirements: 


fem = Mx; uj) Or a = aa σι ; uj). (18.133) 
j=l 
The functions H; are 
Hy" = f(x; u;)- », fiGu-i ux) and (18.134) 
k=j+1 
Ay’ = ma , ma j 18.135 
wax [6 Xj-1,U uj) pax Je 1 w}. ( ) 


18.3.3.2 Formulation of the Functional Equations 


The following functions are defined: 


Φαν) — min — F;(fjQt i uj); ss fu(Xs i us) j — Y)n, (18.136) 
uy€Ux(xy 4) 
k=j(1)n 
n+1(X_) = 0. (18.137) 


If there is no policy (u;,...,u,) driving the state x; , into a final state x. E€ Xe, then we substitute 
— oo. Using the separability and minimum interchangeability conditions and the dynamic 
Pj Xj. 1 8 8 
constraints for j — 1(1)n we get: 


ó;(Xj 1) . min Hj(f;(xj 1, uj). min Fiaa(fja(xj uj). t3 fas u,))), 
u, €U;(x;_1) uL eU, (x, |) 
k=j+1(1)n 


= min Hj (Hæj), jala) 


α,εζη(ε; 1) 


b(a) = min H; (Hj w), djalo Ej- u))) - (18.138) 


ujeUj(x, i) 
Equations (18.138) and (18.137) are called the Bellman functional equations. Φι(Χρ) is the optimal 
value of the cost function f. 
18.3.4 Bellman Optimality Principle 


The evaluation of the functional equation 


Ba) = min ον) (0680 1) όμας))) (18.139) 


πι 
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corresponds to the determination of an optimal policy (u u;) for which the subprocess P; starting 
at state x;_, and consisting of the last n — j 4- 1 stages of m total process P minimizes the cost άρπα, 
i.e., 


F(x Sj- p Uj)-- adn Xn—1) u,)) — min!. (18.140) 
The optimal policy of the process Pj with the initial state x; ; is independent of the decisions u;, .. . 
uj. , of the first j — 1 stages of P which have driven P to the state x; 1. To determine $;(xj 1) the 


value $;,1(x;) is needed to know. Now, if (u5,...,u5) is an optimal policy for Pj, then, obviously, 
(uj ,,,..., U;) is an optimal policy for the subprocess P;,1 starting at x; — gj Bet. 1,uj). This state- 
ment is generalized i in the Bellman optimality principle. 

Bellman Principle: If (uj,...,u*) is an optimal policy of the process P and (xj, ..., x7) is the corre- 
sponding sequence of states, then for every subprocess P;, 7 = 1(1)n, with initial state xj_, the policy 
(uj, ...,U*) is also optimal. 


18.3.5 Bellman Functional Equation Method 


18.3.5.1 Determination of Minimal Costs 

With the functional equations (18.137), (18.138) and starting with ¢,+1(X,) = 0 every value ¢;(X;_1) 
with x;_; € Xj, is determined in decreasing order of j. It requires the solution of an optimum problem 
over the decision space U; (x;—1) for every Xj—ı € X;.,. For every x;_, there is a minimum point u; € U; 
as an optimal decision for the first stage of a subprocess P; starting at x; 4. If the sets X; are not finite 
or they are too large, then the values 9; can be calculated for a set of selected nodes x; ;, € Xj. 
The intermediate values can be calculated by a certain interpolation method. $i(xg) is the optimal 
value of the cost function of process P. The optimal policy (uj,...,u*) and the corresponding states 


)n 


(x5, ..., X5) can be determined by one of the following two methods. 


18.3.5.2 Determination of the Optimal Policy 


1. Variant 1: During the evaluation of the functional equations, the computed uj; is also saved 
for every Xj; ; € Xj.,. After the calculation of $;(xo), an optimal policy is got if xi = — 4 (Xo, uj) is 
determined from x, — xj and the saved u, — uj. From xj and the saved decision už follows χο, etc. 

2. Variant 2: For every x; ; € Xj, only $;(x; i) is saved. After every g(x Xj |) 15 known, a 
forward calculation is made. Starting with j — 1 and xj — xj one determines uj in increasing order of 
j by the evaluation of the functional equation 


ο ο min A (HE) bal 1 2)))- (18.141) 


.,ΕΌ)(κ} 


xj — g;(Xj ,, uj) isobtained. During the forward calculation, an optimization problem must be solved 
again at every stage. 

3. Comparison of the two Variants: The computation costs of variant 1 are less than variant 2 
requires because of the forward calculations. However decision uj is saved for every state x; 1. which 
may require very large memory in the case of a higher dimensional decision space U;(x; ,), while in 
the case of variant 2, only the values o;(x; ,) must be saved. Therefore, sometimes dece 2 is used 
on computers. 


18.3.6 Examples for Applications ofthe Functional Equation 
Method 
18.3.6.1 Optimal Purchasing Policy 


1. Formulation of the Problem 
The problem from 18.3.2.1, p. 944, to determine an optimal purchasing policy 
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OF j(u,...,u,) 2 Y, cju; — min! (18.1422) 
j=l 
CT $X;— Tad uj— 4 j=1)n, 
z9— 4, 0 € x; € K, j=1(1)n, (18.142b) 


U;(zj-1) = {u; : max{0, vj — zj} < uj < K — zj}, 1--1(1}η 

leads to the functional equations 
Pn+1(£n) = 0, (18.143) 
(45-1) = min i (cju; + Gja1(zj-12- uj — vj)), Jj — 1(1)n. (18.144) 


uj€U;(rj-i 


2. Numerical Example 








€ —4, c9 —3, cg —5, c4,—3, c5 —4, cg —2, 
Uj—6, v =T, vg —4, v4 —2, v5 =4, vg — 3. 
Backward Calculation: The function values ¢;(x;-1) will be determined for the states xj; = 


n=6, ΑΞ1βρ, Πα. 








0,1,...,10. Now, it is enough to make the minimum search only for integer values of u;. 
j 6: φΦοίας)-- min | cgug — cg max(0, vg — z5) — 2max(0,3 — v5). 
ug€Ue(zs) 


According to variant 2 of the Bellman functional equation method, only the values of ós(25) are written 
in the last row. For example, ¢4(0) is determined. 


(0) = nin, (3u4 + obs (tg = 2)) 
= min(28, 27, 26, 25, 24, 25, 26, 27, 30) = 24. 








zP-0|i1i[2[3|4[5[6[7[|8[9 [10] 
j=l 75 
2| 59 |56|53 50|47 44|41 | 38 | 35 | 32 | 29 
3| 44 [39|34|29| 24 21 18 |15 | 12| 9 | 6 
4| 34 [21|18|15]12, 9 | 6 415160 
5| 22 |18]14]106 42 0|0 0[|0 
6l 6 |4[|[2|0[|0]0/0/0/0/0/0 









































Forward Calculation: 
Φι(2) -- τὸ -- min (Aus + ó»(ui — 4)). 
One gets uj = 4 as the minimum point, therefore a} = x9 + uj — v; = 0. This method is repeated for 
¢2(0) and for all later stages. The optimal policy is: 
(uj, u5, u3, u4, uz, uc) = (4, 10, 1,6, 0,3). 
18.3.6.2 Knapsack Problem 


1. Formulation of the Problem 
Consider the problem given in 18.3.2.2, p. 944 


OF :f(ui,...,us) — Y, cju; — max! (18.1452) 
j=l 
CT: = &j = 4-1 — Wj, j — 1(1)n, 
zo = W, 0zz; € W, j — 1(1)n A 
u € {01} it za zw, iq (18.145b) 
uj — 0, if rj 4 < Wj; j — Yn. 


Since this is a maximum problem, the Bellman functional equations are now 


Ón41 (n) — 0, 
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Φα 1) Ξ- max {οι Φαν μι), 1 1η. 


uy CU; (3-1) 
The decisions can be only 0 and 1, so it is practical to apply variant 1 of the functional equation method. 
For j=n,n—-1,...,1: 
; ο Εφ — wj) if mj, 2 wj and cj + bj41(@j-1 — wy) > O41 (25-1), 
ój(zj-1) — 7 
Φμι(αη-1) otherwise, 


τι lif zj- > w; and cj + j1 (£j-1 — wy) > Φμι(Ώῃ.1). 
21 — 10 otherwise. 
2. Numerical Example 


.. = € 2-1, 902, cg=3, co =1, co=5, cg =4, 
Wo 10s πω w =2, we=4, w3=6, wa=3, w5=7, we =6. 














Since the weights w; are integers, the possible values for zj are xj € {0,1,...,10}, j = 1(1)n, and ap = 
10. The table contains the function values ġ;j(xj—1) and the actual decision "ία 0). 1) for every stage and 
for every state 1; ;. For example, the values of ós(x5), 63(2), ¢3(6), and $3(8) are calculated: 


delis e , if £5 < w = 4, tig(ts) = lo if v5 <4, 


ce = 6, otherwise, 0, otherwise. 
$3(2): T2 = 2« w3 = ο $3(2) = (2) = 3, ua(2) Ξ0. 
$3(6): | v9» ws and c3 + ós(za — wa) Ξ 6-5 «φιίαο)Ξ 10: φα(6) -- 10, υα(6) -- 0. 
Q3(8): v2» ws and c3 + ġ3(£2 — w3) = 6 + 9 > ġ4(z2)= 10:  $3(8) — 15, ua(8) — 1. 
The optimal policy is 








(uj, ug, UZ, Ug, Us, ug) = (0,1,1,1,0, 1), Φι(10) -- 19. 
r;-—0 1 2 3 4 5 6 7 8 9 10 | 
I 19: 0 
2 0:0 3:0 | 4:1 | 71 | 950) 30;1 | 13:1, 13; 1| 15;0 | 16;0 | 19; 1 
3 0:0 3:0 | 3:0 6:1 οτι 9:0 | 10:0 | 121 | 15; 1| 16; 1| 16;0 
4 0;0 | 3;1 | 3;1 | 3;1]6;0} 9;1 | 10;1 | 10;1 | 10;1 | 13;0} 16;1 
5 0:0) 0:0) 0:0 | 0:0) 6;0) 71) 7:1) 7:1) 1) 13; 1) 11 
6 0:0) 0:0) 0:0 | 00) 6:1) G1) 6:1) G1) G1) G1) &1 









































19 Numerical Analysis 


The most important principles of numerical analysis will be the subject of this chapter. The solution 
of practical problems usually requires the application of a professional numerical library of numerical 
methods, developed for computers. Some of them will be introduced at the end of Section 19.8.3. The 
special computer algebra system Mathematica will be discussed with its numerical programs in Chapter 
20, p. 1023 and in Section 19.8.4.2, p. 1016. Error propagation and computation errors will be examined 
in Section 19.8.2, p. 1004. 


19.1 Numerical Solution of Non-Linear Equations in a Single 


Unknown 
Every equation with one unknown can be transformed into one of the normal forms: 
Zero form: f(a) =0. (19.1) 
Fixed point form: x = y(x). (19.2) 


Suppose equations (19.1) and (19.2) can be solved. The solutions are denoted by r*. To get a first 
approximation of z*, we can try to transfom the equation into the form fi(r) — fao(x), where the 
curves of the functions y — fi(x) and y — f2(x) are more or less simple to sketch. 

WB f(x) =2?—sinz = 0. From the shapes of the curves y = 2? and y — sin x it can be seen that rj = 0 
and z5 £ 0.87 are roots (Fig. 19.1). 





-2 -1 
X*,-0 x*,20.87 





Figure 19.1 Figure 19.2 


19.1.1 Iteration Method 


The general idea of iterative methods is that starting with known initial approximations ry (k — 
0, 1,...,n) a sequence of further and better approximations is formed, step by step, hence the solu- 
tion of the given equation is approached by iteration, by a convergent sequence. A sequence is tried to 
be created with convergence as fast as possible. 


19.1.1.1 Ordinary Iteration Method 


To solve an equation given in or transformed into the fixed point form x = y(x), the iteration rule 








Inti = (Ln) (n=0,1,2,...; zo given) (19.3) 
is used which is called the ordinary iteration method. It converges to a solution «* if there is a neigh- 
borhood of x* (Fig. 19.2) such that 

vip cien) «X K «1 (K const) (19.4) 
T-T 
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holds, and the initial approximation x is in this neighborhood. If y(x) is differentiable, then the cor- 
responding condition is 





le (z)| € K « 1. (19.5) 
The convergence of the ordinary iteration method becomes faster with smaller values of K. 
E z’ = sinz, i.e., n 0 1 2 3 4 5 
Inti = VSin Lp. Cri 0.87 | 0.8742 | 0.8758 | 0.8764 | 0.8766 | 0.8767 


























sin £n | 0.7643 | 0.7670 | 0.7681 | 0.7684 | 0.7686 | 0.7686 


Remark 1: In the case of complex solutions, x = u + iv is substituted. Separating the real and the 
imaginary part, a system of two equations is obtained for the real unknowns u and v. 


Remark 2: The iterative solution of non-linear equation systems can be found in 19.2.2, p. 961. 
19.1.1.2 Newton's Method 
1. Formula of the Newton Method 


To solve an equation given in the form f(x) — 0, mostly the Newton method is used which has the 
formula 
F(E) 


Tn41 = Ln — fila ) 
n 


i.e., to get a new approximation £n+1, the values of the function f(x) and its first derivative f'(x) at £n 
are needed. 





(n — 0,1,2,... ; xo is given), (19.6) 


2. Convergence of the Newton Method 
The condition 
f(x) £0 (19.7a) 

is necessary for convergence of the Newton method, and the condition 

x) f"(« 
fefe) X K «1 (K const) (19.7b) 

P) 
is sufficient. The conditions (19.7a,b) must be fulfilled in a neighborhood of z* such that it contains all 
the points x, and z* itself. If the Newton method is convergent, it converges very fast. It has quadratic 
convergence, which means that the error of the (n+ 1)-st approximation is less than a constant multiple 
of the square of the error of the n-th approximation. In the decimal system, this means that after a 
while the number of exact digits will double step by step. 
E The solution of the equation f(x) = a? — a — 0, i.e., the calculation of x = va (a > 0 is given), 
with the Newton method results in the iteration formula 


trpi = : δι + z) : (19.8) 
njoj ı ἡ. 2 | 3 
1.416 6666 | 1.4142157 | 14142196 











We get for a = 2: 





Dr, | 15 


3. Geometric Interpretation 
The geometric interpretation of the Newton method is represented in Fig. 19.3. The basic idea of the 
Newton method is the local approximation of the curve y = f(x) by its tangent line. 
4. Modified Newton Method 
If the values of f’(an) barely change during the iteration, it can be kept as constant for a while, and the 
so-called modified Newton method can be used: 
T. 
Inti = Ta — ^ J (m fixed, m « n). (19.9) 


Tm 
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Figure 19.3 Figure 19.4 


The goodness of the convergence is hardly modified by this simplification. 
5. Differentiable Functions with Complex Argument 


The Newton method also works for differentiable functions with complex arguments. 


19.1.1.3 Regula Falsi 


1. Formula for Regula Falsi 
To solve the equation f(x) — 0, the regula falsi method has the rule: 


nyi απ -- ας πο τς η (n—1,2,...; m «mn; £o, xı are given). (19.10) 
F(%n) — | (απ) 

Only the function values are calculated. The method follows from the Newton method (19.6) by approx- 

imating the derivative f'(xn) by the finite difference of f(x) between £n and a previous approximation 

Lm (m <n). 

2. Geometric Interpretation 

The geometric interpretation of the regula falsi method is represented in Fig. 19.4. The basic idea of 

the regula falsi method is the local approximation of the curve y = f(x) by a secant line. 

3. Convergence 

The method (19.10) is convergent if m is chosen so that f (£m) and f(x) always have different signs. 

If the convergence already seems to be fast enough during the process, it will speed up if one ignores 

the change of sign, and substitutes x, — r4. 4. 


WB f(z) =2?-sinz =0. 














n | Aty = Ln — Ln-1 pm | fs) | Ayn = f (£n) — f (£n-1) Atn 
Ayn 

0 0.9 0.0267 

1 —0.3 0.87 —0.0074 —0.0341 0.8798 

2 0.0065 0.8765 —0.000252 0.007148 0.9093 

3 0.000229 .876729 0.000003 0.000255 0.8980 

4 —0.000003 0.876726 


If during the process the value of Axzn/Ayn only barely changes, one does not need to recalculate it 
again and again. 

4. Steffensen Method 

Applying the regula falsi method with £m = £n—1 for the equation f(x) = x— y(x) = 0 the convergence 
often can be sped up, especially in the case g'(x) < —1. This algorithm is known as the Steffensen 
method. 

E To solve the equation x? = sin x with the Steffensen method, the form f(x) = z — vsin x = 0 should 
be used. 
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Ages, 
n | Aty, = Ln — 2ῃ 1 A | f (24) | Ay = f (£n) — f (£n-1) A 
Un 
0 0.9 0.014942 
1 —0.03 0.87 —0.004259 —0.019201 1.562419 
2 0.006654 0.876654 | —0.000046 0.004213 1.579397 
3 0.876727 | 0.000001 
19.1.2 Solution of Polynomial Equations 
Polynomial equations of n-th degree have the form 
f(x) = pax) = anz” + Oya" + +: + ax + ao = Q. (19.11) 


For their effective solution efficient methods are needed to calculate the function values and the deriva- 
tive values of the function p, (x) and an initial estimate of the positions of the roots. 


19.1.2.1 Horner's Scheme 


1. Real Arguments 
To determine the value of a polynomial p, (x) of n-th degree at the point x = xo from its coefficients, 
first the decomposition 

Ῥπ(α) Ξ- αμα” + anu"! + +++ JF agz? + arx 4c ag — (x — xo)ps (2) - pa (xo) (19.12) 
is considered where p, 4 (x) is a polynomial of (n — 1)-st degree: 


pac) = dL agnos + a 4a"? Tod aqoa. (19.13) 
The recursion formula 
ayy = Xo, tan, (kK=n,n—-1,...,0; a, =0, ay = pr(2o)) (19.14) 


is obtained by coefficient comparison in (19.12) with respect to z^. (Note that a^, , — a4.) In this 
way, the coefficients aj, of p, .1(x) and the value p, (xo) are determined from the coefficients aj of p, (x). 
Furthermore fewer multiplications are required than by the "traditional" way. By repeating this pro- 
cedure, a decomposition of the polynomial p, 4(v) with the polynomial p, (v) is obtained, 

















Pn-1(£) — (x — vo)ps-2(x) + Pn—1(Xo) (19.15) 
etc., and a sequence of polynomials p, (x), ps 1(x) ... , pi(@), po(x) is resulted. The calculations of the 
coefficients and the values of the polynomial is represented in (19.16). 

απ απ- 1 απ 9 s>s GARI ag ay αρ 
10 Loa, ; Coal, _9 ... Cod, Toaz Toa) Todo 
t + 1 / / / n 
an-ı πο αμ. +++ yA 6 Dro) 
Xo X90, 9 Toa 4... Toaz Toa] Toag 
n "n n n n 
a ox uw an ET. α Pn—1(X0) 
n—2 n—3 n—4 1 0 n 
—— (19.16) 
-1 
To αρας ) 
(n-1) 
a pi (uo) 
Xo 
(n) m 
ay = po(Xo) 





From scheme (19.16) the value p, (zo), and derivatives p“) (ao) are as: 


p. (xo) — Mpa-i(xo), pl(zo) — 2!py a(xo), ..., p? (zo) = n!po(zo). (19.17) 
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B pi(z) = xt + 223 — 32? — T. 12 3 0.7 We see: 
'The substitution value and 2 2 8 10 20 pa(2) - 13, 
derivatives of p4(x) are cal- 14 5 10113 pi,(2) = 44 
culated at x9 = 2 according 9 2 1934 pi(2) = 66. 
to (19.16). 16 17 m pt(2) ae 60, 
2]. 5 m pP) = 24. 
1 8 |33 
2| 2 
1 10 
2 E——Á 
1 
Remarks: 





1. The polynomial p(x) can be rearranged with respect to the powers of « — xo, e.g., in the example 
above there is p4(2) = (x — 2)4 + 10( — 2)8 + 33(a — 2)? + 44(a — 2) + 13. 

2. The Horner scheme can also be used for complex coefficients az. In this case for every coefficient we 
have to compute a real and an imaginary column according to (19.16). 

2. Complex Arguments 

If the coefficients a; in (19.11) are real, then the calculation of p, (xo) for complex values zo — uo 4- iuo 
can be made real. In order to show this, p, (x) is decomposed as follows: 


Dn(&) = Ant” + απ ια 1 +++» + aye + ag 








= (x? — pz — q)(a4, 33" ? +--+ +45) +rie+ro with (19.18a) 
z? — pr — q = (£ — To)(£ — To), i.e., p = 2uo, q = — (u0? + vo’). (19.18b) 
Then, 
pa(xo) — rav + ro = (r1uo + ro) + ir1vo. (19.18c) 
To find (19.18a) the so-called two-row Horner scheme introduced by Collatz can be constructed: 
An Qn- n-2 ... a3 Gg αἱ αρ 
q qa, 3 ... qas qd» qd, qa 
p pa 9 pa, s... pas pa^ pag (19.18d) 
αγ ο Q4 3 Q4 4 0j αρ Ti Το 
— as 
WB (1) -- xt + 2x? — 3x? — 7. Calculate the value of p4 at £o = 2 — i, i.e., for p = 4 and q = —5. 


1 2 —3 0 -T7 Tt results in: 


—9 —30 —80 pa(%9) = 349 — 87 = —19 — 34i. 
424 64 pala) ° 








19.1.2.2 Positions ofthe Roots 


1. Real Roots, Sturm Sequence 
The Cartesian rule of signs gives a first idea of whether the polynomial equation (19.11) has a real root, 
or not. 
a) The number of positive roots is equal to the number of sign changes in the sequence of the cooeffi- 
cients 

Qn, Gn—1,-++, Q1, G0 (19.19a) 


or it is less by an even number. 
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b) The number of negative roots is equal to the number of sign changes in the coefficient sequence 

αρ, --ᾱα, dg,..., (—1)"an (19.19b) 

or it is less by an even number. 

Bl ps(z) — a? — 62^ 4- 102? -- 132? — 15x — 16 has 1 or 3 positive roots and 0 or 2 negative roots. 

To determine the number of real roots in any given interval (a, b), Sturm sequences are used (see 1.6.3.2, 
2., p. 44). 

Alter ο πῶς the function values y, — p, (xy) at a uniformly distributed set of nodes x, = xo +v- h 
(h constant, v — 0, 1,...) (which can be easily performed by using the Horner scheme) a good guess 
of the graph of the function and the locations of roots are obtained. If p,(c) and p,(d) have different 
signs, there is at least one real root between c and d. 

2. Complex Roots 

In order to localize the real or complex roots into a bounded region of the complex plane the following 
polynomial equation is considered which is a simple consequence of (19.11): 

f(x) = janar 4 Jas 2|r"? 4 --- 9 [ai|r 4- [ao] = lan|r” (19.20) 
and an upper bound ro is determined for the positive roots of (19.20), e.g., by systematic repeated trial 
and error. Then, for all roots zz, (k — 1,2,...,n) of (19.11), 

xx] < ro- (19.21) 
E /(7) — pi(x) — z*--4.42?—20.0127—50.12z4-29.45 — 0, f*(x) — 4.4r?4-20.011?4-50.12r -29.45 — 
r^. Some trials are 
6: f*(6) = 2000.93 > 1296 = r4, 
τι f*(7) = 2869.98 > 2401 = r4, 
=8: f*(8)- 3963.85 « 4096 — r*. 


From this it follows that |vz| « 8 (k — 1,2,3, 4). Actually, for the root 2} with maximal absolute value 
—7 < xj « —6 holds. 

Remark: A special method has been developed in electrotechnics in the so-called root locus theory 
for the determination of the number of complex roots with negative real parts. It is used to examine 
stability (see [19.11], [19.31]). 


19.1.2.3 Numerical Methods 
1. General Methods 


'The methods discussed in Section 19.1.1, p. 949, can be used to find real roots of polynomial equations. 
'The Newton method is well suited for polynomial equations because of its fast convergence, and the 
fact that the values of f(r,) and f'(x,) can be easily computed by using Horner's rule. By assuming 
that an approximation z;, of the root z* of a polynomial equation f(x) = 0 is sufficiently good, then 
the correction term ó — z* — x, can be iteratively improved by using the fixed-point equation 


1 1 s 
b= -FGJ flan) + al (ene +| = (ô). (19.22) 
2. Special Methods 

The Bairstow method is well applicable to find root pairs, especially complex conjugate pairs of roots. 
It starts with finding a quadratic factor of the given polynomial like the Horner scheme (19.18a-d) by 
determinung the coefficients p and q which make the coefficients of the linear remainder ro and rı equal 
to zero (see [19.30], [19.11], [19.31]). 

If the computation of the root with largest or smallest absolute value is required, then the Bernoulli 
method is the choice (see [19.19]). 

The Graeffe method has some historical importance. It gives all roots simultaneously including complex 
conjugate roots; however the computation costs are tremendous (see [19.11], [19.31]). 


r 
r 
r 
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19.2 Numerical Solution of Systems of Equations 
In several practical problems, there are rn conditions for the n unknown quantities x; (i = 1,2,...,n) 
in the form of equations: 

Pi(ri239,...,24) =0, 


fóixm,43.....844) 0, 
μα ) . (19.23) 


πα, οι...) Ξ 0. 


The unknowns z; are to be determined so that they form a solution of the system of equations (19.23). 
Mostly m = n holds, i.e., the number of unknowns and the number of equations are equal to each 
other. In the case of m > n, (19.23) is called an over-determined system; in the case of m < n it is an 
under-determined system. 

Over-determined systems usually have no solutions. Then one looks for the “best” solution of (19.23), 
in the Euclidean metric with the least squares method 


XO F? (x1, £25... , 8n) = min! (19.24) 
i-i 


or in other metrics as another extreme value problem. Usually, the values of n — m variables of an under- 
determined problem can be chosen freely, so the solution of (19.23) depends on n — m parameters. It 
is called an (n — m)-dimensional manifold of solutions. 

Linear and non-linear systems of equations are distinguished, depending on whether the equations are 
only linear or also non-linear in the unknowns. 


19.2.1 Systems of Linear Equations 


Consider the linear system of equations 





aj121 d- à1232 4. t + antn = br, 
Q21T1 d 422339 + +++ + dann = bo, 
μα. (19.25) 
Ani XL1 + AngLg + +- + Annin = bn- 
The system (19.25) can be written in matrix form 
Ax=b (19.26a) 
with 
Q11 Q12 *** Q1n bi, Ti, 
οι Q22 ` `` A2 bz, T2, 
A-|. ets ‘Be Ps 14 x= |. |. (19.26b) 
Gn1 Qn2 *** Qnn b, Yn 


Suppose the quadratic matrix A = (a;,) (i,k = 1,2,...,n) is regular, so system (19.25) has a unique 
solution (sce 4.5.2.1, 2., p. 309). In the practical solution of (19.25) two types of solution methods are 
distinguished: 

1. Direct Methods are based on elementary transformations, from which the solution can be obtained 
immediately. These are the pivoting techniques (see 4.5.1.2, p. 307) and the methods given in 19.2.1.1— 
19.2.1.3. 

2. Iteration methods start with a known initial approximation of the solution, and form a sequence 
of approximations that converges to the solution of (19.25) (see 19.2.1.4, p. 960). 


19.2.1.1 Triangular Decomposition of a Matrix 
1. Principle of the Gauss Elimination Method 


By elementary transformations 





956 19. Numerical Analysis 





1. interchanging rows, 
2. multiplying a row by a non-zero number and 
3. adding a multiple of a row to another row, 


the system A x — b is transformed into the so-called row echelon form 


Tu Ti fig ... Tin 
T22 T23 vd T2n 
Rx=c with R= T3300 +++ Τϑπ |, (19.27) 
0 i 
Tan 


Since only equivalent transformations were made, the system of equations R x = c has the same solu- 
tions as A x = b. From (19.27) it follows: 


in 1 D i 
io, a= —lq- Σ ται (i2n—1,n—2,...,1). (19.28) 
Tnn Τη k=i+1 


The rule given in (19.28) is called backward substitution, since the equations of (19.27) are used in the 
opposite order as they follow each other. 





The transition from A to R is made by n — 1 so-called elimination steps, whose procedure is shown by 
the first step. This step transforms matrix A into matrix Aj: 
1 1 1 
a) αἴθ... ας) 
αιι 012... Qn a) a 
a2, Q2 ... Aan 0 2393 ... 02s 
A=| %1 a3 ---  @3n |, Ai-| 0 | a...aP |]. (19.29) 
a a eats d 1 
ni n2 nn 0 al D . a) 











Then: 


1. An a4, 4 0 is chosen (according to (19.33)). If there is none, stop: A is singular. Otherwise a;4 is 
called the pivot. 


2. The first and the r-th row of A are interchanged. The result is A. 


3. The liz (i = 2,3,...,n) multiple of the first row is subtracted from the i-th row of the matrix A. 
The result is the matrix A; and analogously the new right-hand side b, with the elements 


αν = ūik — lūk with la = E 
ai 
b® = bi- lab (i,k =2,3,...,n). (19.30) 


The framed submatrix in A (sce (19.29)) is of type (n — 1,n — 1) and it will be handled analogously 
to A, etc. This method is called the Gaussian elimination method or the Gauss algorithm (see 4.5.2.4, 
p. 312). 

2. Triangular Decomposition 

The result of the Gauss elimination method can be formulated as follows: To every regular matrix A 
there exists a so-called triangular decomposition or LU factorization of the form 


PA-LR (19.31) 
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with 
Tn Tig Tig e Tin 1 
T22 T23 ΕΠ T2n [21 1 0 
R= T3 ee. Tan |) =| lar l2 1 . (19.32) 
0 e : : : E 
Tnn ln Ina m Ll 1 


Here R is called an upper triangular matrix, L is a lower triangular matrix and P is a so-called permu- 
tation matrir. A permutation matrix is a quadratic matrix which has exactly one 1 in every row and 
every column, and the other elements are zeros. The multiplication P A results in row interchanges in 
A, which comes from the choices of the pivot elements during the elimination procedure. 


3.16 Tı 2 
E The Gauss elimination method should be used for the system (: 1 s) (s) = (7) . In 
l d l T3 4 
schematic form, where the coefficient matrix and the vector from the right-hand side are written next 
to each other (into the so-called extended coefficient matriz), the calculations are: 


3 16 | 2 31 6 |2 


























3 16/|/2 
(Ab-||2 1a|7|»o |2/[|us-1|/s]| 5 [1/33/3 —1 10/3 |. ie, 
1114/4 1/3 | [2/3] -1 || 10/3 2/3 1/2|—1/2]| 4 






































100 3 1 ϐ 1 0 0 3 1 6 
P-|0 1) + PA=(1 1 1),L=[1/3 1 0}],R=[0 2/3 -1 J. 
010 213 2/3 1/2 1 0 0 -1/2 


In the extended coefficient matrices, the matrices A, A, and Ag, and also the pivots are shown in boxes. 
Solution: 73 = —8, £2 = —7, 7, = 19. 


3. Application of Triangular Decomposition 

With the help of triangular decomposition, the solution of the linear system of equations A x — b can 
be described in three steps: 

1. PA — L R: Determination of the triangular decomposition and substitution Rx = ο. 

2. Lc - P b: Determination of the auxiliary vector c by forward substitution. 


3. Rx — c: Determination of the solution x by backward substitution. 

If the solution of a system of linear equations is handled by the expanded coefficient matrix (A, b), 
as in the above example, by the Gauss elimination method, then the lower triangular matrix L is not 
needed explicitly. This can be especially useful if several systems of linear equations are to be solved 
after each other with the same coefficient matrix, with different right-hand sides. 


4. Choice ofthe Pivot Elements 


: ο : 
Theoretically, every non-zero element alt > of the first column of the matrix A, could be used as a 


pivot element at the k-th elimination step. In order to improve the accuracy of solution (to decrease 
the accumulated rounding errors of the operations), the following strategies are recommended. 


1. Diagonal Strategy The diagonal elements are chosen successively as pivot elements if possible, 
i.e., there is no row interchange. This kind of choice of the pivot element makes sense if the absolute 
value of the elements of the main diagonal are fairly large compared to the others in the same row. 


2. Column Pivoting To perform the k-th elimination step, such row index r is chosen for which: 


ak), (19.33) 





k-1 
lark | = max 
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Ifr z k, then the r-th and the k-th rows will be interchanged. It can be proven that this strategy makes 
the accumulated rounding errors smaller. 


19.2.1.2 Cholesky's Method for a Symmetric Coefficient Matrix 


In several cases, the coefficient matrix A in (19.26a) is not only symmetric, but also positive definite, 
i.e., for the corresponding quadratic form Q(x) holds 


Q(x) =x"Ax= Y: 5 aiia, 2 Ὁ (19.34) 


i-lk-l 
for every x € R^, x # 0. Since for every symmetric positive definite matrix A there exists a unique 
triangular decomposition 





A=LL* (19.35) 
with 
lu 
lı l2 0 
L= | αι ἴοι ls ] (19.364) 
lni Ino Ing ttt lan 
af) 
lkk = ab D. lik = $ (i=k,k+1,...,n); (19.365) 
kk 
αὖ - ας) — lal, Gj=k+1,k+2,...,n), (19.360) 


the solution of the corresponding linear system of equations A x = b can be determined by the Cholesky 
method by the following steps: 

1. A = LLT: Determination of the so-called Cholesky decomposition and substitution LTx = c. 

2. Lc — b: Determination of the auxiliary vector c by forward substitution. 

3. LTx = c: Determination of the solution x by backward substitution. 

For large values of n the computation cost of the Cholesky method is approximately half of that of the 
LU decomposition given in (19.31), p. 956. 


19.2.1.3 Orthogonalization Method 
1. Linear Fitting Problem 


Suppose an over-determined linear system of equations 
n 
» απο ὃν (i =1,2,...,m;m>n), (19.37) 
k=1 


is given in matrix form 

Ax =b. (19.38) 
Suppose the coefficient matrix A = (aip) with size (m x n) has full rank n, i.e., its columns are linearly 
independent. Since an over-determined linear system of equations usually has no solution, instead of 
(19.37) the so-called error equations are considered 


r= Σ Apt, —b; (i—1,2,...,m; mm) (19.39) 
k=1 


with residues r;, and the sum of their squares should be minimalized: 


m 


m a 2 
Sor? = Σ | Σ αμ -- Ἴ — F(ri,22,..., 24) — min! (19.40) 
i=1 k=l 


i=1 
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The problem (19.40) is called a linear fitting problem or a linear least squares problem (see also 6.2.5.5, 
p. 456). The necessary condition for the relative minimum of the sum of residual squares F (x1, £2,..., 
Ln) is 


OF 


--- kS eil 19.41 

Fae 70 (HAL) (19.41) 
and it leads to the linear system of equations 

ATAx = ATb. (19.42) 


The transition from (19.38) to (19.42) is called a Gauss transformation, since the system (19.42) arises 
by applying the Gaussian least squares method (see 6.2.5.5, p. 456) for (19.38). Since A is supposed 
to be of full rank, ATA is a positive definite matrix of size (n x n), and the so-called normal equations 
(19.42) can be solved numerically by the Cholesky method (see 19.2.1.2, p. 958). 

One can have numerical difficulties with the solution of the normal equations (19.42) if the condition 
number (see [19.24]) of the matrix ATA is too large. The solution x can then have a large relative 
error. Because of this problem, it is better to use the orthogonalization method for solving numerically 
linear fitting problems. 


2. Orthogonalization Method 


The following facts are the basis of the following orthogonalization method for solving a linear least 
squares problem (19.40): 


1. The length of a vector does not change during an orthogonal transformation, i.e., the vectors x and 
X = Qox with 

QQ =E (19.43) 
have the same length. 
2. For every matrix A of size (m, n) with maximal rank n (n < m) there exists an orthogonal matrix 
Q of size (m, m) such that 


T11 T12 --- Tin 
122 +++ Tan 
^ . T a R See d 
A-QR (1944 wih QQ—-E απὶ Β-|ρ]- $a (19.45) 
Tnn 
ο 


Here R is an upper triangular matrix of size (n, n), and O is a zero matrix of size (m — n, n). 
The factored form (19.44) of matrix A is called the QR decomposition. So, the error equations (19.39) 
can be transformed into the equivalent system 





Tiiti + riz +... + Tinn οι. =f, 
T9939 -...-- Τηῃζῃ —b3 Ξ- f, 
Tnnin —bn = Pas (19.46) 


—bnti = Pati, 


without changing the sum of the squares of the residuals. From (19.46) it follows that the sum of the 
squares is minimal for 7, = 7g =--- = 7, = 0 and the minimum value is equal to the sum of the squares 
of f441 tO Pm. The required solution x can be got by backward substitution 


Rx = by, (19.47) 
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where bo is the vector with components bs, bo, ... , b, obtained from (19.46). 
There are two methods most often used for a stepwise transition of (19.39) into (19.46): 


1. Givens transformation, 

2. Householder transformation. 

The first one results in the QR decomposition of matrix A by rotations, the other one by reflections. 
The numerical implementations can be found in [19.23]. 


Practical problems in linear least squares approximations are solved mostly by the Householder trans- 
formation, where the frequently occurring special band structure of the coefficient matrix A can be 
used. 


19.2.1.4 Iteration Methods 
1. Jacobi Method 


Suppose in the coefficient matrix of the linear system of equations (19.25) every diagonal element aj 
(i = 1,2,...,n) is different from zero. Then the i-th row can be solved for the unknown z;, and it 
immediately results the following iteration rule, where ju is the iteration index: 


b; nr a, 
gq(D — DEL yo PESO) (12...) (19.48) 
απ ἔτι Qu 
(k#i) 
(n. 0,1, 2,...5 a. a, εν πο are given initial values). 


Formula (19.48) is called the Jacobi method. Every component of the new vector x "^9 is calculated 
from the components of x ^. If at least one of the conditions 














max Σ [5 column sum criterion (19.49) 
Aii 
á Zh) 
or 
eas 
max 3 |= row sum criterion (19.50) 
rr ee 
ἆπθ 


holds, then the Jacobi method is convergent for any initial vector x. 


2. Gauss-Seidel Method 


If the first component ot) is calculated by the Jacobi method, then this value can be used in the 


calculation of α΄ +D. While proceeding similarly in the calculation of the further components, the 


following iteration formula is obtained: 


b Ela n a; 

qu E aun ή, Σ Cik (H) (19.51) 
Qi pay Vii k=i+1 “ii 

(021,2; f; ας) VQ ...,c© given initial value; ji — 0,1,2,...). 


Formula (19.51) is called the Gauss-Seidel method. The Gauss-Seidel method usually converges faster 
than the Jacobi method, but its convergence criterion is more complicated. 
E 1024 = 322 = 423 T 2x4 = 14, 
—3zı + 26x2 + 5zx3—  z4,-— 22, 
—4zı + 5x2 + 16x3 + δα -- 17, 
2%, + ὅπο-- ἀπ -- 12145 --20. 
The corresponding iteration formula according to (19.51) is: 
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με 1 (u) (u) QD Some approximations and the solution 

i — — (14432 4r3 | — 2r 3 

E 10 ( dra ΕΞΩ is ) ᾿ are given here: 
1 (0) (1) (4) (5) 

it) = 26 (22 + βαν — 5a + af”) ; x | x | * | x | x 
1 0 |14 1.5053 | 1.5012 | 1.5 

of) = — (17+ 4a" — 5a — 5a), 0 | 1.0077 | 0.9946 | 0.9989 | 1 
16 0 | 1.0976 | 0.5059 | 0.5014 | 0.5 
1 

av) = 5 (-20+ θα) 4 a, r7 Ax). O0 | 1.7861 | 1.9976 | 1.9995 | 2 


3. Relaxation Method 
The iteration formula of the Gauss-Seidel method (19.51) can be written in the so-called correction 
form 








jc n 
af) 2 4 (2 ik utd) ap), ie., 


af’ 4 αγ 
αἱ ια kzi lii 
sP = κα (i=1,2,..., n; p =0,1,2,...). (19.52) 
By an appropriate choice of a relaxation parameter w and rewriting (19.52) in the form 
th) 20 Lud (i =1,2,...,n; 1 =0,1,2,...), (19.53) 
one can try to improve the speed of convergence. It can be shown that convergence is possible only for 
0<w<2. (19.54) 


For w = 1 we retrieve the Gauss-Seidel method. In the case of w > 1, which is called over-relaxation, 
the corresponding iteration method is called the SOR method (successive overrelaxation). The deter- 
mination of an optimal relaxation parameter is possible only for some special types of matrices. 
Iterative methods are applied to solve linear systems of equations in the first place when the main diag- 
onal elements a, of the coefficient matrix have an absolute value much larger than the other elements 
aip (i # k) (in the same row or column), or when the rows of the system of equations can be rearranged 
in a certain way to get such a form. 


19.2.2 System of Non-Linear Equations 
Suppose the system of n non-linear equations 

Fi(rida.,25)—0 (6-1,2,...;n) (19.55) 
for the n unknowns 21, 15, ..., r, has a solution. Usually, a numerical solution can be given only by 
an iteration method. 
19.2.2.1 Ordinary Iteration Method 


The ordinary iteration method can be used if the equations (19.55) can be transformed into a fixed- 
point form 

ων πο μα. ο (19.56) 
Then, starting from estimated approximations α΄), a. .., ©), the improved values are obtained 
either by 
1. iteration with simultaneous steps 


ott) = fi (a, of, ΝΙΝ πρ] (11,2, n; 416 :0,1,2,...) (19.57) 


an 
or by 
2. iteration with sequential steps 


ott) — fi [ασ is aD. πώ, alt), iu ae) ο ο 1 HHO, 22 2.) (19.58) 


i 
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It is of crucial importance for the convergence of this method that in the neighborhood of the solution 
the functions f; should depend only weakly on the unknowns, i.e., if f; are differentiable, the absolute 
values of the partial derivatives must be rather small. We get as a convergence condition 

















n Of, 
K «1 with K = max | X` max fi I (19.59) 

UC T OX, 

With this quantity A’, the error estimation is the following: 
K 
max ee? = tij < qog Max la 7 - αἴ ‘ (19.60) 
t — 1 

Here, x; is the component of the required solution, αἰ) and αν +D are the corresponding p-th and 


(u + 1)-th approximations. 


19.2.2.2 Newton’s Method 


The Newton method is used for the problem given in the form (19.55). After finding the initial ap- 


ο i 0 0 2 " : 
proximation values ας ) af ..., ©), the functions F; are expanded in Taylor form as functions of n 


independent variables z4, 15,...,c, (see p. 471). Terminating the expansion after the linear terms, 
from (19.55) a linear system of equations is obtained, and iterative improvements can be got by the 
following formula: 


F; (a9, a9... a) +E E (e, ... a9) (a — (P) — 0 (19.61) 





ο πω ο = ve). 
The coefficient matrix of the linear system of equations (19.61), which should be solved in every iteration 
step, is 


F'(x()) — (= (P eaa) (i,k =1,2,...,n) (19.62) 
Tk 


and it is called the Jacobian matriz. Ifthe Jacobian matrix is invertible in the neighborhood of the so- 
lution, the Newton method is locally quadratically convergent, i.e., its convergence essentially depends 


on how good the initial approximations are. If 2" — 2 = d\ are substituted in (19.61), then the 
Newton method can be written in the correction form 
a Lu) 4 di? (151,2: 3:05 15:012). (19.63) 


To reduce the sensitivity to the initial values, analogously to the relaxation method, a so-called damping 
or step length parameter y can be introduced (damping method): 

ot) = al? +d (21,2,...,5; 525 0,1,2,:..; 0). (19.64) 
Methods to determine y can be found in [19.24]. 


19.2.2.3 Derivative-Free Gauss-Newton Method 

To solve the least squares problem (19.24), one proceeds iteratively in the non-linear case as follows: 
1. Starting from a suitable initial approximation αἴ), πο, ... 420. the non-linear functions F;(21, c3, 
sss, q8) (i = 1,2,...,m;m » n) are approximated as in the Newton method (see (19.61)) by linear 





approximations Fio, 12,..., 24), which are calculated in every iteration step according to 
F(a, rte) =H [ο al? που zW) + x oF [οὗν un αρ) (ο. — af) 
ΠΠ ; ΠΠ n 2 δαι ΠΟ 


((—1,2,...,05mj 4 0,1,2,...). (19.65) 
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2. dU? — x, — a(? are substituted in (19.65) and the corrections d^ are determined by using the 
Gaussian least squares method, i.e., by the solution of the linear least squares problem 


35 F2... m.) = min, (19.66) 
e.g., with the help of the normal equations (see (19.42)), or the Householder method (see 19.6.2.2, 


p. 985). 
3. The approximations for the required solution are given by the formulas 


ot) - a) + di? ot (19.672) 
apr egi ead?” (le ο vn), (19.67b) 
where y (y > 0) is astep length parameter similar to the Newton method. 


By repeating steps 2 and 3 with zm instead of ot) one gets the Gauss-Newton method. It results in 


a sequence of approximation ON whose convergence strongly depends on the accuracy of the initial 
approximation. The sum of the error squares can be reduced by introducing a length parameter y. 





OF; 
Ifthe evaluation of the parti: 53 : (ay, n 209) (i =1,2,...,m;k =1,2,...,n) requires 
Tk 
too much work, the partial derivatives can be approximated by difference quotients : 
OF; 1 
a ion ee ae a) N lh [εν ENCOUNTER a) M a) 
-Ε C m=i am i Doe e= N e: (19.68) 


The so-called discretization step sizes nO may depend on the iteration steps and the values of the vari- 
ables. 

If the approximations (19.68) are used, then only function values F; are to be calculated while perform- 
ing the Gauss-Newton method, i.e., the method is derivative free. 


19.3 Numerical Integration 


19.3.1 General Quadrature Formulas 


The numerical evaluation of the definite integral 


= [ fe) dx (19.69) 


must be done only approximately if the integrand f(x) cannot be integrated by elementary calculus, 
or it is too complicated, or when the function is known only at certain points x,, at the so-called inter- 
polation nodes from the integration interval [a,b]. The so-called quadrature formulas are used for the 
approximate calculation of (19.69). They have the general form 


= 5 CovYv + y ey qp sese Ὁ» Cy, JP (19.70) 
v=0 v=0 v=0 


with y) = f(a) (u = 1,2,...,p; v — L2,...,n), y, — f(z), and constant values of c,,. Obvi- 
ously, 


Uf) = Q(f) +R, (19.71) 


where R is the error of the quadrature formula. In the application of quadrature formulas it is supposed 
that the required values of the integrand f (x) and its derivatives at the interpolation nodes are known 
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as numerical values. Formulas using only the values of the function are called mean value formulas; 
formulas using also the derivatives are called Hermite quadrature formulas. 


19.3.2 Interpolation Quadratures 


The following formulas represent so-called interpolation quadratures. Here, the integrand f(x) is in- 
terpolated at certain interpolation nodes (possibly the least number of them) by a polynomial p(x) of 
corresponding degree, and the integral of f (x) is replaced by that of p(x). The formula for the integral 
over the entire interval is given by summation. Here the formulas are given for the most practical cases. 
The interpolation nodes are equidistant: 


b—a 


TL 





νΞταρ vh (v=0,1,2,...,n), rzo—a, zx, —b, h— (19.72) 


An upper bound for the magnitude of the error |R] is given for every quadrature formula. Here, M, 
means an upper bound of | /? (z)| on the entire domain. 


19.3.2.1 Rectangular Formula 


In the interval [xo, xo 4- A], f (x) is replaced by the constant function y — yo — f (xo), which interpolates 
f (x) at the interpolation node xo, which is the left endpoint of the integration interval. In this way the 
simple rectangular formula is obtained: 


d h? 
I f@)dem hyo, |R] < ZM. (19.73a) 
zo 


The left-sided rectangular formula by summation is 


)h 


b—a 
fiou (yo +y +yz +: + Yna) μμ] κ P y, (19.73b) 


Mi denotes an upper bound of | /"(z)| on the entire domain of integration. 
One gets analogously the right-sided rectangular sum, if one replaces yo by y; in (19.73a). The formula 
is 


a)h 


fioa h(y ya: ys) |R| < 9n, (19.74) 


19.3.2.2 Trapezoidal Formula 

f(x) is replaced by a polynomial of first degree in the interval [ro, xo 4- A], which interpolates f(x) at 

the interpolation nodes xo and zı = zo + h. The approximation is 

ΠΡ h h? 

[ fa)dem Sw tin), R< Me (19.75) 
To 

The so-called trapezoidal formula can be obtained by summation: 


(b— E. 


"n )ds e n (9 πρ. ra c I), |R| € (19.76) 


M» denotes an upper bound of | /"(x)| on the entire integration domain. The error of the trapezoidal 
formula is proportional to h?, i.e., the trapezoidal sum has an error of order 2. It follows that it converges 
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to the definite integral for  — 0 (hence, n — oc), if rounding errors are not considered. 


19.3.2.3 Simpson's Formula 
f(x) is replaced by a polynomial of second degree in the interval [:79, 29 + 2h], which interpolates f(z) 
at the interpolation nodes x, £1 = £o + h and x3 — x 4- 2h: 
xot2h A hb 
f(x) da z 3 (Yo t4yucy) |R|< 9o M (19.77) 


To 


n must be an even number for a complete Simpson formula. The approximation is 


b 
h 
[κω de ~ (yo ο) (19.78) 
b — a)h* 
IR| < OW ah Me 


Ma is an upper bound for | f ()| on the entire integration domain. The Simpson formula has an error 
of order 4 and it is exact for polynomials up to third degree. 


19.3.2.4 Hermite's Trapezoidal Formula 
f(x) is replaced by a polynomial of third degree in the interval (vo, xo 4- ^], which interpolates f(a) and 
f' (x) at the interplation nodes zo and z4 — xo +h: 

roth 


he 
(=y) [RIS Ma. (19.79) 


h h? 
f(x) de ~ z(u +u) + 35 720 


12 


20 


The Hermite trapezoidal formula is obtained by summation: 


b 
Yo Yn hs ; (b — a)h* 
r dr h (2 ης. tn) Zap- R< CL mM, (19.80 
fro sahig taty Ἔα Ἐφ] ETU Y) IRIS ag Me (19-80) 
MA denotes an upper bound for |f? (z)| on the entire integration domain. The Hermite trapezoidal 
formula has an error of order 4 and it is exact for polynomials up to third degree. 


19.3.3 Quadrature Formulas of Gauss 


Quadrature formulas of Gauss have the general form 


b n 
| κο dz ~ X` cy with y = f (£) (19.81) 
a v=0 


where not only the coefficients c, are considered as parameters but also the interpolation nodes ην. 
These parameters are determined in order to make the formula (19.81) exact for polynomials of the 
highest possible degree. 

The quadrature formulas of Gauss result in very accurate approximations, but the interpolation nodes 
must be chosen in a very special way. 


19.3.3.1 Gauss Quadrature Formulas 

If the integration interval in (19.81) is chosen as [a, b] — [—1, 1], and the interpolation nodes are chosen 
as the roots of the Legendre polynomials (see 9.1.2.6, 3., p. 566, 21.12, p. 1108), then the coefficients 
c, can be determined so that the formula (19.81) gives the exact value for polynomials up to degree 
2n 4- 1. The roots of the Legendre polynomials are symmetric with respect to the origin. For the cases 
n — 1,2 and 3 they are: 
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n=l αρ---αι, c9 — l, 
1 
αι —- —0.577350269..., c-—1. 
178 1 : 
n=2: 2% =—2o, Co — 9^ 
8 
21 =0, απο; (19.82) 
q9 — [i =0.774506660... C2 = CQ. 
n=3: To= —ī3, Co = 0.347 854 854... , 
πι --ᾱο, c1 — 0.652145 154... , 
z2 = 0.339 981043..., C9 — C4, 
$3 — 0.861136311..., C3 — CQ. 
Remark: A general integration interval [a,b] can be transformed into [—1, 1] by the transformation 
b- b 
t= τω: (t € [a,b], x € [-1, 1]). Then 








2 2 








= b 
Sem ) (19.83) 


b 
”-ας 
t) dt = —— n dy 
[roast vas (ta + Ss 
with the values x, and c, given above for the interval [—1, 1]. 


19.3.3.2 Lobatto's Quadrature Formulas 

In some cases it is reasonable also to choose the endpoints of the subintervals as interpolation nodes. 
Then, there are 2n more free parameters in (19.81). These values can be determined so that polynomials 
up to degree 2n — 1 can be integrated exactly. In the cases n — 2 and n = 3: 


Hom Ὢ, nm =. 3: 
2 1 1 1 ] 
to = — L1, C&O = z, zo = — 1, = = 
0 o=3 0 076 
A ue ..4 (19.84a) "P 29 
t= ] == vi T a= p (19.84b) 
v2 — 4, C= Co: $9 — —2 —0.47213595..., C2 = G1, 
2 2 2 1 
t= 1, c3 = Co 


The case n = 2 represents the Simpson formula. 


19.3.4 Method of Romberg 


To increase the accuracy of numerical integration the method of Romberg can be recommended, where 
one starts with a sequence of trapezoid sums, which is obtained by repeated halving of the integration 
step size. 

19.3.4.1 Algorithm of the Romberg Method 

The method consists of the following steps: 

1. Trapezoid sums determination 

The trapezoid sum T'(h;) according to (19.76) in 19.3.2.2, p. 964 is determined as an approximation of 


b 
the integral / f(x)dx with the step sizes 
Ja 


hy = = (i =0,1,2,...,m). (19.85) 
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Here, the recursive relation 


A hia hy} 1 1 | hia | Lh: | i δι 
T(hi) τί J ) 3 ΗΟ : s(a PTS ) + f(at his) 4 f(a i shia) 


ee eee a eee (a p Ehia) 4 70) (19.86) 




















y he NI . . h= 
= iT (h 1) : AS (e+ : τ ελ} (i21,2,...,m;n 2 2*1) 


is considered. Recursion formula (19.86) tells that for the calculation of T'(h;) from T (h; 1) the function 
values must be calculated only at the new interpolation nodes. 
2. Triangular Scheme 
Toi = T(h;) (i — 0,1,2,...) issubstituted and the values 
Ty a4 — Tk 

Ta — Tai AE CIA (k—1,2,...,m; i kk 1...) (19.87) 
are calculated recursively. The arrangement of the values calculated according to (19.87) is most prac- 
tical in a triangular scheme, whose elements are calculated in a column-wise manner: 


T (ho) = Too 

T(hı)= To Tu 

T(ha) — Too Ti» T» (19.88) 
T(ha)-— Tos Tis Toa Ta 


The scheme will be continued downwards (with a fixed number of columns) until the lower values at 


the right are almost the same. The values T3; (i — 1,2,...) of the second column correspond to those 
calculated by the Simpson formula. 


19.3.4.2 Extrapolation Principle 
The Romberg method represents an application of the so-called extrapolation principle. This will be 
demonstrated by deriving the formula (19.87) for the case k = 1. The required integral is denoted by J, 
the corresponding trapezoid sum (19.76) by T(h). If the integrand of I is (2m + 2) times continuously 
differentiable in the integration interval, then it can be shown that an asymptotical expansion with 
respect to h is valid for the error R of the quadrature formula, and it has the form 

R(h) =I —T(h) = ah? + agh* +--+ + amh?” + O(R™*?) (19.89a) 
or 

T(h) = I — ayh? — agh* — ++ — an h?™ + O(W?™*?). (19.89b) 
The coefficients a1, @2,...,@m are constants and independent of A. 


h 
T(h) and T (5) are formed according to (19.89b) and the linear combination 





Ty(h) = aT (h) + aeT (5) = (a, + a2)I — ay (a | =) h? — ag (a | τ) h^—... (19.90) 


α 
is considered. If a, + ag = 1 and a, + a = Q0 are substituted, then T1(A) has an error of order 4, while 


T(h) and T (h/2) both have errors of order only 2. The formula is 
h 


T(c]-T(h) 
τι) 3T) | στ) στίς) jue ~ (19.91) 
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This is the formula (19.87) for k — 1. Repeated application of the above procedure results in the 
approximation Tz; according to (19.87) and 


Ty — I4 O(n**?). (19.92) 


1 sing 





H The definite integral J = / dx: (integral sine, see 8.2.5, 1., p. 513) cannot be obtained in an 
0 


T 


elementary way. Calculate the approximate values of this integral (calculating for 8 digits). 





k-0 | k=1 k-2 | k-3 
0.92073549 
1. Romberg method: 0.93979328 | 0.94614588 


0.94451352 | 0.94608693 | 0.94608300 
0.94569086 | 0.94608331 | 0.94608307 | 0.94608307. 


The Romberg method results in the approximation value 0.94608307. The value calculated for 10 digits 
is 0.9460830704. The order O ((1/8)°) ~ 6 - 1078 of the error according to (19.92) is verified. 

2. Trapezoidal and Simpson Formulas: From the scheme of the Romberg method it can be got 
directly that for hg = 1/8 the trapezoid formula has the approximation value 0.94569086 and the 


Simpson formula gives the value 0.94608331. 
The correction of the trapezoidal formula by Hermite according to (19.79) results in the value [ = 


0.30116868 
945 —— = 0.9% 1. 
0.94569086 + 64-12 0.9460830 
3. Gauss Formula: By the formula (19.83) we get for 
1 1 1 1 1 . 
π-1: Imi [eos (50 + ;) raf (50 + >) = 0.94604113: 





1 1 1 1 i 1 1 
m=2 Iz 5 [eos (50 + 5) taf (50 + 5) + cof (95: + 3] — 0.94608313; 





1 1 1 1 1 
θε Πω pat era dps erg ~a3+— = 0.94608307. 
2 5 [ef (57 a) ἜΝ (57 TE 2] 

It can be observed that the Gauss formula results in an 8-digit exact approximation value for n = 3, 
ie., with only four function values. With the trapezoidal rule this accuracy would need a very large 


number (> 1000) of function values. 

Remarks: 

1. Fourier analysis has an important role in integrating periodic functions (see 7.4.1.1, 1., p. 474). The 
details of numerical realizations can be found under the title of harmonic analysis (see 19.6.4, p. 992). 
The actual computations are based on the so-called Fast Fourier Transformation FFT (see 19.6.4.2, 
p. 993). 

2. In many applications it is useful to take the special properties of the integrands under consideration. 
Further integration routines can be developed for such special cases. A large variety of convergence 
properties, error analysis, and optimal integration formulas is discussed in the literature (see, e.g., 
(19.4]). 

3. Numerical methods to find the values of multiple integrals are discussed in the literature (see, e.g., 


[19.26]). 
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19.4 Approximate Integration of Ordinary Differential 
Equations 


In many cases, the solution of an ordinary differential equation cannot be given in closed form as an 
expression of known elementary functions. The solution, which still exists under rather general circum- 
stances (see 9.1.1.1, p. 540), must be determined by numerical methods. These result only in particular 
solutions, but it is possible to reach high accuracy. Since differential equations of higher order than one 
can be either initial value problems or boundary value problems, numerical methods were developed 
for both types of problems. 


19.4.1 Initial Value Problems 


The principle of the methods presented in the following discussion to solve initial value problems 


y= f(x,y),  y(xo) = Yo (19.93) 
is to give approximate values y; for the unknown function y(x) at a chosen set of interpolation points 
xi. Usually, equidistant interpolation nodes are considered with a previously given step size h: 


zi =zo+ih (i=0,1,2,...). (19.94) 
19.4.1.1 Euler Polygonal Method 


An integral representation of the initial value problem (19.93) is given by integration 


v2) = w+ f fæ yl) de. (19.95) 
zo 
This is the starting point for the approximation 
zo+h 
wes) — ao f Λίωιυ(ω)) ἀν 55 yo + hf (2oy) = v (19.96) 
Zo 


which is generalized as the Euler broken line method or Euler polygonal method: 

Yiyi = Yi + hf(xi yi) (¢=0,1,2,... y(vo) — vo). (19.97) 
For a geometric interpretation see Fig. 19.5. Comparison of (19.96) with the Taylor expansion 

y 


y(x) y(@1) = y(a +h) 





i 
— yo -- f (zo, yo)h -- Va (19.98) 
with £o < € < % +h shows that the approx- 
imation y, has an error of order ^?. The ac- 
curacy can be improved by reducing the step 
Ys ». Size h. Practical calculations show that halv- 
0 Xo X X2 X3 x ing the step size h results in halving the error 
n fS /N / of the approximations Yi. 
A quick overview of the approximate shape of 
h h h : τ 
the solution curve can be got by using the Eu- 
Figure 19.5 ler method. 





19.4.1.2 Runge-Kutta Methods 


1. Calculation Scheme 

'The equation y'(x) = f(x,y) determines at every point (x9, yo) a direction, the direction of the tangent 
line of the solution curve passing through the point (xo, yo). The Euler method follows this direction 
until the next interpolation node. The Runge-Kutta methods consider more points “between” (£o, yo) 
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and the possible next point (a9+h, yi) of the curve, and depending on the appropriate choice of these ad- 
ditional points more accurate values are obtained for yı. There exist Runge-Kutta methods of different, 
orders depending on the number and the arrangements of these "auxiliary" points. Here a fourth-order 
method (see 19.4.1.5, 1., p. 972) is shown. (The Euler method is a first-order Runge-Kutta method.) 


The calculation scheme of fourth order 


























for the step from πρ tor, = To +h S y k=h- f(2,y) 
to get an approximate value for y; of 
(19.93) is given in (19.99). The further πρ Yo ky 
steps follow the same scheme. zo + h/2 Yo + kı/2 ka 
The error of this Runge-Kutta method zo +h/2 Yo + k2/2 kg (19.99) 
has order A? (at every step) accord- vo ch Yo + ks ka 
ing to (19.99), so with an appropriate 1 
choice of the step size high accuracy αι zo | yi — yod ge - 2ks -- 2ka -- k4) 
can be obtained. 
1 1 
WB y-— πα y?) with y(0) — 0. y(0.5) is determined x y k= giv” ty?) 
in one step, i.e. h = 0.5 (see the table on the right). The 0 0 0 
exact value for 8 digits is 0.01041860. 0.25 | 0 0.00781250 
2. Remarks 0.25 | 0.00390625 | 0.00781441 
1. For the special differential equation y' — f(x), this 0.5 | 0.00781441 | 0.03125763 
Runge-Kutta method becomes the Simpson formula (see 
19.3.2.3, p. 965). 0.5 0.01041858 





2. For a large number of integration steps, a change of step size is possible or sometimes necessary. The 
change of step size can be decided by checking the accuracy so that the step is repeated with a double 
step size 2h. If, e.g., the approximate value is y2(h) for y(a9+2h) (calculated by the single step size) and 
y»(2h) (calculated by the doubled step size), then the estimation for the error Ro(h) = y(xo4-2h) — ya(h) 
is 
1 

Ra(h) z geh -- ψα(2!ν)]- (19.100) 
Information about the implementation of the step size changes can be found in the literature (sce 
[19.24]). 
3. Runge-Kutta methods can easily be used also for higher-order differential equations, see [19.24]. 
Higher-order differential equations can be rewritten in a first-order differential equation system (see 
p. 550). Then, the approximation methods are performed as parallel calculations according to (19.99), 
as the differential equations are connected to each other. 


19.4.1.3 Multi-Step Methods 


The Euler method (19.97) and the Runge-Kutta method (19.99) are so-called single-step methods, since 
we start only from y; in the calculation of y;+1. In general, linear multi-step methods have the form 
Yi+k F Qk-1Yi+k-1 + Ak—2Yi+k—2 T 777 d 03i + CY 
= h( Br fi+k + Br-ifi+k-1 + + bifita + Bofi) (19.101) 
with appropriately chosen constants o; and 8; (j — 0,1,...,k; o — 1). The formula (19.101) is 
called a k-step method if |ag| -- |Bo| 4 0. It is called explicit, if 8j — 0, since in this case the values 
fij — f(xizj yis;) on the right-hand side of (19.101) only contain the already known approximation 
values Yj, Yiti,--- 5 Yitr-1- If By 4 0 holds, the method is called implicit, since then the required new 
value y;,4, occurs on both sides of (19.101). 
The k initial values yọ, y, .. ., yy-1 must be known in the application of a k-step method. These initial 
values can be got, e.g., by one-step methods. 
A special multi-step method to solve the initial value problem (19.93) can be derived if the derivative 
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y'(xi) in (19.93) is replaced by a difference formula (see 9.1.1.5, 1., p. 549) or if the integral in (19.95) 
is approximated by a quadrature formula (see 19.3.1, p. 963). 

Examples of special multi-step methods are: 

1. Midpoint Rule The derivative y'(x;+1) in (19.93) is replaced by the slope of the secant line between 
the interpolation nodes x; and z;,2, i.e.: 


ο ὅσα (19.102) 
2. Rule of Milne The integral in (19.95) is approximated by the Simpson formula: 
h 
Yi+2 — Yi = 3 T Afia fia). (19.103) 


3. Rule of Adams and Bashforth The integrand in (19.95) is replaced by the interpolation poly- 
nomial of Lagrange (see 19.6.1.2, p. 983) based on the & interpolation nodes ση, z;,1,... , v; 44-1. Inte- 
grating between x;4,_1 and x;,, results in 


k-i| εκ k-1 
Yi+k — Yitk-1 = >> / L(x) dx} f (wing, Ying) =h X Bif (Eiti vis) (19.104) 
3-0 [rij i j=0 


The method (19.104) is explicit for y;4,. For the calculation of the coefficients 3; see [19.2]. 
19.4.1.4 Predictor-Corrector Method 


In practice, implicit multi-step methods have a great advantage compared to explicit ones in that they 
allow much larger step sizes with the same accuracy. But, an implicit multi-step method usually re- 
quires the solution of a non-linear equation to get the approximation value y;,4. This follows from 
(19.101) and has the form 


k k-1 
Yirk =h >? Bi fits — 0 agyieg = λος (19.105) 

j-0 j-0 
The solution of (19.105) is an iterative one. The procedure is the following: An initial value y is 


determined by an explicit formula, the so-called predictor. Then it will be corrected by an iteration 
rule 
+1 
oe P (0,52 (19.106) 
which is called the corrector coming from the implicit method. Special predictor-corrector formulas 
are: 





L h = mt Bf- 1652 285), (19.107a) 
gt? = yet Ate -8} 50} (μ-ο,1,..3; (19.107b) 
2. vy = Yi- + Iyi — Iy: + bhl fi- + Fi), (19.108a) 
gt? = yia t Si 4) f) (μ-ο,1,..Λ. (19.108b) 


The Simpson formula as the corrector in (19.108b) is numerically unstable and it can be replaced, e.g., 
by 


h 
yt? — 0.9y a + 0.1y; + πο + 6.7 fi-1 + 30.7f; + 8.1/9). (19.109) 
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19.4.1.5 Convergence, Consistency, Stability 


1. Global Discretization Error and Convergence 
Single-step methods can be written generally in the form: 

Yii = Yi + hF (zi yi, h) (i —0,1,2,...; yo given). (19.110) 
Here F(x,y, h) is called the increment function or progressive direction of the single-step method. The 
approximating solution obtained by (19.110) depends on the step size h and it should be denoted by 
y(x, h). Its difference from the exact solution y(x) of the initial value problem (19.93) is called the global 
discretization error g(x, h) (see (19.111)), and we say: The single-step method (19.110) is convergent 
with order p if p is the largest natural number such that 

g(a, h) = y(x, h) — y(x) = O(h?) (19.111) 
holds. Formula (19.111) says that the approximation y(x, h) determined with the step size h = ΕἼ 

n 

converges to the exact solution y(x) for every x from the domain of the initial value problem if  — 0. 
B The Euler method (19.97) has order of convergence p = 1. For the Runge-Kutta method (19.99) 
p = 4 holds. 
2. Local Discretization Error and Consistency 
The order of convergence according to (19.111) shows how well the approximating solution y(x, h) ap- 
proximates the exact solution y(z). Beside this, it is an interesting question of how well the increment 
function F(z, y, h) approximates the derivative y' = f(x,y). For this purpose the so-called local dis- 
cretization error I(x, h) (see (19.112)) is introduced. The single-step method (19.110) is consistent with 
order p, if p is the largest natural number with 


y(x + h) — y(x) 


I(x, h) = 7 — F(x,y, h) = O(h?). (19.112) 
v 
It follows directly from (19.112) that for a consistent single-step method 
lim F(x,y, h) = f(x,y). (19.113) 
> 


WB The Euler method has order of consistency p — 1, the Runge-Kutta method in (19.99) has order of 
consistency p = 4. 
3. Stability with Respect to Perturbation of the Initial Values 
In the practical performance of a single-step method, a rounding error O(1/h) adds to the global dis- 
cretization error O(h?). Consequently, we have to select a not too small, finite step size h > 0. It is 
also an important question of how the numerical solution y; behaves under perturbations of the initial 
values or in the case z; — oo. 
In the theory of ordinary differential equations, an initial value problem (19.93) is called stable with 
respect to perturbations of its initial values if: 

g(x) — y(@)| € [do — vol. (19.114) 
Here g(x) is the solution of (19.93) with the perturbed initial value (xo) — jo instead of yg. Estimation 
(19.114) tells that the absolute value of the difference of the solutions is not larger than the perturbation 
of the initial values. 
In general, it is hard to check (19.114). Therefore the linear test problem 

y — Ay with y(xo) 2 yo (A constant, \ < 0) (19.115) 
is considered which is stable, and a single-step method is applied to this special initial value problem. 
A consistent method is called absolutely stable with step size h > 0 with respect to perturbed initial 
values if the approximating solution y; of the above linear test problem (19.115) obtained by using the 
method satisfies the condition 

lvi] < lyol- (19.116) 
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W Applying the Euler polygon method for equation (19.115) results in the solution yi41 — (1 4- A&)yi 
(i — 0,1,...). Obviously, (19.116) holds if |1 -- Ah| € 1, and so the step size must satisfy —2 < Ah < 0. 
4. Stiff Differential Equations 
Many application problems, including those in chemical kinetics, can be modeled by differential equa- 
tions whose solutions consist of terms converging to zero exponentially but in a high different kind of 
exponential decreasing. These equations are called stiff differential equations. For example: 

y(a) = Cie? - C58 9* — (Ci, C5, Ax, Az const) (19.117) 
with Ay < 0, A2 < 0 and |Ai| & |Ag|, e.g., Ar = —1, Az = —1000. The term with A2 does not have a 
significant affect on the solution function, but it does in selecting the step size h for a numerical method. 
In such cases the choice of the most appropriate numerical method has special importance (see [19.23]). 


19.4.2 Boundary Value Problems 


'The most important methods for solving boundary value problems of ordinary differential equations 
will be demonstrated on the following simple linear boundary value problem for a differential equation 
of the second order: 

y (x) + play (2) + 4(x)y(e) = fle) (a <x <b) with yla) = a, yb) = 2. (19.118) 
The functions p(x), q(x) and f(x) and also the constants a and £ are given. 


The given method can also be adapted for boundary value problems of higher-order differential equa- 
tions. 


19.4.2.1 Difference Method 
The interval [a, b] is subdivided by equidistant interpolation points x, = £o + vh (v = 0,1,2,...,n; 


Zo α, &n = b) and the values of the derivatives are substituted into the differential equation at the 
interior interpolation points 


y" (vy) + p(z,)y (xv) F (αν) (αν) = f(x) (v —1,2,...n— 1) (19.119) 
by so-called finite divided differences, e.g.: 


y (zv) e y, o 00 el (19.1202) 
2h 
v το 2 v Yv— 
y (zy) s yf = eR Ae Met ΕΣ (19.120b) 


In this way n — 1 linear equations are obtained for the n — 1 approximation values y, z y(x,) in the 
interior of the integration interval [a, b], considering the conditions yo — a and yn = £. If the boundary 
conditions also contain derivatives, they must also be replaced by finite expressions. 

Eigenvalue problems of differential equations (see 9.1.3.2, p. 569) are handled analogously. The ap- 
plication of the difference method, described by (19.119) and (19.120a,b), leads to a matrix eigenvalue 
problem (see 4.6, p. 314). 

E The solution of the homogeneous differential equation y” + My = 0 with boundary conditions 
y(0) = y(1) = 0 leads to a matrix eigenvalue problem. The difference method transforms the dif- 
ferential equation into the difference equation yy41 — 2yy + yp1 + h?y, = 0. If three interior points 
are chosen, hence h = 1/4, then considering yo = y(0) = 0, ys = y(1) = 0 the discrete system is 





M 
-9 i 4 ΕΞ 
( + τ) yı + Yo 0, 
X2 
1 25 yo + yz = 0 
yi + ( F i) ya ys > 
λ2 
+ (-2+—]y3 =0. 
Yo ( + τ) Ys 

This homogeneous system of equations has a non-trivial solution only when the coefficient determinant 
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is zero. This condition results in the eigenvalues 4,? — 9.37, A? — 32 and A3? = 54.63. Among them 
only the smallest one is close to its corresponding true value 9.87. 

Remark: The accuracy of the difference method can be improved by 

1. decreasing the step size h, 

2. application of a derivative approximation of higher order (approximations as (19.120a,b) have an 
error of order O(h?)), 

3. application of multi-step methods (see 19.4.1.3, p. 970). 

If the problem is a non-linear boundary value problem, then the difference method leads to a system of 
non-linear equations of the unknown approximation values y, (see 19.2.2, p. 961). 


19.4.2.2 Approximation by Using Given Functions 


The approximate solution of the boundary value problem (19.118) is a linear combination of suitably 
chosen functions g;(a), which are linearly independent and each one satisfies the boundary value con- 
ditions: 


y(x) © g(a) = D3 agi(x). (19.121) 
il 
Substituting g(x) into the differential equation (19.118) results in an error, the so-called defect 
£(2;01,02,..., a5) — g" (x) + p(x)g'(x) + q(x)g(x) — f(x). (19.122) 


To determine the coefficients a; the following principles (see also p. 978) can be used: 
1. Collocation Method The defect has to be zero at n given points x,, the so-called collocation 
points. The conditions 

€(@y301,02,...,Qn)=0 (v=1,2,...,n), a<a<22<...<an<b (19.123) 
result in a linear system of equations for the unknown coefficients. 
2. Least Squares Method The integral 





b 
F(a4,02,...,a4) — [9 01,03, ..., Q5) dz, (19.124) 
depending on the coefficients, should be minimal. The necessary conditions 
F 
" zi (32. iym) (19.125) 


give a linear system of equations for the coefficients a;. 
3. Galerkin Method The requirement is that the so-called error orthogonality is satisfied, i.e., 
b 
f «e 01,03,...,04)gi(z) dz — 0. (i— 1,2,...,m), (19.126) 
a 
and in this way a linear system of equations is obtained for the unknown coefficients. 
4. Ritz Method Thesolution y(x) often has the property that it minimizes the variational integral, 





b 
Ty) = f Hlev’) dx (19.127) 
(see (10.4), p. 610). If the function H (x, y, y) is known, then y(x) is replaced by the approximation 
g(x) as in (19.121) and I[y] ^ I(a1, a2, ..., a4) is minimized. The necessary conditions 
I 
i =0 (6=1,2,...,n) (19.128) 


result in n equation for the coefficients a;. 
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W Under certain conditions on the functions p, q, f and y, the boundary value problem 
-[p() Gy - a(x)y(x) — f(z) with yla) = a, y) = B (19.129 


and the variational problem 


Ily] = [won + q(x)y?(x) — 2f(x)y(a)| dx = min! with y(a) — o, y(b) — 8 (19.130 


are equivalent, so H(x,y,y’) can be got immediately from (19.130) for the boundary value problem o 
the form (19.129). 
Instead of the approximation (19.121), one often considers 








g(x) = go(x) - Y aigi(z), (19.131 
i=l 
where go(x) satisfies the boundary values and the functions g;(z) satisfy the conditions 
gila) = gi(b) =0 (6 =1,2,...,n). (19.132 
For the problem (19.118), an appropriate choice is, e.g., 
4. α 
glz) πας ^ (x — a). (19.133 
—a 


Remark: In a linear boundary value problem, the forms (19.121) and (19.131) result in a linear system 
of equations for the coefficients. In the case of non-linear boundary value problems non-linear systems 
of equations are obtained, which can be solved by the methods given in Section 19.2.2, p. 961. 


19.4.2.3 Shooting Method 


With the shooting method, the solution of a boundary value problem is reduced to the solution of an 
initial value problem. The basic idea of the method is described below as the single-target method. 
1. Single-Target Method 
The initial value problem 

y" 4 p(x)y' 4 q(x)y — f(x) with y(a) =a, y'(a) = 5 (19.134) 
is associated to the boundary value problem (19.118). Here s is a parameter, from which the solution 
y of the initial-value problem (19.134) depends on, i.e., y = y(x, s) holds. The function y(x, s) satisfies 
the first boundary condition y(a, s) = a according to (19.134). The parameter s should be determined 
so that y(x, s) satisfies the second boundary condition y(b, s) — 8. Therefore, one has to solve the 
equation 

F(s) = y(b, s) — 2, (19.135) 
and the regula falsi (or secant) method is an appropriate method to do this. It needs only the values of 
the function F(s), but the computation of every function value requires the solution of an initial value 
problem (19.134) until x = b for the special parameter value s with one of the methods given in 19.4.1. 


2. Multiple- Target Method 

In a so-called multiple-target method, the integration interval [a, b] is divided into subintervals, and we 
use the single-target method on every subinterval. Then, the required solution is composed from the 
solutions of the subintervals, where the continuous transition at the endpoints of the subintervals must 
be ensured. 

This requirement results in further conditions. For the numerical implementation of the multiple-target 
method, which is used mostly for non-linear boundary value problems, see. [19.24]. 
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19.5 Approximate Integration of Partial Differential 


Equations 
In this section only the principles of numerical solutions of partial differential equations are discussed 
using the example of linear second-order partial differential equations with two independent variables 
with the corresponding boundary or/and initial conditions. 


19.5.1 Difference Method 
A regular grid is considered on the integration domain by the chosen points (x, y,). Usually, this grid 
is chosen to be rectangular and equally spaced: 

Lyp=tot+ph, yw=ytvl (uv =1,2,...). (19.136) 
It results in squares for l = h. If the required solution is denoted by u(x, y), then the partial derivatives 
occurring in the differential equation and in the boundary or initial conditions are replaced by finite 
divided differences in the following way, where u,,, denotes an approximate value for the function value 
u(xy, y): 








Partial Derivative | Finite Divided Difference | Order of Error 

CN ee — a) Ot aruis = tuw) | O(h) or O(P) 

a, it) Τίμα — uus) or a noni — uy 1) O(l) or O(2) 

δεν, y.) gu enn Ὁ — pci, d Uy 1-1) O(hl) ee 
an Y) EP -δαμν Ἔ uy x) Olh?) 

Dune) (nett — D ct) OP) 


The error order in (19.137) is given by using the Landau symbol O. 

In some cases, it is more practical to apply the approximation 

Pu Up+ly+1 — 2u, v+1 7 Up-ly+l1 Upti — 2u, vt Up-ly 

Fae tm Ww) RO n2 F(1— c) 72 (19.138) 
with a fixed parameter ø (0 < ø < 1). Formula (19.138) represents a convex linear combination of two 
finite expressions obtained from the corresponding formula (19.137) for the values y = y, and y = y41. 








A partial differential equation can be rewritten as a difference equation at every interior point of the 
grid by the formulas (19.137), where the boundary and initial conditions are considered, as well. This 
system of equations for the approximation values up, has a large dimension for small step sizes h and 
l, so usually, it is solved by an iteration method (see 19.2.1.4, p. 960). 


W A: The function u(x, y) should be the solution of the differential 
equation Au = Uss +Uyy = —1 for the points (x, y) with |x| < 1, |y] « 2, 
i.e., in the interior of a rectangle, and it should satisfy the boundary 
conditions u = 0 for |x| = 1 and |y| = 2. The difference equation corre- 
sponding to the differential equation for a square grid with step size h 
is: Au, — uy di, T Up d usi, d uui h?. The step size h = 1 
(Fig. 19.6) results in a first rough approximation for the function val- 
ues at the three interior points: 4uo = 0 + 0 + 0 + uo, + 1, 4uo, = 
0 + uo, +0 ++ uo- +1, 4uo, -1 = 0 + uoo +0+0+41. 




















The solution is ug, = f ~ 0.429, uo, — up, 1 — i α 0.357. 


Figure 19.6 
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W B: Thesystem of equations arising in the application of the difference method for partial differential 
equations has a very special structure. It is demonstrated by the following example which is a more 
general boundary value problem. The integration domain is the square G: 0 << z € 1,0 € y € 1. 


A function u(x, y) should be determined for which Au = usr 





+ Uyy = f(x,y) in the interior of G, 


u(x,y) = g(x,y) on the boundary of G. The functions f and g are given. The difference equation 
associated to this differential equation is, for h = 1 = 1/n: 


Uppity F Uuwi F Upi, + 


ένα 4 


1 
tpv = f(m) (v=1.01). 


In the case of n = 5, the left-hand side of this system of difference equations for the approximation 


values Up, in the 4 x 4 in 
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if the grid is considered row-wise from left to right, and considering that the values of the function are 
given on the boundary. The coefficient matrix is symmetric and is a sparse matrix. This form is called 
block-tridiagonal. It is obvious that the form of the matrix depends on how the grid-points are selected. 
For different classes of partial differential equations of second order, such as elliptic, parabolic and 
hyperbolic differential equations, more effective methods have been developed, and also the convergence 
and stability conditions have been investigated. There is a huge number of books about this topic (see, 


e.g., [19.22], [19.24]). 


19.5.2 Approximation by Given Functions 


The solution u(x, y) is approximated by a function in the form 

n 
u(z, y) 55 υ(α, α) -- οὐ(α, ϱ) + E avil, y). (19.140) 

i-l 

Here, two cases are distinguished: = 

1. vo(r, y) satisfies the given inhomogencous differential equation, and the further functions v;(r, y) (i — 

1,2,...,n) satisfy the corresponding homogeneous differential equation (then the linear combination 

has to be found which approximates the given boundary conditions as well as possible). 

2. v9(x,y) satisfies the inhomogeneous boundary conditions and the other functions v;(x,y) (i = 

1,2,...,n) satisfy the homogeneous boundary conditions (then the linear combination has to be found 

which approximates the solution of the differential equation on the considered domain as well as pos- 

sible). 
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In both cases substituting the approximating function v(x, y) from (19.140) in the first case into the 
boundary conditions, in the second case into the differential equation results in an error term, the so- 
called defect: 

E = (L,Y; 441, dg,..., An). (19.141) 
To determine the unknown coefficients a; one of the following methods can be applied. 
1. Collocation Method 
The defect £ should be zero in n reasonably distributed points, at the collocation points (£p, yp) (V = 
1,2,.2- 8): 

£(2y,9v;01,02,...,04) 2-0. (v —1,2,...,n). (19.142) 
The collocation points in the first case are boundary points (it is called boundary collocation), in the 
second case they are interior points of the integration domain (it is called domain collocation). 
From (19.142) are obtained n equations for the coefficients. Boundary collocation is usually preferred 
to domain collocation. 
W This method is applied to the example solved in 19.5.1 by the difference method, with the functions 
satisfying the differential equation: 
U(x, Y; 41, 2,43) = —} (2? + y?) + ay + a(x? — y?) + a3(a4 — 6a?y? 4 μη). 
The coefficients are determined to satisfy the boundary conditions at the points (1,91) — (1,0.5), 
(12, 2) — (1, 1.5) and (£3, y3) = (0.5, 2) (boundary collocation). The linear system of equations 
—0.3125 +aı +0.75a2 — 0.4375a3 = 0, 
—0.8125 +aı — 1.25a2 — 7.4375a3 = 0, 
—1.0625 +aı — 3.75a2 + 10.0625a3 = 0 
has the solution a, — 0.4562, ag = —0.2000, a3 = —0.0143. The approximate values of the solution can 
be calculated at arbitrary points with the approximating function. To compare the values with those 
obtained by the difference method: v(0, 1) — 0.3919 and v(0, 0) = 0.4562. 
2. Least Squares Method 
Depending on whether the approximation function (19.140) satisfies the differential equation or the 
boundary conditions, it is required 
1. either the line integral over the boundary C 


[= / e7(a(t), y(t); a1,..-,@n) dt = min, (19.143a) 
(€) 








where the boundary curve C is given by a parametric representation x = x(t), y = y(t), 
2. or the double integral over the domain G 


T= I e? (x,y; a4,..., as) dz dy = min. (19.143b) 
(G) 


ll 
5 
=~ 
= 

ll 
d 
n 


From the necessary conditions, ...,2), n equations are obtained for computing the 





Qi 
parameters a4, d2,. .. , An- 


19.5.3 Finite Element Method (FEM) 


After the appearance of modern computers the finite element methods became the most important 
technique for solving partial differential equations. These powerful methods give results which are 
easy to interpret. 

Depending on the types of various applications, the FEM is implemented in very different ways, so 
here only the basic idea is given. It is similar to those used in the Ritz method (sce 19.4.2.2, p. 974) 
for numerical solution of boundary value problems for ordinary differential equations and is related to 
spline approximations (see 19.7, p. 996). 

The finite element method has the following steps: 
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1. Defining a Variational Problem A variational problem is formulated to the given boundary 
value problem. The process is demonstrated on the following boundary value problem: 


Au = Uge + Uyy = f in the interior of G, u = 0 on the boundary of G. (19.144) 
The differential equation in (19.144) is multiplied by an appropriate smooth function v(x, y) vanishing 
on the boundary of G, and it is integrated over the entire G to get 


Pu Pu 
I (23 * 35) vdedy = ff fo dz dy. (19.145) 
(G) (G) 


Applying the Gauss integral formula (see 13.3.3.1, 2., p. 725), where P(r, y) ^ —vu, and Q(r, y) — vu; 
are substituted in (13.121), the variational equation from (19.145) 

a(u, v) — b(v) (19.1462) 
is obtained with 


OuOv | OuOv 
a(u,v) — d ros + x) dxdy, b(v)— I fv da dy. (19.146b) 


y 





Figure 19.7 Figure 19.8 


2. Triangularization The domain of integration G is decomposed into simple subdomains. Usually, 
a triangularization is used, where G is covered by triangles so that the neighboring triangles have a 
complete side or only a single vertex in common. Every domain bounded by curves can be approximated 
quite well by a union of triangles (Fig. 19.7). 
Remark: To avoid numerical difficulties, the triangularization should not contain obtuse-angled tri- 
angles. 
W A triangularization of the unit square could be performed as shown in Fig. 19.8. Here one starts 
from the grid points with coordinates x, — uh, y, — vh (u,v — 0,1,2,..., N; h — 1/N). There are 
(N — 1)? interior points. Considering the choice of the solution functions, it is always useful to consider 
the surface elements G,,, composed of the six triangles having the common point (x, y,). (In other 
cases, the number of triangles may differ from six. These surface elements are obviously not mutually 
exclusive.) 
3. Solution A supposed approximating solution is defined for the required function u(x, y) in every 
triangle. A triangle with the corresponding supposed solution is called a finite element. Polynomials 
in z and y are the most suitable choices. In many cases, the linear approximation 

ü(x,y) — ay 4- ax + azy (19.147) 
is sufficient. The supposed approximating function must be continuous under the transition from one 
triangle to neighboring ones, so a continuous final solution arises. 
'The coefficients a4, a? and ag in (19.147) are uniquely defined by the values of the functions ui, u» 
and ua at the three vertices of the triangle. The continuous transition to the neighboring triangles is 
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ensured by this at the same time. The supposed solution (19.147) contains the approximating values 
u; of the required function as unknown parameters. For the supposed solution, which is applied as an 
approximation in the entire domain G for the required solution u(x, Ὁ), 

N-1N-1 

ula, y) = 5 5 Op Up (2, y). (19.148) 

μ-1 ν-ι 
is chosen. The appropriate coefficients a,, are determined. The following must be valid for the func- 
tions u,, (r, y): They represent a linear function over every triangle of G,,, according to (19.147) with 
the following conditions: 

1 fork=p,l=y, 

1. urs go) = { 0 atany other grid point of G,,. 


2. uy,,(z,y) 50 for (vr,y) Z Gu- (19.149b) 


(19.1492) 


The representation of u,,(r,y) over G,, is shown in 
Fig. 19.9. 

The calculation of u,, over Gy, i.e., over all triangles 1 
to 6 in Fig. 19.8 is shown here only for triangle 1: 


Up (®,Y) = a1 + agx + az with (19.150) 
1 for £ = £u, Y =Y, 

Up (x,y) = 4 0 for z — z,4,y — yi (19.151) 
0 for £ = Tp, Y = Yv- 


From (19.151) a1 = 1 — v,a2 = 0, ag = 1/h, follow and 
so for triangle 1 


ulz, y) =1+ (s = v) (19.152) 





Figure 19.9 


Analogously, there is: 


bell Ἔ n) + (S — v) for triangle 2, 
h h 


[= (z — u for triangle 3, 
1 
Up (x,y) = 4 1- 6 -— v) for triangle 4, (19.153) 
14 ( = D + E = v) for triangle 5 
h h 
i+ (= = D for triangle 6. 


4. Calculation of the Solution Coefficients The solution coefficients a,, are determined by the 
requirements that the solution (19.148) satisfies the variational problem (19.146a) for every solution 
function upv, i.e., (x,y) is substituted for u(x, y) and uy,(x, y) for v(x, y) in (19.146a). In this way, a 
linear system of equations 

N-1N-1 

» 5 amalu, un) = blun) | (k,1—1,2,..., N — 1) (19.154) 

μι νι 


is obtained for the unknown coefficients, where 


Ou, Oui n Ouk — ; B i 
α(!μν» υκι) E ὃς Br ur oe) dxdy, b(uxz) =I fur dx dy. (19.155) 
kl 
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In the calculation of a(uyv, ux) the fact must be kept in mind that integration is needed only in the 
cases of domains G, and Gy, with non-empty intersection. These domains are denoted by shadowing 
in Table 19.1. 


Table 19.1 Auxiliary table for FEM 
















































































Surface Graphical Triangle of | ður | ϑυμν Oui Ou, 
region representation | Gi Gy, δα δα Ox Ox 
11 0 0 
2 2 —1/h | —1/h 
pH4- k 3 3 —l/h | —1/h 4 
"v-l 4 4 0 0 h2 
5 5 /h L/h 
6 6 /h 1/h 
3 H= k 1 5 0 | 1/h 0 
ν-ἰ-ι 2 4 —1/h 0 
3 =k+1 2 6 —1/h | 1/h 2 
“pal ὃ ὅ —l1/h | l/h 13 
4“ k+1 3 1 —1/h 0 0 
'v=l+1 4 6 0 | 1/h 
εμ-ἰὼ 4 2 0 | I/h 0 
” y=l+1 5 1 —1/h 0 
6 p=k-1 5. 3 l/h | —1/h 2 
“ned ΑΝ 6 2 1/h | -1/h E 
LH- k—1 6 4 L/h 0 0 
"v-i-1 a / 1 3 0 | —1/h 
The integration is always performed over a triangle with an area h?/2, so for the partial derivatives = 


with respect to x: 


1 h? 
το (4αμι = 20411 = 20%-1,1) t (19.156a) 
h2 , "2 

is obtained. Analogously, for the partial derivatives with respect to y the corresponding term is 
1 . h? s 
2 (4o — 20,141 — 20,1) z (19.156b) 
1 
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The calculation of the right-hand side b(ugz) of (19.154) gives: 
blur) = / f(a, yuna, ἡ) ἀν ἀν πε Πιν», (19.157a) 
Gr 


where Vp is the volume of the pyramid over Gyw, with height 1, determined by μία.) (Ε16. 19.9). 
Since 


1 1 
Vp = 3! 6- ahs the approximation is b(uyi) 56 fak. (19.157b) 
So, the variational equations (19.154) result in the linear system of equations 
4αμι-- O41, — Ok-14 — Ok41 — Qk- = h? fri (k,l =], 2, ...,N- 1) (19.158) 


for the determination of the solution coefficients. 


Remarks: 

1. If the solution coefficients are determined by (19.158), then ü(v, y) from (19.148) represents an 
explicit approximating solution, whose values can be calculated for an arbitrary point (x, y) from G. 
2. Iftheintegration domain must be covered by an irregular triangular grid then it is useful to introduce 
triangular coordinates (also called barycentric coordinates). In this way, the position of a point can 
be easily determined with respect to the triangular grid, and the calculation of the multidimensional 
integral is made easier as in (19.155), because every triangle can be easily transformed into the unit 
triangle with vertices (0,0), (0,1), (1,0). 

3. If accuracy must be improved or also the differentiability of the solution is required, then piecewise 
quadratic or cubic functions must be applied to obtain the supposed approximation (see, e.g., [19.22]). 
4. In practical applications, usually systems of huge dimensions are obtained. This is the reason why 
so many special methods have been developed, e.g., for automatic triangularization and for practical 
enumeration of the elements (the structure of the system of equations depends on it). For detailed 
discussion of FEM see [19.13], [19.7], [19.22]. 


19.6 Approximation, Computation of Adjustment, Harmonic 
Analysis 
19.6.1 Polynomial Interpolation 


The basic problem of interpolation is to fit a curve through a sequence of points (r,, y,) (v — 0,1, 
.,n). This can happen graphically by any curve-fitting gadget, or numerically by a function g(x), 
which takes given values y, at the points x, , at the so-called interpolation points. That is g(x) satisfies 
the interpolation conditions 
g(r,) 2^ y» (v—0,1,2,...,n). (19.159) 
In the first place, polynomials are used as interpolation functions, or for periodic functions so-called 
trigonometric polynomials. In this last case one talks about trigonometric interpolation (see 19.6.4.1, 
2., p. 992). There are n + 1 interpolation points, the order of the interpolation is n, and the highest 
degree of the interpolation polynomial is at most n. Since with increasing degree of the polynomials, 
strong oscillation may occur, which is usually not required, the interpolation interval can be decom- 
posed into subintervals and a spline interpolation (see 19.7, p. 996) can be performed. 


19.6.1.1 Newton’s Interpolation Formula 
To solve the interpolation problem (19.159) a polynomial of degree n is considered in the following form: 
g(2) = palz) = ao + a(z — 2o) -Ε-αὐίο -- σο)(α -- αι) 3ε---- 
+an(x — zo)(x — x1)... (£ — £n-1). (19.160) 
This is called the Newton interpolation formula, and it gives an easy calculation of the coefficients 
à; (i — 0,1,...,n), since the interpolation conditions (19.159) result in a linear system of equations 
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with a triangular matrix. 


W For » — 2 one gets the annexed sys- 
tem of equations from (19.159). Theinter-  p»(ro) = ao =y% 
polation polynomial p„(x) is uniquely de- pə(x1) = ao + a1(z1 — 20) = 














termined by the interpolation conditions pə(£2) = ao + a1(£2 — £0) + az(£2 — To)(£2 — £1) — ya 
(19.159). 

The calculation of the function values can be simplified by the Horner schema (see 19.1.2.1, p. 952). 
19.6.1.2 Lagrange’s Interpolation Formula 


A polynomial of n-th degree can be fitted through n 4- 1 points (rj, y,) (v — 0,1,...,n), with the 
Lagrange formula: 


glx) = pr(x) = 5 yu Lu (x). (19.161) 
μ-0 


Here L,(x) (u = 0,1,...,n) are the Lagrange interpolation polynomials. Equation (19.161) satisfies 
the interpolation conditions (19.159), since 





SA e — fl forp=y, 
Ls da {9 aa (19.162) 
Here ô, is the Kronecker symbol. The Lagrange interpolation polynomials are defined by the formula 
L= (z — To)(£ — z1): (£ — £p1)(£ — αμ) τα ο) . Π oy (19.163) 
(zy, — xo) (zy — 21) (zy — tyi) (Xp — tui) 7 (x — 22) szo Τμ ty 
E ; : r|01 3 
W ^ polynomial is fitted through the points given by the table vii 3 2° 


The Lagrange interpolation formula (19.161) is used: 
_ (@- 1)(x — 3) 1 


ae co a θ΄“ 
us S NOM 
L(x) = G OGT) = -g(x — 1); 


5 17 
p(x) = 1- Lo(a) + 5: Ιη(α) +2- La(x) = τ ρα + ee +1. 


The Lagrange interpolation formula depends explicitly and linearly on the given values y, of the func- 
tion. This is its theoretical importance (see, e.g., the rule of Adams-Bashforth, 19.4.1.3, 3., p. 971). 
For practical calculation the Lagrange interpolation formula is rarely reasonable. 


19.6.1.3 Aitken-Neville Interpolation 


In several practical cases, the explicit form of the polynomial p(x) is not needed, but only its value at 
a given location x of the interpolation domain. These function values can be obtained in a recursive 
way developed by Aitken and Neville. The useful notation 


Da (2) — poa,..n (2). (19.164) 
is applied in which the interpolation points xo, z1,..., x, and the degree n of the polynomial are de- 
noted. Notice that 


Po... (2) — (x — zo)pig,..n (2) — (x — Tn) POA Asn UC) (19.165) 


Ln — Lo 
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ie. the function value po3,.,^(r) can be obtained by linear interpolation of the function values of 
D13,..,n (1) and poa 5.5 1(x), two interpolation polynomials of degree € n — 1. Application of (19.165) 
leads to a scheme which is given here for the case of n — 4: 


to | Yo = Po 
Tı | Yı 5 Pı Por 
T2 | Y2 = P2 Pı2 Po12 (19.166) 


T3 | Y3 = P3 P23 P123 P0123 
T4 | Y4 = P4 P34 P234 P1234 P01234 = pa(x). 


The elements of (19.166) are calculated column-wise. A new value in the scheme is obtained from its 
west and north-west neighbors 














(x — x2)p3 — (a — x3)po T — T3 
P23 = ) ( τες pa - —— — (ps — pa). (19.167a 
X3 — 22 X3 — 239 
£ — £1)po3 — (x — xa)p 2-2 . 
P123 = ( ) 23 C s) = Pat- 3 (poa pio); (19.167b 
z3 — Tı πα πι 
£ — £1)po3a — (x — z4)p: T-T 
P1234 = ( x)pasa — ( a)pızs = P234 + i (p234 — p123). (19.167c 
24— σι 34— 44 


For performing the Aitken-Neville algorithm on a computer only a vector p with n+ 1 components (see 

[19.4]) is introduced, which takes the values of the columns in (19.166) after each other according to 

the rule that the value pi; x41, (6 — k, k- 1,..., n) ofthe k-th column will be the i-th componen 

pi of p. The columns of (19.166) must be calculated from down to the top, so p contains all necessary 

values. The algorithm has the following two steps: 

1. For i—0,1,...,n set p; — yi. (19.168a. 
zy — m 


2. For k — 1,2,...,n andfor i 2 n,n — 1,..., k compute p; — p; 4- ————— (pi — pi-1).(19.168b 
Ti — Tick 





After finishing (19.168b) we have the required function value p, (x) at x in element p,. 


19.6.2 Approximation in Mean 


The principle of approximation in mean is known as the Gauss least squares method. In calculations 
continuous and discrete cases are distinguished. 


19.6.2.1 Continuous Problems, Normal Equations 
The function f(x) is approximated by a function g(x) on the interval {a, b] so that the expression 


F= Jero — g(x)]? dz, (19.169) 


depending on the parameters contained by g(x), should be minimal. w(x) denotes a given weight func- 
tion, such that w(x) > 0 in the integration interval. 
If the best approximation g(x) is supposed to have the form 





g(v) — X aigla) (19.170) 
i=0 
with suitable linearly independent functions go(x), g1(x), ... , ga (x), then the necessary conditions 
OF 
ο (19.171) 
Oa; 


for an extreme value of (19.169) result in the so-called normal system of equations 
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Magi 9) — (figs) (k —0,1,...,m) (19.172) 
i=0 
to determine the unknown coefficients a;. Here the brief notations 
b 
(gi, gk) — [ ooo) dx, (19.173a 
b 
(f, gk) = f «fen dx (i,k =0,1,...,n) (19.173b 


are used, which are considered as the scalar products of the two indicated functions. 

The system of normal equations can be solved uniquely, since the functions go(x), gi(x), ..., ga( 
are linearly independent. The coefficient matrix of the system (19.172) is symmetric, so the Cholesky 
method (see 19.2.1.2, p. 958) can be applied. The coefficients a; can be determined directly, withou 
solving the system of equations, if the system of functions g;(2) is orthogonal, that is, if 





(gi, gk) — 0 for i z k. (19.174 
It is called an orthonormal system, if 

die {  .΄-᾿᾿. ο (19.175 
With (19.175), the normal equations (19.172) are reduced to 

αι —(f,gi) (¢=0,1,...,n). (19.176 
Linearly independent function systems can be orthogonalized. From the power functions gi(a) = αἱ 
(i = 0,1,...,n), depending on the weight function and on the interval, the orthogonal polynomials in 


Table 19.2 can be obtained. 
Table 19.2 Orthogonal polynomials 

















| ία, b] w(x) | Name of the polynomials | see p. | 
[-1, 1] 1 Legendre polynomial P,(x) | 566 
1 
[-1,1] V-3 Chebysev polynomial T;,(x) | 989 (19.177) 
[0, oo) e^ Laguerre polynomial L,(x) | 568 
(—00,00) | e-7'? Hermite polynomial H,(x) | 568 





These polynomial systems can be used on arbitrary intervals: 

1. Finite approximation interval. 

2. Approximation interval infinite at one end, e.g., in time-dependent problems. 
3. Approximation interval infinite at both ends, e.g., in stream problems. 

Every finite interval [a, b] can be transformed by the substitution 

_b+a E b—a 
ΠΡ 2 
into the interval [—1, 1]. 


19.6.2.2 Discrete Problems, Normal Equations, Householder's Method 
Let N pairs of values (x, y,) be given, e.g., by measured values. A function g(x) has to be determined, 
whose values g(z,) differ from the given values y, in such a way that the quadratic expression 








x 


t (x€][a,b], t e [-1,1]) (19.178) 


F=} lw- g(a)? (19.179) 
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is minimal. The value of F depends on the parameters contained in the function g(x). Formula (19.179) 
represents the classical sum of residual squares. The minimization of the sum of residual squares is called 





OF 
the least squares method. From the assumption (19.170) and the necessary conditions =0 (i= 


αι 
0, 1,...,n) for a relative minimum of (19.179) for the coefficients the normal system of equations is 
obtained: 
> algae] = [ug] (& — 0,1... m). (19.180) 
i=0 


Here the Gaussean sum symbols are used in the following notation: 


[σι0ε] 2X ν)θε(αν) (19.181a) 


[vg] -Y uala) (5E 0,1, sun (19.181b) 


Usually, n «& N. 

Wl For the polynomial g(x) — ao -- az ^: - - -- a2", the normal equations are ao[z^] 4- a1 [3**!] I - -- 4 
as|r**^] — [r*y] (k 9 0,1,...,n) with [z*] = DX a v,*, [e] = N, [ety] = E rty, y] = 
XN a yy. The coefficient matrix of the normal system of equations (19.180) is symmetric, so for the 
numerical solution the Cholesky method can be applied. 


The normal equations (19.180) and the residue sum square (19.179) have the following compact form: 


G'Ga-Gl'y, F-(y-Ga)'(y- Ga) with (19.1822) 
go(x1) (x1) θείο) -... gs) γι ag 
go(t2) οι.)  ga(v2) ...  ga(v2) Va di 

α-- | golts) οι.)  ga(vs) ... ο.) |, y — | vs ac |a (19.182b) 
golan) gi(uw) ge(vw) ... ga(tw) UN απ 


If, instead of the minimalization of the sum of residual squares, one wants to solve the interpolation 
problem for the N points (x, y,), then the following system of equations should be solved: 

Ga — y. (19.183) 
This system of equations is over-determined in the case of n < N — 1, and usually it does not have any 
solution. The equations (19.180) or (19.182a) are obtained by multiplying (19.183) by GT. 
From a numerical viewpoint, the Householder method (see 4.5.3.2, 2., p. 314) is recommended to solve 
equation (19.183), and this solution results in the minimal sum of residual squares (19.179). 


19.6.2.3 Multidimensional Problems 
1. Computation of Adjustments 


Suppose that there is a function f (x1, £2, . . . , &n) Of n indepen- 
dent variables z4, 15, ... , r4. Its explicit form is not known; 
only N substitution values f, are given, which are, in general, 
measured values. These data can be written in a table (see 
(19.184)). 

The formulation of the adjustment problem is clearer by intro- 
ducing the following vectors: 


(19.184) 
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x = (T1, £2,- -;, En)! Vector of n independent variables, 
x) = (2 2... a) + Vector of the v-th interpolation node (v = 1,..., N), 
f =(fi,fo,---fw)™ : Vector of the N function values at the N interpolation nodes. 
f (21,22, ..., v5) — f(x) is approximated by a function of the form 
m 
g(21,22,...,24) — Y aigi(ni, ο ος (19.185) 
i=0 


Here, the m 4- 1 functions g;(z1, 12, ..., v5) — g;(X) are suitable, selected functions. 
W A: Linear approximation by n variables: g(11,29,...,24) — à d- à421 + a282 +-+ + AnTn- 
W B: Complete quadratic approximation with three variables: 
η(σι, 01, 79) Ξαρ-γαιτι-Γ αοτο + A3X3 + ΠΠ + 52” + αρα + A721 XQ + AgX1X3 αρ Το. 
The coefficients are chosen to minimize X7 , [ h-9 (ai, a... atr. 


2. Normal System of Equations 
Analogously to (19.182b) the matrix G is formed in which the interpolation nodes x, are replaced by 


vectorial interpolation nodes x” (v = 1,2,...,N). To determine the coefficients, the normal system 
of equations 

G7TGa — GTf (19.186) 
can be used or the over-determined system of equations 

Ga =f. (19.187) 


W For an example of multidimensional regression see 16.3.4.3, 3., p. 842. 


19.6.2.4 Non-Linear Least Squares Problems 

The main idea is discussed for a one-dimensional discrete case. The approximation function g(a) de- 
pends non-linearly on certain parameters. 

W A: g(x) = ape™* + age™*. This expression does not depend linearly on the parameters a; and a3. 
Β B: g(x) = ape" cos ax. This function does not depend linearly on the parameters a; and ag. 


The fact that the approximation function g(x) depends on a parameter vector a = (ao, @1,---,@n)* is 
indicated by the notation 
g = g(x,a) = g(£; ao, 1, . . - , an). (19.188) 
Suppose, N pairs of values (z,, y,) (v — 1,2,..., N) aregiven. To minimize the sum of residual squares 
N 
Ml — g(a; a, a1,...,Gn)]? = F (ao, a1, ..., Gn) (19.189) 
v=1 





un OF ] . : ; 
the necessary conditions — 0 (i — 0,1,...,n) lead to a non-linear normal equation system which 
ai 


1 
must be solved by an iterative method, e.g., by the Newton method (see 19.2.2.2, p. 962). 
Another way to solve the problem, which is usually used in practical problems, is the application of 
the Gauss-Newton method (see 19.2.2.3, p. 962) given for the solution of the non-linear least squares 
problem (19.24). The following steps are needed to apply it for this non-linear approximation problem 
(19.189): 


1. Linearization of the approximating function g(x, a) with the help of the Taylor formula with respect 


to a;. To do this, the approximation values a (i — 0,1,...,n) are needed: 


g(a, a) 55 δ(υ, a) = glz, a®) + > δα a9) (a; — al). (19.190) 
i-o €" 
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2. Solution of the linear minimum problem 


Y — j(z,, a)? — min! (19.191) 
v=1 
with the help of the normal equation system 
G"™GAa = GTAy (19.192) 
or by the Householder method. In (19.192) the components of the vectors Aa and Ay are given as 
Aa; = a; — a (i — 0,1,2,...,n) and (19.193a) 
Ay, — y, — g(x,, a9) (v —1,2,..., N). (19.193b) 


The matrix G can be determined analogously to G in (19.182b), where g;(r,) are replaced by 
0g 
Oa; 
3. Calculation of a new approximation 

al? = al) + Aa; or al? = al + Aa; (0.5: 0,1,2,..., 0), (19.194) 
where y > 0 is a step length parameter. 


(z,, a9) (i 20,1,...,m v 21,2,..., N). 


By repeating steps 2 and 3 with a instead of a), etc. a sequence of approximation values is ob- 
tained for the required parameters, whose convergence strongly depends on the accuracy of the initial 
approximations. The value of the sum of residual squares can be reduced with the introduction of the 
multiplier 7. 


19.6.8 Chebyshev Approximation 


19.6.3.1 Problem Definition and the Alternating Point Theorem 
1. Principle of Chebyshev Approximation 


Chebyshev approximation or uniform approximation in the continuous case is the following: The func- 
tion f(a) has to be approximated in an interval à € x < b by the approximation function g(x) = 
g(x; a9, 01, ..., 44) so that the error defined by 


max |f(x) — g(z; ao, a1, ..., a4)| 7 9(ao, a1, ..., a4) (19.195) 
a<r<b 
should be as small as possible for the appropriate choice of the unknown parameters a; (i = 0,1,..., n). 


If there exists such an approximating function for f(x), then the maximum of the absolute error value 
will be taken at least at n+ 2 points x, of the interval, at the so-called alternating points, with changing 
signs (Fig. 19.10). This is actually the meaning of the alternating point theorem for the characteriza- 
tion of the solution of a Chebyshev approximation problem. 





Figure 19.10 


E Ifthefunction f(x) — z" is approximated on the interval [—1, 1] by a polynomial of degree € n — 1 
in the Chebyshev sense, then the Chebyshev polynomial T,,() is obtained as an error function whose 
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maximum is normed to one. The alternating points, being at the endpoints and at exactly n — 1 points 
in the interior of the interval, correspond to the extreme points of T; (z) (Fig. 19.11a-f). 
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Figure 19.11 


19.6.3.2 Properties of the Chebyshev Polynomials 
1. Representation 
Ta(x) = cos(n arccos Ὁ), (19.196a 
1 n n 
ay es haw = a ape = 
Tala) = 3 [(« Ην 1) + (a T 1) |. (19.196b 
„>; _ f cosnt, « = cost for|e| <1, -. 3 
Τε (α) τ- , x=cosht for |x| >1 (n=1,2,...). ΠΕ 
2. Roots of T, (x) 
2u—1 
αμ = cos CH= D (Ii 1,2, n) (19.197 
n 
3. Position of the extreme values of T, (x) for x € [—1, 1] 





νπ 
Ly =cos— (v —0,1,2,...,n). (19.198 
n 


4. Recursion Formula 
Tai = 22T, (4) - Ta i(z) (n212,...; To(z) 2 1, Τι(α) -- α). (19.199 
This recursion results in e.g. 


T(x) — 2a? —1, Τη(α) -- Δαῦ — 32, (19.200a 
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Τι(α) -- 8a* — 827 +1, Τε(α) -- 16α5 -- 20α7 + 52, (19.200b) 
Te(x) — 3215 — 48x* -- 182? — 1, (19.200c) 
Tr(a) — 64z* — 1122? + 562° — 7o, (19.200d) 
Ta(v) — 128x5 — 25629 + 16024 — 32x? + 1, (19.200€) 
To(v) — 256x9 — 51627 -- 432x5 — 120x? + 9x, (19.200f) 
Tio(z) — 512x? — 128025 + 11202 — 4002 ἠ- 5047 -- 1. (19.200g) 


19.6.3.3 Remes Algorithm 


1. Consequences ofthe Alternating Point Theorem 
The numerical solution of the continuous Chebyshev approximation problem originates from the alter- 
nating point theorem. The approximating function is chosen as 


(x) = Yuan) (19.201) 


with n 4- 1 linearly independent known functions, and the coefficients of the solution of the Chebyshev 
problem are denoted by a;* (i = 0,1,...,n) and the minimal deviation according to (19.195) by o = 
P(ao*,a1*,...,@n*). In the case when the functions f and gi (i — 0,1,...,n) are differentiable, from 
the alternating point theorem one has 


Yan) +(-1)’0 = f(a), Ya gile) = f'(a) (»21,2,...,n 4*2). (19.202) 


The nodes x, are the alternating points a 

Q X i1 € 22 € ... X $q43 € b. (19.203) 
The equations (19.202) give 2n + 4 conditions for the 2n + 4 unknown quantities of the Chebyshev 
approximation problem: n + 1 coefficients, n + 2 alternating points and the minimal deviation o. If 
the endpoints of the interval belong to the alternating points, then the conditions for the derivatives 
are not necessarily valid there. 
2. Determination of the Minimal Solution according to Remes 
According to Remes, one proceeds with the numerical determination of the minimal solution as follows: 
1. An approximation of the alternating points z, (v =1,2,...,n+2) are determined according to 
(19.203), e.g., equidistant or as the positions of the extrema of T;,41(«) (see 19.6.3.2, p. 988). 


2. The linear system of equations 
So aigi(ay) +(-1)’o= f(a,) (v=1,2,...,n+2) 


is solved and the solutions are the approximations a; (i = 0,1,...,n) and go. 

3. A new approximation of the alternating points x,“ (v = 1,2,...,n + 2) is determined, e.g., as 
n 

positions of the extrema of the error function f(r) — Y; a/9g;(1). Now, it is sufficient to apply only 
i=0 

approximations of these points. 

By repeating steps 2 and 3 with 2, and a;“) instead of «, and a;, etc. a sequence of approx- 

imations is obtained for the coefficients and the alternating points, whose convergence is guaranteed 


under certain conditions, which can be given (see [19.25]). The calculations are stopped if, e.g., from 
a certain iteration index ji 


lou] = max | f(x) - 9; ai" g(z) (19.204) 
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holds with a sufficient accuracy. 


19.6.3.4 Discrete Chebyshev Approximation and Optimization 


From the continuous Chebyshev approximation problem 
n 
f(a) - £ aigi(x) 
i=0 


the corresponding discrete problem can be got, if requiring N nodes z, (v = 1,2,...,N; N > n+ 2) 
are chosen with the property a € z1 «€ z3--: « zy € b and requiring 


n 
ES l (ου) - 3, igi) 


The substitution 


max 
a<r<b 


= min! (19.205) 








=min!. (19.206) 











y= max |f(x,) — Y aigi(v,)|. (19.207) 
v=i,2,...N = 
has obviously the consequence 
f (av) — So aigi(ay) <A (=L 2 aN): (19.208) 
i=0 








ES 


Eliminating the absolute values from (19.208) a linear system of inequalities is obtained for the coefi 
cients a; and y, so the problem (19.206) becomes a linear programming problem (see 18.1.1.1, p. 909): 


V+ Lage.) > fer), 





y= min! subject to (v. ο N): (19.209) 


vy- Een) 2 - Fer) 


Equation (19.209) has a minimal solution with y > 0. For a sufficiently large number N of nodes and 
with some further conditions the solution of the discrete problem can be considered as the solution of 
the continuous problem. 


n 
If instead of the linear approximation function g(x) — Y; a;gi(x) a non-linear approximation function 
i=0 


g(x) = g(x; a0, 41,...,@,) is used, which does not depend linearly on the parameters a, a1, . . . , à, 
then analogously a non-linear optimization problem is obtained. It is usually non-convex even in the 
cases of simple function forms. This essentially reduces the number of numerical solution methods for 
non-linear optimization problems (see 18.2.2.1, p. 926). 





992 19. Numerical Analysis 





19.6.4 Harmonic Analysis 


A periodic function f(a) with period 27, which is given formally or empirically, should be approximated 
by a trigonometric polynomial or a Fourier sum of the form 


g(x) = 7 + So (a, cos ka + by sin kx), (19.210) 
ki 


where the coefficients ag, a, and by are unknown real numbers. The determination of the coefficients is 
the topic of harmonic analysis. 


19.6.4.1 Formulas for Trigonometric Interpolation 


1. Formulas for the Fourier Coefficients 


Since the function system 1, coskx, sinka (k = 1,2,...,n) is orthogonal in the interval [0,27] with 
respect to the weight function w = 1, the formulas for the coefficients are obtained as 


2π 2π 
Y 1 
ap = -= f ft) coskx dr, bk = = f F) sinkrvdz  (k —0,1,2,...,n) (19.211) 
π π 
0 0 


by applying the continuous least squares method according to (19.172). The coefficients a; and by 
calculated by formulas (19.211) are called Fourier coefficients of the periodic function f(a) (see 7.4, 
p. 474). 


If the integrals in (19.211) are complicated or the function f(a) is known only at discrete points, then 
the Fourier coefficients can be determined only approximately by numerical integration. 


Using the trapezoidal formula (see 19.3.2.2, p. 964) with N + 1 equidistant nodes 
20 


ὧν = vh =0,1,...,N), A= 
x, — vh (v ), A N 


(19.212) 


the approximation formula 


N N 
ak © ük = X flav) coskay, bk œ brk = x f(a,)sinka, (k=0,1,2,...,n) (19.213) 
v=1 v=1 


is obtained. The trapezoidal formula becomes the very simple rectangular formula in the case of peri- 
odic functions. It has higher accuracy here as a consequence of the following fact: If f(x) is periodic 
and (2m + 2) times differentiable, then the trapezoidal formula has an error of order O(h?™*?). 


2. Trigonometric Interpolation 

Some special trigonometric polynomials formed with the approximation coefficients à; and b, have 
important properties. Two of them are mentioned here: 

1. Interpolation Suppose N = 2n holds. The special trigonometric polynomial 


n=l 


1 ~ 1 
q(x) = z% + So (Gq cos ka + by sin ka) + gan cos na: (19.214) 
[πει 


with coefficients (19.213) satisfies the interpolation conditions 

Jlrs) — f (xv) (=l DaN) (19.215) 
at the interpolation nodes x, (19.212). Because of the perodicity of f(x) f(xo) = f(xy) holds. 
2. Approximation in Mean Suppose N = 2n. The special trigonometric polynomial 


1 m > 
(2) — 5o Ε Y (à, cos ka: 4- by sin kar) (19.216) 
k=1 
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for m « n and with the coefficients (19.213) approximates the function f(z) in discrete quadratic mean 
with respect to the N nodes x, (19.212), that is, the residual sum of squares 


Ex fes ds (m) (19.217) 


is minimal. The formulas (19.213) are the originating point for the different ways of effective calculation 
of Fourier coefficients. 


19.6.4.2 Fast Fourier Transformation (FFT) 


1. Computation costs of computing Fourier coefficients 
The sums in the formulas (19.213) also occur in connection with discrete Fourier transformation, e.g., 
in electrotechnics, in impulse and picture processing. Here N can be very large, so the occurring sums 
must be calculated in a rational way, since the calculation of the N approximating values (19.213) of 
the Fourier coefficients requires about N? additions and multiplications. 
For the special case of N = 2”, the number of multiplications can 

be largely reduced from N?(= 27?) to pN (= p2?) with the help of p 
the so-called fast Fourier transformation FFT. The magnitude of 
this reduction is demonstrated on the example on the right-hand 
side. 


Ν2 






pN 


~ 104 
~ 107 






(19.218) 


By this method, the computation costs and computation time are reduced so effectively that in some 
important application fields even a smaller computer is sufficient. 

The FFT uses the properties of the N-th unit roots, i.e., the solutions of equation z'" = 1 to a successive 
sum up in (19.213). 
2. Complex Representation of the Fourier Sum 
The principle of FFT can be described fairly easily if the Fourier sum (19.210) is rewritten with the 
formulas 








ect 1 Κα -ἱΚα s ect i —ikr ikx 
cos ka = 7 (c +e ) , sinka = 5 (e —e ) (19.219 
into the complex form 
1 z à 1 T fak—ibk ike Gk + idk ikr 
g(x) — zao - Y (ap cos kx + bp sin kz) = žao + X | ~~—e — e . (19.220 
2 k=l 2 kel 2 2 
By substitution 
Gp — ibe 1 2n im 
act ι (19.221a) because of (19.211) ck = z Í f (z)e * dx, (19.221b 
m Jo 
so (19.220) becomes the complex representation of the Fourier sum: 
ο(ῳ) - Σ) ee" with c — δι. (19.222 
k-—n 
If the complex coefficients c; are known, then the required real Fourier coefficients can be got in the 
following simple way: 
ag — 2c, aj —2Re(c), by — —2Im(e) (k— 1,2,...,n). (19.223 


3. Numerical Calculation of the Complex Fourier Coefficients 
For the numerical determination of c; the trapezoidal formula can be applied for (19.221b) analogously 
to (19.212) and (19.213), and the discrete complex Fourier coefficients ¢ are obtained: 
1 Na . N-1 
Bu Yle - V fu (k=0,1,2,...,n) with (19.2242) 
v=0 v=0 
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1 2 2n 
ἘΞ w/o», By = (v =0,1,2,..., N — 1), UN—e N. (19.224b) 


Relation (19.224a) with the quantities (19.224b) is called the discrete complex Fourier transformation 
of length N of the values f, (v —2 0,1,2,..., N — 1). 
The powers uy — z (v — 0,1,2,..., N — 1) satisfy equation 2N = 1. So, they are called the N-th unit 
roots. Since e?" = 1, 

wh =1, whtt awh, oN m... (19.225) 
The effective calculation of the sum (19.224a) uses the fact that a discrete complex Fourier transfor- 


mation of length N = 2n can be reduced to two transformations with length > = nin the following 
way: 
a) For every coefficient ci with an even index, i.e., k — 2l, 

2n-1 n-1 n-1 


Ca = x fawn” = Σ [Awa F Famin | = X [fv s "m ων, (19.226) 
v=0 v=0 v=0 
holds. Here the equality wt = ww = wl” is used. 
Substituting 
Ww = fut fate (v =0,1,2,...,n—1) (19.227) 
and considering that w2, — w,, the sum 
n—l 
Gy — Syl" (v—0,1,2,...,n— 1) (19.228) 
v=0 
is the discrete complex Fourier transformation of the values y, (v — 0,1,2,...,n — 1) with length 
N 
n= >. 
2 


b) For every coefficient & with an odd index, i.e., with k = 2l + 1 


2n—1 n—l 
5 211-1)ν ν v 
an= D fok m Y Uo — fuss) oN (19.229) 
v=0 v=0 
is obtained analogously. Substituting 
Yniv = (fr — frivlwy (v —50,1,2,...,n— 1) (19.230) 
and considering that w2, — w,, the sum 
n—1l 
on = $ Ynya” (v —0,1,2,...,n — 1) (19.231) 
v=0 
is the discrete complex Fourier transformation of the values y44, (v = 0,1,2,...,n — 1) with length 
N 
n= >. 
2 


The reduction according to a) and b), i.e., the reduction of a discrete complex Fourier transformation 
to two discrete complex Fourier transformations of half the length, can be continued if N is a power of 2, 
i.e., if N = 2? (pisa natural number). The application of the reduction after p times is called the FFT. 


N 
Since every reduction step requires 3 complex multiplications because of (19.230), the computation 


cost of the FFT method is 
N N 
$3? E log N. (19.232) 
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4. Scheme for FFT 
For the special case N = 8 = 2°, the three corresponding reduction steps of the FFT according to 
(19.227) and (19.230) are demonstrated in the following Scheme 1: 












































Scheme 1: 
Step 1 Step 2 Step 3 
fo | Yo = fo + fa Yo = Yo + Y2 Yo = Yo + Yt = Co 
Alm=firt fs Y1 := Y1 + Y3 yı := (yo — vi)e = C4 
fa | Y2 = f2 + fo y2 := (yo — va)e Y2 := Y2 + Y3 Ξ- ὃν 
fs | Y3 = fs + fr Ys = (Yr — y3)w4 ys :— (ya — ys)wy = 6g 
fa | Ya = ο... Ya = Ya + Yo Ya = Ya + Ys = C1 
fs us - fs)ui Ys := Ys + Yr ys :— (ya — us)eS = és 
16 E -= fojo? Ye := (ya — Yow Ye := Ye + Y7 = 63 
fr = (fs — fou y7 := (ys — Yr)w4 yr := (ye — yr)w9 Ξ- ὅτ 
παπα N :=4, n:= 2, w= w? | N:=2,n:=1, wo =u} 


It can be observed how terms with even and odd indices appear. In Scheme 2 (19.233) the structure 
of the method is illustrated. 


Scheme 2: 
ὅ Čsk 
z ES “k = { Čsk+4 
a z ες. { Čsk+2 
č > m ΒΕ (19.233) 
s » Čsk+1 ` 
ο ο... εις 
Οι -» ο ois 
C4k+3 => { Cao 
(k=0,1,...,7) (k—0,1,2,3) (k—0,1  (k-0) 


Scheme 3: Index | Step 1 | Step 2 | Step3 Index 





If the coefficients č are substi- E Ξ z x 

tuted into Scheme 1 and one i ΠΝ a a fo gug 

considers the binary forms of ἔι 001, ῶ C4 [2 L00 

the indices before step 1 and z 2 . - 

after step 3, then it is easy to e ate Es a i ΠΡ 

recognize that the order of the ὦ OLL Če Če C6 LLO 

required coefficients can be ob- ie x ss > 

tained by simply reversing the ca 100 a a σι 001, 

order of the bits of the binary Čs LOL C3 Cs C5 LOL 

form of their indices. This is 2 - - S 

shown in Scheme 3. Čs LLO (5 65 c3 OLL 
& | LLL Cr Čr Cr LLL 


2 Fe 
a for x = 0, with period 27, the FFT is used for the dis- 
g for 0 <r < 2r, 


2m 1 
crete Fourier transformation. N = 8 is chosen. With x, = *' i= 8 


2πὶ 
e 8 —0.707107(1— i), ug — —i, u$ — —0.707107(1 4- i) Scheme 4 is got: 


E In the case of the function f(a) = { 


f(av) (v =0,1,2,...,7), ws = 
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Scheme 4: Step 1 Step 2 Step 3 

fo — 2.467401 | yo — 3.701102 Yo = 6.785353 Yo = 13.262281 — 

fı = 0.077106 | yı = 2.004763 yı = 6.476928 yı = 0.308425 — 

fo = 0.308425 | ys — 3.084251 y — 0.616851 y» = 0.616851 4- 2.4674021 — c 

fa — 0.693957 | ys — 4.472165 y3 = 2.4674021 | ys — 0.616851 — 2.4674021 — 6g 

f4 — 1.233701 | yg = 1.233700 ya = 1.233700 ya = 2.106058 + 5.956833i = & 
+2.467401i 

fs = 1.927657 | ys = —1.308537(1 — i) | ys = 0.872358 Ys = 0.361342 — 1.022031 i = č 
+3.489432 i 

Je = 2.775826 | ye = 2.467401 i ye = 1.233700 ye = 0.361342 + 1.022031 i = cs 
—2.467401i 

fz = 3.778208 | y7 = 2.180895(1 + i) yz = —0.872358 | yz = 2.106058 — 5.956833i = č 
+3.489432 i 














== [4 
From the third (last) reduction step the required real do; = 26 524 D62 


Fourier coefficients are obtained according to (19.223). E i eem e hn - Ee B 
(See the right-hand side.) a3 = 0.722684 b. — — 2.044 062 
In this example, the general property ae 0.616 850 b= 0 ` 

Ne= ik (19.234) 


of the discrete complex Fourier coefficients can be observed. For k = 1,2,3, it can be observed that 
C7 = C1, C6 = C2, C5 = C3. 


19.7 Representation of Curves and Surfaces with Splines 


19.7.1 Cubic Splines 


Since interpolation and approximation polynomials of higher degree usually have unwanted oscillations, 
it is useful to divide the approximation interval into subintervals by the so-called nodes and to consider 
a relatively simple approximation function on every subinterval. In practice, cubic polynomials are 
mostly used. A smooth transition is required at the nodes of this piecewise approximation. 


19.7.1.1 Interpolation Splines 


1. Definition ofthe Cubic Interpolation Splines, Properties 
Suppose there are given N interpolation points (x;, fi) (i = 1,2,..., N; £1 < £2 < ... xw). The cubic 
interpolation spline S(x) is determined uniquely by the following properties: 
1. S(x) satisfies the interpolation conditions S(x;) = fi (i = 1,2,..., N). 
2. S(x) is a polynomial of degree < 3 in any subinterval [x;, i43] (i = 1,2,..., N — 1). 
3. S(x) is twice continuously differentiable in the entire approximation interval [x1, xy]. 
4. S() satisfies the special boundary conditions: 
a) S"(z1) = S” (xy) = 0 (we call them natural splines) or 
b) S’(a1) = fi’, S(x) = fy’ (fy and fy’ are given values) or 
c) S(x1) = S(xy), in the case of fı = fy, S (£1) = S (xy) and S” (x1) = S” (xy) (they are called 


periodic splines). 
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It follows from these properties that for all twice continuously differentiable functions g(a) satisfying 
the interpolation conditions g(r;) — f; (i = 1,2,..., N) 


/ [S" (a) dx « ] 9 cor as (19.235) 


is valid (Holladay's Theorem). Based on (19.235) one can say that S(x) has minimal total curvature, 
since for the curvature & of a given curve, in a first approximation, x ~ S" (see 3.6.1.2, 4., p. 246). 
It can be shown that if a thin elastic ruler (its name is spline) is led through the points (z;, f;) (; — 
1,2,..., N), its bending line follows the cubic spline S(x). 
2. Determination of the Spline Coefficients 
The cubic interpolation spline S(x) for x € [z;, z;,1] has the form: 

S(x) — S;(z) ^ ai - bi(x — zi) - ci(z — vi)? 4- di(z — zi? ((21,2,...,N — 1). (19.236) 
The length of the subinterval is denoted by h; — z;,1 — x;. The coefficients of the natural spline can 
be determined in the following way: 
1. From the interpolation conditions we get 

a=f; (i=1,2,...,N—-1). (19.237) 
It is reasonable to introduce the additional coefficient ay = fy, which does not occur in the polynomi- 
als. 
2. The continuity of S"(x) at the interior nodes requires that 
Ci — Ci] 

3h 4 

The natural conditions result in c; = 0, and (19.238) still holds for i = N, if cy = 0 is introduced. 
3. The continuity of S(x) at the interior nodes results in the relation 


dii = (i—2,3,..., N — 1). (19.238) 


ay — αι αι 261 + CG 
hia 3 


4. The continuity of S’(a) at the interior nodes requires that 


bii = hia (i=2,3,..., N). (19.239) 


ciahica 4 2(hia + hi)ci + ciah; — 3 (3 = em) (i 2 2,3,..., N — 1). (19.240) 
li πια 


Because of (19.237), the right-hand side of the linear equation system (19.240) to determine the coef- 
ficients c; (4 — 2,3,..., N — l; ec — cw = 0) is known. The left hand-side has the following form: 


2(hi + h2) ha C2 
ha 2(h2 + h3) ha ο C3 
h 2(h3 + h4) h c 
? i t 3 on : (19.241) 
ο hy-2 
hy—2 2(hw—2 + hy-1) CN-1 


The coefficient matrix is tridiagonal, so the system of equations (19.240) can be solved numerically 
very easily by an LR decomposition (see 19.2.1.1, 2., p. 956). Then all other coefficients in (19.239) 
and (19.238) can be determined with these values c;. 


19.7.1.2 Smoothing Splines 

The given function values f; are usually measured values in practical applications so they have some 
error. In this case, the interpolation requirement is not reasonable. This is the reason why cubic smooth- 
ing splines are introduced. This spline is obtained if in the cubic interpolation splines the interpolation 
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requirements are replaced by 


$ [a +À f [S"(z)f" dr — mint. (19.242) 


i=l i 


The requirements of continuity of S', S’ and S” are kept, so the determination of the coefficients is a con- 
strained optimization problem with conditions given in equation form. The solution can be obtained 
by using a Lagrange function (see 6.2.5.6, p. 456). For details see [19.26]. 
In (19.242) A (A > 0) represents a smoothing parameter , which must be given previously. For \ = 0 
the result is the cubic interpolation spline, as a special case. For “large” A the result is a smooth ap- 
proximation curve, but it returns the measured values inaccurately, and for À — oo the result is the 
approximating regression line as another special case. A suitable choice of \ can be made, e.g., by 
computer-screen dialog. The parameter o; (c; > 0) in (19.242) represents the standard deviation (see 
16.4.1.3, 2., p. 851) of the measurement errors, of the values f; (i — 1,2,..., N). 
Until now, the abscissae of the interpolation points and the measurement points were the same as the 
nodes of the spline function. For large N this method results in a spline containing a large number of cu- 
bic functions (19.236). A possible solution is to choose the number and the position of the nodes freely, 
because in many practical applications only a few spline segments are satisfactory. It is reasonable also 
from a numerical viewpoint to replace (19.236) by a spline of the form 

r+2 


S(x) = 2 a; N;a(x). (19.243) 


Here r is the number of freely chosen nodes, and the functions N;4(r) are the so-called normalized B- 
splines (basis splines) of order 4, i.e., polynomials of degree three, with respect to the i-th node. For 
details see [19.5]. 


19.7.2 Bicubic Splines 
19.7.2.1 Use of Bicubic Splines 


Bicubic splines are used for the following problem: A rectangle R of the x, y plane, given bya < x X b, 
c < y < d, is decomposed by the grid points (xi, yj) (i — 0,1,...,n; j —0,1,..., m) with 

a = to < t1 <- «r4, —bD, c-—yo «yp: «ys d (19.244) 
into subdomains Rij, where the subdomain Rij contains the points (x, y) with x; < £ < xia, y; < Y < 
yj+ı (i =0,1,...,n— 1l; j =0,1,...,m — 1). The values of the function f(x,y) are given at the grid 
points 

Faso = fa (© =0,1,...,n;5 7 =0,1,...,m). (19.245) 
A possible simple, smooth surface over R is required which approximates the points (19.245). 


19.7.2.2 Bicubic Interpolation Splines 


1. Properties 
The bicubic interpolation spline S(x,y) is defined uniquely by the following properties: 
1. S(x, y) satisfies the interpolation conditions 
S(ziyj) — fg (6—0,1...,5 j 20,1,...,m). (19.246) 
2. S(r, y) is identical to a bicubic polynomial on every Rj; of the rectangle R, that is, 


3 3 
S(z,y) — S(z,y) ^ $, 9 ,auu(z — 21) (y — us) (19.247) 
ΚΞ0 1-0 


on ft. So, Sij(r, y) is determined by 16 coefficients, and for the determination of S(r,y) 16 - m- n 
coefficients are needed. 
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3. The derivatives 


ὃς as 98 





Ee 19.248 
Ox’ ὃν OzOy ( ) 
are continuous on R. So, a certain smoothness is ensured for the entire surface. 
4. S(x,y) satisfies the special boundary conditions: 

ὃς 

Jr (th Ws) =p; for i=0,n; 7 =0,1,...,m, 

x 
os 
By ον νὴ) —qi for i=0,1,...,n; j 2 0,m, (19.249) 


2 
5,5, mow) = for 1=0,n; 7 =0,m. 


Here pij, qi; and rij are previously given values. τς 

The results of one-dimensional cubic spline interpolation can be used for the determination of the co- 
efficients Qijkl- 

1. There is a very large number (2n + m + 3) of linear systems of equations but only with tridiagonal 
coefficient matrices. 

2. The linear systems of equations differ from each other only on their right-hand sides. 

In general, it can be said that bicubic interpolation splines are useful with respect to computation cost 
and accuracy, and so they are appropriate procedures for practical applications. For practical methods 
of computing the coefficients see the literature. 


2. Tensor Product Approach 


The bicubic spline approach (19.247) is an example of the so-called tensor product approach having the 
form 


n m 
δία, ψ) -- Σ) Σ᾽ αυσ(ογη(ῳ) (19.250) 
i-0 j-0 
and which is especially suitable for approximations over a rectangular grid. The functions g;(z) (i = 
0,1,...,n) and hj(y) (j = 0,1,..., m) form two linearly independent function systems. The tensor 
product approach has the big advantage, from numerical viewpoint, that, e.g., the solution of a two- 
dimensional interpolation problem (19.246) can be reduced to a one-dimensional one. Furthermore, 
the two-dimensional interpolation problem (19.246) is uniquely solvable with the approach (19.250) if 
1. the one-dimensional interpolation problem with functions g;(r) with respect to the interpolation 
nodes zo, z1,..., 24 and 
2. the one-dimensional interpolation problem with functions h;(y) with respect to the interpolation 
nodes Yo, yi, .. πι 
are uniquely solvable. 
An important tensor product approach is that with the cubic B-splines: 


r42pt2 
S(x,y) = SOS aij Nia(x) Nj aly): (19.251) 
i=1 j=1 


Here, the functions N;4(r) and N;4(y) are normalized B-splines of order four. Here r denotes the 
number of nodes with respect to x, p denotes the number of nodes with respect to y. The nodes can be 
chosen freely but their positions must satisfy certain conditions for the solvability of the interpolation 
problem. 

The B-spline approach results in a system of equations with a band structured coefficient matrix, which 
is a numerically useful structure. 
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For solutions of different interpolation problems using bicubic B-splines see the literature. 


19.7.2.3 Bicubic Smoothing Splines 

The one-dimensional cubic approximation spline is mainly characterized by the optimality condition 
(19.242). For the two-dimensional case there could be determined a whole sequence of corresponding 
optimality conditions, however only a few special cases make the existence of a unique solution possible. 
For appropriate optimality conditions and algorithms for solution of the approximation problem with 
bicubic B-splines see the literature. 


19.7.3 Bernstein-Bézier Representation of Curves and Surfaces 


1. Bernstein Basis Polynomials 
The Bernstein-Bézier representation (briefly B-B representation) of curves and surfaces applies the 
Bernstein polynomials 


Bin(t) = (1) Ü(—1)- (i—0,1...,n) (19.252) 
and uses the following fundamental properties: 
1. 0€ Bj, (f) X1 for 0€ tx 1, (19.253) 
2. M Bi(t)- 1. (19.254) 
i=0 


Formula (19.254) follows directly from the binomial theorem (see 1.1.6.4, p. 12). 
W A: By(t) Ξ]-- t, By a(t) =t (Fig. 19.12). 
E B: Bo(t)=(1-— t)’, Big(t) = 3t(1—#)?, Bog(t) = 3t?(1 — t) Bs3(t) = t? (Fig. 19.13). 


1 p 
B; © (0 


Sol 





0 1 t 0 


Figure 19.12 





2. Vector Representation 
Now, a space curve, whose parametric representation is x = x(t), y = y(t), z = z(t), will be denoted 
in vector form by 


T = F(t) = x(t) ës + y(t) €y + z(t) €. (19.255) 
Here t is the parameter of the curve. The corresponding representation of a surface is 
¥ = F(u, v) = z(u,v) Es + y(u, v) €, 4- z(u, v) €. (19.256) 


Here, u and v are the surface parameters. 
19.7.3.1 Principle of the B-B Curve Representation 
Suppose there are given n + 1 vertices P; (i = 0,1,...,n) of a three-dimensional polygon with the 


position vectors P;. Introducing the vector-valued function 


r(t) = » Bin (t)P; (19.257) 
i=0 
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a space curve is assigned to these points, which is called the B-B curve. Because of (19.254) for- 
mula (19.257) can be considered as a “variable convex combination” of the given points. The three- 
dimensional curve (19.257) has the following important properties: 


1. The points J P and Ph , are interpolated. 

2. Vectors PoP; and P, 4 P, are tangents to r(t) at points Po 
and B,. 

The relation between a polygon and a B-B curve is shown in 
Fig. 19.14. 

The B-B representation is considered as a design of the curve, 
since it is easy to influence the shape of the curve by changing 
the polygon vertices. 

Often normalized B-splines are used instead of Bernstein poly- 
Figure 19.14 nomials. 





The corresponding space curves are called the B-spline curves. Their shape corresponds basically to 
the B-B curves with the following advantages: 

1. The polygon is better approximated. 

2. The B-spline curve changes only locally if the polygon vertices are changed. 

3. In addition to the local changes of the shape of the curve the differentiability can also be influenced. 
So, it is possible to produce break points and line segments for example. 


19.7.3.2 B-B Surface Representation 


Suppose there are given the points £j; (i — 0,1,...,n; j — 0,1,..., m) with the position vectors P,;, 
which can be considered as the nodes of a grid along the parameter curves of a surface. Analogously to 
the B-B curves (19.257), a surface is assigned to the grid points by 


n m 
F(u, v) =X Y Bi (u) Bj, (v)Ps. (19.258) 
i=0 j=0 
Representation (19.258) is useful for surface design, since by changing the grid points the surface can be 
changed. Anyway, the influence of every grid point is global, so one should change from the Bernstein 
polynomials to the B-splines in (19.258). 


19.8 Using the Computer 
19.8.1 Internal Symbol Representation 


Computers are machines that work with symbols. The interpretation and processing of these symbols is 
determined and controlled by the software. The external symbols, letters, cyphers and special symbols 
are internally represented in binary code by a form of bit sequence. A bit (binary digit) is the smallest 
representable information unit with values 0 and 1. Eight bits form the next unit, the byte. In a byte 
one can distinguish between 2° bit combinations, so 256 symbols can be assigned to them. Such an 
assignment is called a code. There are different codes; one of the most widespread is ASCII (American 
Standard Code for Information Interchange). 


19.8.1.1 Number Systems 


1. Law of Representation 

Numbers are represented in computers in a sequence of consecutive bytes. The basis for the internal 
representation is the binary system, which belongs to the polyadic systems, similarly to the decimal 
system. 
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The law of representation for a polyadic number system is 
n 
a= >> xB’ (m>0, n> 0; m,n integer) (19.259) 
i=—m 


with B as basis and z; (0 < z; < B) asa digit of the number system. The positions i > 0 form the 
integers, those with i < 0 the fractional part of the number. 


W The decimal number representation, i.e., B. — 10, of the decimal number 139.8125 has the form 
139.8125 — 1-10? - 3- 101 --9- 109 - 8-101 1-10? - 2-10? 4- 5-10 4. 


The number systems occurring most often in computers are shown in Table 19.3. 


Table 19.3 Number systems 








Number system Basis | Corresponding digits 
Binary system 2 0,1 
Octal system 8 0,1,2,3,4,5,6,7 


Hexadecimal system 16 0, 1, 2,3, 4, 5, 6,7, 8,9, A,B,C,D,E,F 
(The letters A—F are for the values 10-15.) 


Decimal system 10 0, 1,2,3,4,5, 6, 78,9 














2. Conversion 

The transition from one number system to another is called conversion. If different number systems 
are used in the same time, in order to avoid confusion the basis is denoted as an index. 

W The decimal number 139.8125 is in different systems: 139.8125,9 — 10001011.11015 = 213.64 = 
8B.Dis. 

1. Conversion of Binary Numbers into Octal or Hexadecimal Numbers The conversion of 
binary numbers into octal or hexadecimal numbers is simple. Groups of three or four bits are formed 
starting at the binary point to the left and to the right, and their values are determined. These values 
are the digits of the octal or hexadecimal systems. 

2. Conversion of Decimal Numbers into Binary, Octal or Hexadecimal Numbers For the 
conversion of a decimal numbers into another system, the following rules are applied for the integer and 
for the fractional part separately: 

a) Integer Part: If G is an integer in the decimal system, then for the number system with basis B 
the law of formation (19.259) is: 


G=} zB? (n>0). (19.260) 
i=0 
If G is divided by B, then an integer part (the sum) is obtained and a residue: 
G = ; 20 
<= aB. 19.261 
B Σ ος ( ) 


Here, zo can have the values 0,1,..., B — 1, and it is the lowest valued digit of the required number. If 
this method is repeated for the quotients, further digits can be got. 

b) Fractional Part: If g is a proper fraction, then the method to convert it into the number system 
with basis B is 


m 
gB-—z4- B, (19.262) 
i-2 
i.e., the next digit is obtained as the integer part of the product gB. The values z_2, z_3, ... can be 


obtained in the same way. 
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W A: Conversion of the decimal number W B: Conversion of a decimal fraction 
139 into a binary number. 0.8125 into a binary fraction. 

139:2— 69 residue 1 (1- zo) 0.8125- 2 = 1.625 (1— 4) 
69:2—34 residue 1 (1-21) 0.625 .2—125 (122.3) 
34:2=17 residue 0 (0= x22) 0.25 «20.5 (0 = z_3) 
17:2= 8 residue 1 0.5 -2=1.0 (1— 2-4) 

8:2— 4 residue 0 0.0 -2=0.0 
ο... TOI 
1:2= 0 residue 1 (1-z;) 


139,9 — 100010115 


3. Conversion of Binary, Octal, and Hexadecimal Numbers into a Decimal Number The 
algorithm for the conversion of a value from the binary, octal, or hexadecimal system into the decimal 
system is the following, where the decimal point is after 29: 


a= J` zB’ (m>0, n> 0, integer). (19.263) 


The calculation is convenient with the Horner rule (see 19.1.2.1, p. 952). 
W LLLOL-—1-2*!41.2541.2?40.2!4 1.29 — 29. 





1110 1 
The corresponding Horner scheme is shown on the right. 2 6 14 98 
19.8.1.2 Internal Number Representation INR 1 3 7 14 |29 





Binary numbers are represented in computers in one or more bytes. Two types of form of representation 
are distinguished, the fixed-point numbers and the floating-point numbers. In the first case, the decimal 
point is at a fixed place, in the second case it is “floating” with the change of the exponent. 


1. Fixed-Point Numbers «—————— binary number (t bits) —————» 
The range for fixed-point numbers with the 
given parameters is 
















































































0<ja|<2-1. (19.264) 
Fixed-point numbers can be represented in the $ 
form of Fig. 19.15. sign v of the fixed-point number 
2. Floating-Point Numbers 
Basically, two different forms are in Figure 19.15 


use for the representation of floating-point numbers, where the internal implementation can vary in 
detail. 
exponent E mantissa M : . . . 
(p bits) (t bits) 1. Normalized Semilogarithmic Form 
mle >| In the first form, the signs of the exponent 
E and the mantissa M of the number a are 
stored separately 



















































































a=+MB**. (19.265a) 
$ $ Here the exponent F is chosen so that for the 
sign Vg sign Vm mantissa 
of the exponent of the mantissa 1/B<M<1 (19.265b) 


holds. It is called the normalized semiloga- 
Figure 19.16 rithmic form (Fig. 19.16). 
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The range of the absolute value of the floating-point numbers with the given parameters is: 
27? «|a| x (1-27) 20775. (19.266) 


2. IEEE Standard The second (nowadays used) form of floating-point numbers corresponds to the 
IEEE (Institute of Electrical and Electronics Engineers) standard accepted in 1985. It deals with the 
requirements of computer arithmetic, roundoff behavior, arithmetical operators, conversion of num- 
bers, comparison operators and handling of exceptional cases such as over- and underflow. 

The floating-point number representations are | Characteristic C h mantissa M 
shown in Fig. 19.17. 

The characteristic C comes from the exponent 
E by addition of a suitable constant K. This is 
chosen so that only positive numbers occur in the ; 


characteristic. The representable number is | ] . 
a = (—1)” -2E - 1.b1b2 ... bi sign v of the floating point number 


with E-C- K. (19.267) 
















































































Figure 19.17 


Here: Cai, — 1, Css; — 254, since C — 0and C = 255 are reserved. 
The standard gives two basic forms of representation (single-precision and double-precision floating- 
point numbers), but other representations are also possible. Table 19.4 contains the parameters for 
the basic forms. 














Table 19.4 Parameters for the basic forms 
Parameter Single precision | Double precision 
Word length in bits 32 64 
Maximal exponent Emax +127 +1023 
Minimal exponent Emin —126 —1022 
Constant K +127 +1023 
Number of bits in exponent 8 11 
Number of bits in mantissa 24 53 














19.8.2 Numerical Problems in Calculations with Computers 
19.8.2.1 Introduction, Error Types 


The general properties of calculations with a computer are basically the same as those of calculations 
done by hand, however some of them need special attention, because the accuracy comes from the rep- 
resentation of the numbers, and from the missing judgement with respect to the errors of the computer. 
Furthermore, computers perform many more calculation steps than human can do manually. 

So, thereis the problem of how to influence and control the errors, e.g., by choosing the most appropriate 
numerical method among the mathematically equivalent methods. 

In further discussions, the following notation is used, where x denotes the exact value of a quantity, 
which is mostly unknown, and f is an approximation value of x: 











Δ: " 
Absoluteerro: |Ax| —]|xr — i|. (19.208) Relative error: |——| = (19.269) 
z 
The notations 
e(r) 2r—$ and Eei(x) = EET (19.270) 
Ὁ 


are also often used. 
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19.8.2.2 Normalized Decimal Numbers and Round-Off 


1. Normalized Decimal Numbers 
Every real number x ¥ 0 can be expressed as a decimal number in the form 

x = +0.b1b2...- 107 (b, 0). (19.271) 
Here 0, bybg... is called the mantissa formed with the digits b; € {0,1,2,...,9}. The number F is an 
integer, the so-called exponent with respect to the base 10. Since b; 4 0, (19.271) is called a normalized 
decimal number. 
Since only finitely many digits can be handled by a real computer, one has to restrict himself to a fixed 
number t of mantissa digits and to a fixed range of the exponent Æ. So, from the number x given in 
(19.271) the number 


-. f 0.biba -+ - bi- 107 for bı <5 (round-down), 
mi X(0.b405--- b, 4-107*)10P — for bı >5 (round-up), 





(19.272) 





is obtained by round-off (as it is usual in practical calculations). The absolute error caused by round-off 
|Aa| = |x — 3| € 0.5 - 10 *10P. (19.273) 

2. Basic Operations and Numerical Calculations 

Every numerical process is a sequence of basic calculation operations. Problems arise especially with 

the finite number of positions in the floating-point representation. Here a short overview is given. It is 

supposed that x and y are normalized error-free floating-point numbers with the same sign and with a 

non-zero value: 


e=mB", y=m BY with (19.274a) 
t x s 
m=) aC) B-*, a) £0, and (19.274b) 
k=1 
aO, — Oorlor... or B-1 for k>1 (i=1,2). (19.274c) 


1. Addition If E, » E5, then the common exponent becomes £F, since normalization allows us to 
make only a left-shift. The mantissas are then added. 


If B^ €| m 4- m3 B (92 | 2 (19.2752) and | my +m ο λα, (19.275) 


then shifting the decimal point by one position to the left results in an increase of the exponent by one. 
lI 0.9604 - 10? -- 0.5873 - 10? — 0.9604 - 10? 4- 0.05873 - 10? — 1.01913 - 10? — 0.1019 - 10*. 


2. Subtraction The exponents are equalized as in the case of addition, the mantissas are then sub- 
tracted. If 


| m; — m3B- 0-99 |< 1 — B-* (19.2762) and |m — mB ~F: E2) |< B, (19.276b) 


shifting the decimal point to the right by a maximum of t positions results in the corresponding decrease 
of the exponent. 
Œ 0.1004- 10 — 0.9988 - 10? = 0.1004- 103 — 0.09988 - 10 = 0.00052 - 103 = 0.5200- 10°. This example 
shows the critical case of subtractive cancellation. Because of the limited number of positions (here 
four), zeros are carried in from the right instead of the correct characters. 
3. Multiplication The exponents are added and the mantissas are multiplied. If 

mim < B, (19.277) 
then the decimal point is shifted to the right by one position, and the exponent is decreased by one. 
Wi (0.3176 - 10?) - (0.2504 - 10°) = 0.07952704 - 10° = 0.7953 - 10”. 
4. Division The exponents are subtracted and the mantissas are divided. If 

my 


mp (19.278) 


meg 
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then the decimal point is shifted to the left by one position, and the exponent is increased by one. 
WI (0.3176 - 105)/(0.2504 - 105) — 1.2683706.... 107? — 0.1268 - 1071. 
5. Error of the Result The error of the result in the four basic operations with terms that are 
supposed to be error-free is a consequence of round-off. For the relative error with number of positions 
t and the base B, the limit is 

B 


gt. (19.279) 


6. Subtractive cancelation As it was mentioned above, the critical operation is the subtraction 
of nearly equal floating-point numbers. If it is possible, one should avoid this by changing the order of 
operations, or by using certain identities. 
Mx = 1985 — 1984 = 0.4455 - 10? — 0.4454 - 10? = 0.1- 107 or x = V/1985 — V/1984 = 
1985 — 1984 1 
= 0.1122 - 107 
1985 + V1984 


19.8.2.3 Accuracy in Numerical Calculations 


1. Types of Errors 
Numerical methods have errors. There are several types of errors, from which the total error of the final 
result is accumulated (Fig. 19.18). 





Total error 


























Input error Error of method Round-off error 






































Truncation error Discretization error 




















Figure 19.18 

2. Input Error 

1. Notion of Input Error Input error is the error of the result caused by inaccurate input data. 
Slight inaccuracies of input data are also called perturbations. The determination of the error of the 
input data is called the direct problem of error calculus. The inverse problem is the following: How 
large an error the input data may have such that the final input error does not exceed an acceptable 
tolerance value. The estimation of the input error in rather complex problems is very difficult and is 
usually hardly possible. In general, for a real-valued function y = f(x) with x = (x1, £2,- . , £n)! the 
absolute value of the input error is 


|Δυ| -- |f (£1, £2,- .-, £n) — f (ž1, Ž2,. .- , Žn)| 
n ð n 
-X ξο,.. €n) (i a1 <3 (imax 3 


if the Taylor formula (see 7.3.3.3, p. 471) is used for y = f n — f(11,22,..., 4) with a linear residue. 
E1, E2, ..., £4 denote the intermediate values, 21, 39,..., 2, denote the approximating values of x4, 2, 
...,%n. The approximating values are the perturbed input data. Here, also the Gauss error propagation 
law (see 16.4.2.1, p. 855) is considered. 

2. Input Error of Simple Arithmetic Operations The input error is known for simple arith- 
metical operations. With the notation of (19.268)-(19.270) for the four basic operations: 











oe (x!) lA], (19.280) 
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e(x x y) — e(x) + ely), (19.281) e(vy) = ye(x) + ve(y) + €(x)e(y), (19.282) 
x 1 2 : : 
€ B = —e(x) — e(y) + terms of higher order ine, (19.283) 
y y y 
€jg (m X y) = Due CI (19.284) e«(xy) — era (x) + eca(y) + Eret(%)érei(y), (19.285) 
Érel (2) = ére (£) — Era (y) + terms of higher order ine. (19.286) 
y 


The formulas show: Small relative errors of the input data result in small relative errors of the result 
on multiplication and division. For addition and subtraction, the relative error can be very large if 
[x x y| « [e] + lyl. 

3. Error ofthe Method 


1. Notion of the Error of the Method The error of the method comes from the fact that theoret- 
ically continuous phenomena are numerically approximated in many different ways as limits. Hence, 
there are truncation errors in limiting processes (as, e.g., in iteration methods) and discretization er- 
rors in the approximation of continuous phenomena by a finite discrete system (as, e.g., in numerical 
integration). Errors of methods exist independently of the input and round-off errors; consequently, 
they can be investigated only in connection with the applied solution methodology. 

2. Applying Iteration Methods If an iteration method is used, then both cases may occur: A 
correct solution or also a false solution of the problem can be obtained. It is also possible that no 
solution is obtained by an iteration method although there exists one. 

To make an iteration method clearer and safer, the following advices should be considered: 





a) To avoid “infinite” iterations, count the number of steps and stop the process if this number exceeds 
a previously given value (i.e., stop without reaching the required accuracy). 

b) The location of the intermediate result should be tracked on the screen by a numerical or a graphical 
representation of the intermediate results. 
c) All known properties of the solution should be used such as gradient, monotonicity, etc. 

d) The possibilities of scaling the variables and functions should be investigated. 

e) Several tests should be performed by varying the step size, truncation conditions, initial values, etc. 
4. Round-off Errors 
Round-off errors occur because the intermediate results should be rounded. So, they have an essential 
importance in judging mathematical methods with respect to the required accuracy. They determine 
together with the errors of input and the error of the method, whether a given numerical method is 
strongly stable, weakly stable or unstable. Strong stability, weak stability, or instability occur if the 
total error, at an increasing number of steps, decreases, has the same order, or increases, respectively. 
At the instability one distinguishes between the sensitivity with respect to round-off errors and dis- 
cretization errors (numerical instability) and with respect to the error in the initial data at a theo- 
retically exact calculation (natural instability). A calculation process is appropriate if the numerical 
instability is not greater than the natural instability. 





For the local error propagation of round-off errors, i.e., errors at the transition from a calculation step 
to the next one, the same estimation process can be used as the one applied at the input error. 

5. Examples of Numerical Calculations 

Some of the problems mentioned above are illustrated by numerical examples. 

E A: Roots of a Quadratic Equation: 

az? + bx + c = 0 with real coefficients a, b, c and D = b? — 4ac > 0 (real roots). Critical situations are 
the cases a) | dac |< b? and b) 4ac ~ b?. Recommended proceeding: 
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b+ sign(b) VD G 
Pts o p= == (Vieta root theorem, see 1.6.3.1, 3., p. 44). 
α ax, 


b) The vanishing of D cannot be avoided by a direct method. Subtractive cancellation occurs but the 

error in (b 4- sign(bv/D) is not too large since |b| > vD holds. 

E B: Volume ofa Thin Conical Shell for ^ < r 

V=4r (r+h}-—r? 
3 


equation V — 4x 


a)z-- 





because of (r 4- h) z r there is a case of subtractive cancellation. However in the 
3r2h + 3rh? + h3 
3 


there is no such problem. 


em 1 
E C: Determining the Sum S = X` Pal (S = 1.07667...) with an accuracy of three signif- 
k= 


icant digits. Performing the calculations with 8 digits, about 6000 terms should be added. After the 
1 1 


1 
PORE 


2 oo 


identical transformation 


T 


1 
S= ee B zh BOT x τη and S= Co » BOR hold. By this transformation only 


eight terms are considered. 











0 g? 
WB D: Avoiding the 0 Situation in the function z = (1 1-4 224 y) zl zu forz — y = 0. 
y 


Multiplying the numerator and the denominator by (1 4- /1 4- z? 4- y?) one avoids this situation. 
W E: Example for an Unstable Recursive Process. Algorithms with the general form y,41 — 











2 
aYn + by, 1 (n. — 1,2,...) are stable if the condition z £ 4 +6| < 1 is satisfied. The special 
Case Yn41 = —3Yn + 4Yn-1 (n = 1,2,...) is unstable. If yo and y; have errors € and —e, then for 


2; U3; UA: U5; Ue, - -- the errors are 7e, —25e, 103e, —409e, 1639e, .... The process is instable for the pa- 
rameters a — —3and b — 4 
E F: Numerical Integration of a Differential Equation. The numerical solution for the first- 
order ordinary differential equation 

y' = f(x,y) with f(x,y) = ay (19.287) 
and the initial value y(29) = yo will be represented. 
a) Natural Instability. Together with the exact solution y(x) for the exact initial values y(ao) = yo 
let u(x) be the solution for a perturbed initial value. Without loss of generality, it may be assumed that 
the perturbed solution has the form 

u(x) — y(x) 4 εη(α), (19.2882) 
where £ is a parameter with 0 < € < 1 and (x) is the so-called perturbation function. Considering 
that w(x) = f (v, u) one gets from the Taylor expansion (see 7.3.3.3, p. 471) 


u'(r) — f(x, (x) 4 en(x)) — f(x, y) 4 enl) f(x,y) + terms of higher order (19.288b) 
which implies the so-called differential variation equation 

nl Gc) — fur. yn. (19.288c) 
The solution of the problem with f(x,y) = ay is 

n(ax) = no e= with no = η(αρ). (19.288d) 


For a 7 0 even a small initial perturbation rp results in an unboundedly increasing perturbation rj(x). 
So, there is a natural instability. 
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b) Investigation of the Error of the Method in the Trapezoidal Rule. With a — —1, the stable 


differential equation y'(r) — —y(x) has the exact solution 
y(r) — yoe- € 9, where yy — y(vo). (19.2892) 
The trapezoidal rule is 
Til 4g; 
1 y(x)da % STI with h = ri41 — vi. (19.289b) 
Tı 


By using this formula for the given differential equation 


Tirpl 
3 . wry i 2—h. 
Hie = fi + Í (-y)dr = j, - En > Th or jua aan ος 
2— hM 
yi = | ———_] J 19.289c 
ῦ (; 2 i) D ( c) 


is valid. With 2; = x 4- ih, i.e., with 2 (r; — xo/h for0 € h «2 








h 12 80 
(19.289d) 


l (; — i) 

(zi—z0)/h n 2 4 
2—h 2+h h h 
Te 2 — qn ocU0 (rito) : s πο ο CORPO MET aue Bn? 

Vi E T Ἴ Yo = Joe with c(h) 1 


is obtained. If jo — yo, then j; « yi, and so for h — 0, Ji also tends to the exact solution yoe C779. 


c) Input Error In b) it is supposed that the exact and the approximate initial values coincide. Now, 
the behavior is investigated when yo Z Jo with | go — yo |< εο- 


Since (fia1— yia) € — 
(Uii Visi) Oh 
So, €:41 is at most of the same order as £o, and the method is stable with respect to the initial values. 

It has to be mentioned that in solving the above differential equation with the Simpson method an 
artificial instability is introduced. In this case, for h — 0, the general solution is obtained as 


δες Cie + C2(-1)'e*?. (19.290b) 


The problem is that the numerical solution method uses higher-order differences than those to which 
the order of the differential equation corresponds. 


19.8.3 Libraries of Numerical Methods 


Over time, libraries of functions and procedures have been developed independently of each other for 
numerical methods in different programming languages. An enormous amount of computer experimen- 
tation was considered in their development, so in solutions of practical numerical problems one should 
use the programs from one of these program libraries. Programs are available for current operating 
systems like WINDOWS, UNIX and LINUX and mostly for every computation problem type and they 
keep certain conventions, so it is more or less easy to use them. 


h P atl 
(9; — yi) thereis (giy1 — yii) € (=) (Uo — νο). (19.290a) 


The application of methods from program libraries does not relieve the user of the necessity of thinking 
about the expected results. This is a warning that the user should be informed about the advantages 
and also about the disadvantages and weaknesses of the mathematical method he/she is going to use. 


19.8.3.1 NAG Library 


The NAG library (Numerical Algorithms Group) is a rich collection of numerical methods in the form 
of functions and subroutines/procedures in the programming languages FORTRAN 77, FORTRAN 90 
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and C. Here is a contents overview: 


1. Complex arithmetic 14. Eigenvalues and eigenvectors 
2. Roots of polynomials 15. Determinants 
3. Roots of transcendental equations 16. Simultaneous linear equations 
4. Series 17. Orthogonalization 
5. Integration 18. Linear algebra 
6. Ordinary differential equations 19. Simple calculations with statistical data 
7. Partial differential equations 20. Correlation and regression analysis 
8. Numeric differentiation 21. Random number generators 
9. Integral equations 22. Non-parametric statistics 
10. Interpolation 23. Time series analysis 
11. Approximation of curves and surfaces from data 24. Operations research 
12. Minimum/maximum of a function 25. Special functions 
13. Matrix operations, inversion 26. Mathematical and computer constants 


Furthermore the NAG library contains extensive software concerning statistics and financial mathe- 
matics. 


19.8.3.2 IMSL Library 
The IMSL library (International Mathematical and Statistical Library) consists of three synchro- 
nized parts: 

General mathematical methods, 

Statistical problems, 

Special functions. 
The sublibraries contain functions and subroutines in FORTRAN 77, FORTRAN 90 and C. Here is a 
contents overview: 


General Mathematical Methods 


1. Linear systems 6. Transformations 
2. Eigenvalues 7. Non-linear equations 
3. Interpolation and approximation 8. Optimization 
4. Integration and differentiation 9. Vector and matrix operations 
5. Differential equations 10. Auxiliary functions 
Statistical Problems 
1. Elementary statistics 12. Random sampling 
2. Regression 13. Life time distributions and reliability 
3. Correlation 14. Multidimensional scaling 
4. Variance analysis 15. Estimation of reliability function, 
5. Categorization and discrete data analysis hazard rate and risk function 
6. Non-parametric statistics 16. Line-printer graphics 
7. Test of goodness of fit and test of randomness 17. Probability distributions 
8. Analysis of time series and forecasting 18. Random number generators 
9. Covariance and factor analysis 19. Auxiliary algorithms 
10. Discriminance analysis 20. Auxiliary mathematical tools 
11. Cluster analysis 


Special Functions 


1. Elementary functions 6. Bessel functions 
2. Trigonometric and hyperbolic 7. Kelvin functions 
functions 8. Bessel functions with fractional orders 
3. Exponential and related functions 9. Weierstrass elliptic integrals and 
4. Gamma function and relatives related functions 
5. Error functions and relatives 10. Different functions 
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19.8.3.3 Aachen Library 


The Aachen library is based on the collection of formulas for numerical mathematics of G. Engeln— 
Müllges (Fachhochschule Aachen) and F. Reutter (Rheinisch-Westfálische Technische Hochschule Aa- 
chen). It exists in the programming languages BASIC, QUICKBASIC, FORTRAN 77, FORTRAN 90, C, 
MODULA 2 and TURBO PASCAL. Here is an overview: 


Numerical methods to solve non-linear and special algebraic equations 
Direct and iterative methods to solve systems of linear equations 
Systems of non-linear equations 

Eigenvalues and eigenvectors of matrices 

Linear and non-linear approximation 

Polynomial and rational interpolation, polynomial splines 

Numerical differentiation 

Numerical quadrature 

Initial value problems of ordinary differential equations 

10. Boundary value problems of ordinary differential equations 


SI DUE Ore Bore 





The programs of the Aachen library are especially suitable for the investigation of individual algorithms 
of numerical mathematics. 


19.8.4 Application of Interactive Program Systems 
and Computeralgebra Systems 


19.8.4.1 Matlab 


The commercial program system Matlab Matlab (Matrix Laboratory) is an interactive environment 
for solving mathematically formulated problems and in the same time it is a high level script language 
for scientific technical computations. The set up priorities are the problems and algorithms of linear 
algebra. Matlab unifies the convenient well developed implementations of numerical procedures with 
advanced graphical representation of the results and data. The computations are processed mostly 
with double precision floating point numbers according to IEEE-standards (see Table19.4, p. 1004). 
As further alternatives which are compatible to Matlab are the systems Scilab and Octave with free 
downloads. 


1. Functions Survey 
There is a short survey of the procedures and functions available in Matlab: 


General Mathematical Functions 


1. Trigonometry 5. Coordinate transformations 

2. Exponential functions, Logarithms 6. Round-off and fractions 

3. Special functions 7. Discrete mathematics 

4. Complex arithmetic 8. Mathematical constants 
Numerical Linear Algebra 

1. Manipulation of fields and matrices 5. Eigenvalues and singular values 

2. Special matrices 6. Matrix factorization 

3. Matrix analyzes (norms, condition) 7. Matrix functions 

4. Systems of linear equations 8. Methods for sparse matrices 
Numerical Methods 

1. Calculation of statistical data 7. Determination of the convex closures 

2. Correlation and regression 8. Numerical integration 

3. Discrete Fourier transformation 9. Ordinary differential equations 

4. Polynomials and splines 10. Partial differential equations 

5. One- and more-dimensional interpolation 11. Non-linear equations 

6. Triangulations and decompositions 12. Minimization of functions 
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In addition there are several program packages of Matlab, the so called toolbooxes, which can be applied 
in the cases of special mathematical classis of problems. As some examples can be mentioned here the 
curve fitting, filtering, business mathematics, time series analysis, signal and pattern processing, neural 
networks, optimization, partial differential equations, splines, statistics and wavelets. 

In the following paragraphs the possibilities of Matlab are demonstrated by simple examples. The same 
problems are partially discussed here as in the paragraphs of numerical applications of Mathematica and 
Maple. 


2. Numerical Linear Algebra 


After starting with Matlab the command prompt >> appears in the command window to indicate the 
readiness to accept commands. If a command is not closed by a semicolon, then the result appears in 
the command window. The basic command to solve a system of linear equations A x — b (see 19.2.1, 
p. 955) is the backslash operator V. 


103 
E Given matrix A = (2 1 J and vector b — (—2,3,2)7 . For the input 
123 
>> A=(103;211;123], b=[—2;3;2], x=A\b, norm(A*x-—b) 


the output is 


103 -2 1.0000 
A—211 b— 3 x- 2.0000 ans = 8.8818e — 016 
123 2 — 1.0000 


As the Euclidean norm of the residual shows the obtained solution x (for which not all the digits are 
shown) satisfies the system of equations with an accuracy allowed by the mashine floating point repre- 
sentation. 
If the matrix A is quadratic and nonsingular, then the linear system has a unique solution. By the 
backslash operator V ordinary the Gaussian elimination is used with column pivoting, i.e., a triangle 
decomposition PA — LR is obtained (see 19.2.1.1, p. 955). 
W The triangle decomposition of A can be realized also with the input 

>> [L,R, P] = 1u(A) 


giving the output 


1.0000 0 0 2.0000 1.0000 1.0000 010 
L= | 0.5000 1.0000 0 R= 0 1.5000 2.5000 P= 001 
0.5000 —0.3333 1.0000 0 0 3.3333 100 


(Here the matrices are given in brackets in order to avoid confusion). 
The backslash operator first tests the properties of the coefficient matrix A. If A is a permutation 
of a triangle matrix, then the corresponding echelon form is solved. For a symmetric A the Cholesky 
method is applied (see 19.2.1.2, p. 958). 
If the condition number of the coefficient matrix A is too high, then numerical problems can occure 
during the solution. Because of this problem, during the procedure Matlab calculates an estimation of 
the reciprocal value of the condition number, and gives a warning if it is too small. 
E The Hilbert matrix H — (hi;) of order n — 13 can serve as an example, where hi; — 1/(i 4- k — 1). 
>> x = hilb(13)\ones(13, 1); 
Warning: Matrix is close to singular or badly scaled. 
Results may be inaccurate. RCOND = 2.409320e-017. 

In the case of overdetermined linear systems the corresponding linear fitting problem is handled by an 
orthogonalization procedure, i.e. by orthogonal transformations into a QR-decomposition A = QR 
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(see 19.2.1.3, p. 958). 
E >> 24—[102211123;:11-—1| b-—([-2;32;01; x-—AWb, nmnorm(A*x-— b) 
0.4673 


x= 14393 ans = 2.0508 
—0.4953 


2». [Q,R] = ar(A) 


—0.3780 —0.4583 0.6466 0.4785 —2.6458 —1.8898 —2.6458 

q= —0.7559 —0.2750 —0.2321 —0.5469 R= 0 1.5584 0.6417 
~ | —0.3780 0.8250 0.4145 —0.0684 τ 0 0 3.5480 
—0.3780 0.1833 —0.5969 0.6836 0 0 0 


The backslash operator also gives meaningful results in the cases of underdetermined and rank deficient 
linear systems of equations. The details of these cases with the ways how to handle large sparse matrices 
can be found in the corresponding documentation of Matlab and in the introductions [19.20], [19.29]. 
3. Numerical Solution of Equations 
A polynomial 

p(x) — aqz" -- ay 12" 1 +--+ + a,x + a9 
is in Matlab represented by the row vector (a5, à5 1,...,04, a9) of the coefficients. Several functions 
are available to handle polynomials. 


E As an example the polynomial value at 1, the derivative (i.e. the coefficient vector of the derivative 
polynomial) and the roots are determined for the polynomial p() — x$ + 3x? — 5. 

>> p-[100030 —5] 

>> polyval(p, 1) ans — —1 

>> polyder(p) ans— 600060 

»»  roots(p) ans — 0.8673 -- 1.15291, 0.8673 — 1.15291, 1.0743, 





0.8673 + 1.15291, —0.8673 — 1.15291, —1.0743 


The roots are determined as the eigenvalues of the corresponding companion matrix. 
The command fzero is used to find the approximate solutions of nonlinear scalar equations. 
W Calculation of three solutions of equation e? — 43? « Q. 
2  fzero(Q(x)exp(—x^3) — 4 * x^2, 1) ans — 0.4741 
2  fzero(Q(x)exp(—x^3) — 4 * x^2, 0) ans — —0.5413 
>>  fzero(QG(x)exp(—x^3) — 4 « x^2, — 1) ans — —1.2085 
'The input of the equation for the command fzero is made as an unnamed function. As it is obvi- 
ous, it depends on the given initial value in the second argument, which solution is approximated. A 


combination of the bisection method with regula falsi (see 19.1.1.3, p. 951) is used for the iteration 
process. 





4. Interpolation 

Function fitting based on a given data set can be done either by interpolation (see 19.6.1, p. 982 or 
19.7.1.1, p. 996) or by best approximating functions (see 19.6.2.2, p. 985). In Matlab the command plot 
is the most simple way to represent the data set graphically. The menu in the selfopening graphical 
window contains tools for editing the figure (linetypes, symbols, titles and legends), for exporting and 
printing Basic Fitting under Tools. 
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The Basic Fitting is a subroutine of Tools by which a variety of interpolation methods and best ap- 
proximating polynomials of different degrees are offered. It is realized by the functions interp1 and 
polyfit. 
By the input 

>> plot([1.70045, 1.2523, 0.638803, 0.423479, 0.249091, 0.160321, 0.0883432, 0.0570776, 


0.0302744, 0.0212794]); 


the data values are located at the data positions 1, 2,..., 10 and are graphically represented. Fig.19.19a 
shows the data set, the corresponding cubic interpolation spline and the best approximating polynomial 
of degree four. 


Cubic spline interpolant 
y =~ 2.194e- 005*x* - 0.0037135*x « 0.099896*x" - 0.85348*x * 
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Figure 19.19 a) and b) 

The Function interp2 offers appropriate methods to interpolate the data given over a two dimensional 
rectangular grid (see 19.7.2.1, p. 998). To interpolate irregularly distributed data is served by calling 
griddata. 
E The command sequence 

>> [X,Y] = meshgrid(—2 : 1:2); F—4-sqrt(16 — X.^2 — Y.^2); 

>> [Xe, Ye] = meshgrid(—2:0.1:2); S — interp2(X Y,F, Xe, Ye, spline^); 

>> surf(Xe,Ye,S) 

>> holdon; stem3(X,Y,F,’ fill’) 


realizes the bivariable cubic spline interpolation of the function f(r,y) — y16 — 2? — y? given on a 
grid. The interpolation spline is evaluated on a finer rectangular grid. Fig.19.19b represents the 
interpolation function where the data points are also shown. 

5. Numerical Integration 

Numerical integration is available in Matlab by the procedures quad and quadl. Both procedures are 
based on the recursive application of interpolation quadratures with adaptive step-size selection. quad 
is based on the Simpson formula, and in quad1 the Lobatto formulas of higher order are applied (see 
19.3.2, p. 964). In the case of sufficiently smooth integrand and higher accuracy requirements quadl 
works more effectively than quad. 


1 sina 





dx (Integral sine see 


W As the first example the approximation of the definite integral I = [ 
Jo 


8.2.5,1. p. 513) is considered. 
>> format long; [I, fwerte] = quad(@(x)(sin(x)./x), 0, 1) 
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Warning: Divide by zero. > In @(x) (sin(x) ./x) 
I = 0.94608307007653 fwerte = 14 

[I, fwerte] = quadl(@(x)(sin(x)./x), 0,1) 
Warning: Divide by zero. > In @(x) (sin(x)./x) In quadl at 64 
I = 0.94608307036718 fwerte = 19 


In quad at 62 


>> format long; 


Both procedures obviously recognize the discontinuity of the integrand at the left endpoint of the in- 
terval, but the approximated value of the integral can be obtained without any difficulty. Based on the 
results of the same example in 19.3.4.2, p. 968 the number of function evaluations seems to be high but 


it is determined for the adap 


>> format long; 
I = 0.94608307036718 
>> format long; 


I = 0.94608307036718 


(The warning messages are n 
given as a further argument 


ive recursion. 


I, fwerte] = quad(G(x)(sin(x)./x), 0, 1, 1e — 14) 
fwerte — 258 

I, fwerte] = quad1(@(x)(sin(x)./x),0,1,1e — 14) 
fwerte = 19 


ot repeated here.) The determination of the accuracy of 1074, which is 
the default is a 1079 tolerance), makes the advantages of quad1 obvious 


in this case. 








1000 
E The definite integral I = I e? dx is to be determined. 
J—1000 
>> format long; [I,fwerte] = quad(@(x)(exp(—x.*2)), —1000, 1000, 1e — 10) 
I = 1.77245385094233 fwerte = 585 
>> format long; [I, fwerte] = quadl(@(x)(exp(—x.”2)), —1000, 1000, 19 -- 10) 
I = 1.77245385090571 fwerte = 768 


The flat shape of the integrand in a very wide part of the integration interval and the relatively steep 
peak at x = 0 make quad better in this case. 

6. Numerical Solution of Differential Equations 

Matlab offers several procedures to determine the numerical solutions of initial value problems of sys- 
tems of first order ordinary differential equations. A standard procedure is ode45, in which Runge- 
Kutta methods of 4-th and 5-th order are applied with adaptive step-size selection (see 19.4.1.2, p. 969). 
To achieve higher accuracy the program ode113 is more effective with implemented linear multi-step 
methods of predictor-corrector type (see 19.4.1.4, p. 971). Besides there are procedures which are es- 
pecially effective for stiff systems of differential equations (see 19.4.1.5,4., p. 973). 


1 
Hi To solve the problem y’ = τα +y’), y(0) =0, by the Runge-Kutta method (see 19.4.1.2, p. 970) 
in the interval [0, 1] the input is 
>> ΙΚΥ] -- ede45(G(x, y) (a? -- y.?)./4),[0 1],0);  plot(x.y) 
The shape of the resulted solution is represented in Fig.19.20a. 
W To solve the special Lorenz-system (see ΒΒ in 17.2.4.3, p. 887) 


x} = l0(z2 — 21), x5 28:;— r9 — zixa, v3 — 1119 — 375 


in the interval 0 < t < 50 with initial conditions z(0) — (0, 1, 0)7 the following commands are used: 


>> [t,x] = ode45(@(t, x)([10 * (x(2) — x(1)); 
28 « x(1) — x(2) — x(1) * x(3);x(1) * x(2) — 8 * x(3)/3]), [0 50], [0; 1; 0]); 
>> plot(x(:,1),x(:,3)) 
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Figure 19.20 a) and b) 
The last command creates a phase-diagram in the ση, 3 plane (see Fig.19.20b). 
19.8.4.2 Mathematica 


1. Tools for the Solution of Numerical Problems 

The computer algebra system Mathematica offers a very effective tool that can be used to solve a large 
variety of numerical mathematical problems. The numerical procedures of Mathematica are totally dif- 
ferent from symbolic calculations. Mathematica determines a table of values of the considered function 
according to certain previously given principles, similarly to the case of graphical representations, and 
it determines the solution using these values. Since the number of points must be finite, this could be 
a problem with “ badly ” behaving functions. Although Mathematica tries to choose more nodes in 
problematic regions, we have to suppose a certain continuity on the considered domain. This can be 
the cause of errors in the final result. It is advised to use as much information as possible about the 
problem under consideration, and if it is possible, then to perform calculations in symbolic form, even 
if this is possible only for subproblems. 

In Table 19.5, we represent the operations for numerical computations: 


Table 19.5 Numerical operations 





NIntegrate calculates definite integrals 
n 
NSum calculates sums > f(z) 
i=l 
NProduct calculates products 
NSolve numerically solves algebraic equations 
NDSolve numerically solves differential equations 








After starting Mathematica the “ Prompt ” In[1] := is shown; it indicates that the system is ready to 
except an input. Mathematica denotes the output of the corresponding result by Out [1]. In general: 
The text in the rows denoted by In[n] := is the input. The rows with the sign Out [n] are given back 
by Mathematica as answers. The arrow —> in the expressions means, e.g., replace x by the value a. 


2. Curve Fitting and Interpolation 


1. Curve Fitting Mathematica can perform the fitting of chosen functions to a set of data using the 
least squares method (see 6.2.5, p. 454ff.) and the approximation in mean to discrete problems (see 
19.6.2.2, p. 986). The general instruction is: 


Fit[(yi, yo, . ..], funkt, z]. (19.291) 
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Here the values y; form the list of data, funkt is the list of the chosen functions, by which the fitting 
should be performed, and x denotes the corresponding domain of the independent variables. If funkt 
is chosen, e.g., as Table[x^i, (4,0, n)] , then the fitting will be made by a polynomial of degree n. 
W Let the following list of data be given: 

In[1] :=1 = (1.70045, 1.2523, 0.638803, 0.423479, 0.249091, 0.160321, 0.0883432, 0.0570776, 


0.0302744, 0.0212794) 


With the input 
In[2]:— fl- Fit[l, (1, 2, x^2, z^3, 1^4}, z] 
it is supposed that the elements of | are assigned to the values 1,2,..., 10 of x. The result is the following 


approximation polynomial of degree four: 
Out [2] = 2.48918 — 0.853487x + 0.0998996x? — 0.003713932 — 0.0000219224.a° 
With the command 
In[3] := Plot|ListPlot|l, {x, 10}], f1, {x, 1,10}, AxesOrigin—> {0, 0}| 
a representation of the data and the approximation curve given in Fig. 19.21a can be obtained. 
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Figure 19.21 


For the given data this is completely satisfactory. The terms are the first four terms of the series ex- 
pansion of e1057, 

2. Interpolation Mathematica offers special algorithms for the determination of interpolation func- 
tions. They are represented as so-called interpolating function objects, which are formed similarly 
to pure functions. The directions for using them are in Table 19.6. Instead of the single function 
values y; a list of function values and values of specified derivatives can be given at the given points. 
B With In[3] :— Plot[Interpolation[][r], (x, 1, 10] one gets Fig. 19.21b. Obviously Mathemat- 
ica gives a precise correspondence to the data list. 


Table 19.6 Commands for interpolation 





Interpolation|{y1, yo,...}] gives an approximation function with the values y; for the 
values x; = i as integers 

Interpolation|{{x1,y1}, {x2, yo},...}] gives an approximation function for the point-sequence 
(xi, yi) 








3. Numerical Solution of Polynomial Equations 
As shown in 20.3.2.1, p. 1038 Mathematica can determine the roots of polynomials numerically. The 
command is NSolve[p[r|] == 0, x, n], where n prescribes the accuracy by which the calculations should 
be done. If n is omitted, then the calculations are made to machine accuracy. The complete solution 
is got, i.e., m roots, if the input polynomial is of degree m. 
E 77/1] := NSolve|r^6 + 3r^2 — 5 == 0] 

Out [1] = {x—> —1.07432}, {x—> —0.867262 — 1.152921}, {xr—> —0.867262 + 1.152921}, 
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{x—> 0.867262 — 1.152921}, {x—> 0.867262 + 1.152921}, {x—> 1.07432}}. 


4. Numerical Integration 


For numerical integration Mathematica offers the procedure NIntegrate. Differently from the symboli- 
cal method, here it works with a table of values of the integrand. Two improper integrals are considered 
(see 8.2.3, p. 506) as examples. 

@ A: In[1] := NIntegrate|Exp|—2x’2], {w,—Infinity,Infinity}] Out [1] = 1.77245. 

ΒΒ: In[2] :— NIntegrate[1/27^2, (v, —1,1]] 


: : Μπ. 

Power::infy: Infinite expression — encountered. 

NIntegrate::inum: Integrand ComplexInfinity is not numerical at{«} = {0}. 
Mathematica recognizes the discontinuity of the integrand at x = 0 in example B and gives a warning. 
Mathematica applies a table of values with a higher number of nodes in the problematic domain, and it 
recognizes the pole. However, the answer can be still wrong. 
Mathematica applies certain previously specified options for numerical integration, and in some spe- 
cial cases they are not sufficient. The minimal and the maximal number of recursion steps, by which 
Mathematica works in a problematic domain, can be determined with parameters MinRecursion and 
MaxRecursion. The default options are always 0 and 6. If these values are increased, then although 
Mathematica works slower, it gives a better result. 
E In[3] := NIntegrate[Exp|—27^2], (, — 1000, 1000)] Mathematica cannot find the peak at x = 0, 
since the integration domain is too large, and the answers is: 

NIntegrate::ploss: 

Numerical integration stopping due to loss of precision. Achieved neither the requested 

PrecisionGoal nor AccuracyGoal; suspect one of the following: highly oscillatory integrand 

or the true value of the integral is 0. 

Out [3] = 1.34946 - 10 76 


If the requirement is 
1π[4] :— NIntegrate[Exp|—x^2], (x, —1000, 1000), MinRecursion—- 3, MaxRecursion—- 10], 
then the result is 
Out [4] = 1.77245 
Similarly, a result closer to the actual value of the integral can be got with the command: 
NIntegrate|fun, c, x, 1, 22, . . . , ve]]. (19.292) 
One can give the points of singularities x; between the lower and upper limit of the integral to force 
Mathematica to evaluate more accurately. 
5. Numerical Solution of Differential Equations 
In the numerical solution of ordinary differential equations and also in the solution of systems of differ- 
ential equations Mathematica represents the result by an InterpolatingFunction. It allows us to get 


the numerical values of the solution at any point of the given interval and also to sketch the graphical 
representation of the solution function. The most often used commands are represented in Table 19.7. 


Table 19.7 Commands for numerical solution of differential equations 





NDSolve[dgl, y, (x, x4, x«]] computes the numerical solution of the differential equation 
in the domain between x, and x, 

InterpolatingFunction|liste][x] gives the solution at the point x 

Plot[Evaluate[y|v]/. l0s]], (x, x4, x«)] | scetches the graphical representation 











19.8 Using the Computer 1019 





W Solution of a differential equation describing the motion of a heavy object in a medium with friction. 
The equations of motion in two dimensions are 


Ë =w t? + y? +a, ü =-g- Wi? +i? i. 


The friction is supposed to be proportional to the velocity. If g = 10, y = 0.1 are substituted, then the 
following command can be given to solve the equations of motion with initial values z(0) = y(0) = 0 
and z(0) — 100, ġ(0) = 200: 
In[1]:— dg — NDSolve[(z"[t] 2— —0.1Sqrt[z'[t]^2 ^ y'[t]^2] x'[t], y” [t] == —10 
—0.1Sqrt [z" (4]^2 ἠ- y'[t]^2] v'(t], x[0] —— v[0] —— 0, z"[0] == 100, y’[0] == 200}, 


{x,y}, {t, 15} 


Mathematica gives the answer by the interpolating function: 
Out [1] = {{x—> InterpolatingFunction[{0., 15.}, <>], 


y—> InterpolatingFunction[{0.,15.},<>]}} 


The solution 
In[2] := ParametricPlot|{x[t], y[t]}/.dg, {t, 0,2}, PlotRange—> A11] 
is represented as a parametric curve (Fig. 19.22a). 
NDSolve accepts several options which affect the accuracy of the result. 
The accuracy of the calculations can be given by the command AccuracyGoal. The command 


PrecisionGoal works similarly. During calculations, Mathematica works according to the so-called 
WorkingPre cision, which should be increased by five units in calculations requiring higher accuracy. 


The numbers of steps by which Mathematica works in the considered domain is prescribed as 500. In 
general, Mathematica increases the number of nodes in the neighborhood of the problematic domain. 
In the neighborhood of singularities it can exhaust the step limit. In such cases, it is possible to increase 
the number of steps by MaxSteps. It is also possible the prescribe Infinity for MaxSteps. 


E The equations for the Foucault pendulum are: 
d(t)--w?z(t) -209(t), YE) +a ylt) = —203(t). 
With w = 1, Q = 0.025 and the initial conditions z(0) — 0, y(0) — 10, z(0) — ġ(0) = 0 the solution is: 
In[3] := dg3 = NDSolve[{x"|t] == —a{t] + 0.05y’{E], y”[4] == —ylt] — 0.05x'ft], 
x[0] == 0, y[0] == 10, x"[0] == y'[0] == 0}, {x,y}, {t, 0, 40}] 
Out [3] = {{x—> InterpolatingFunction[{0., 40.}, <>], 
y—> InterpolatingFunction[{0., 40.},<>]}} 


With 
In[4] := ParametricPlot|{x[t], y[t]}/.dg3, (t, 0, 40), AspectRatio— 1] 
one gets Fig. 19.22b. 


19.8.4.3 Maple 


The computer algebra system Maple can solve several problems of numerical mathematics with the use 
of built-in approximation methods. The number of nodes, which is required by the calculations, can 
be determined by specifying the value of the global variable Digits as an arbitrary n. But it should be 
kept in mind that selecting a higher n than the prescribed value results in a lower speed of calculation. 
1. Numerical Calculation of Expressions and Functions 

After starting Maple, the symbol * Prompt " > is shown, which denotes the readyness for input. Con- 
nected in- and outputs are often represented in one row, separated by the arrow operator —> . 
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Figure 19.22 


1. Operator evalf Numerical values of expressions containing built-in and user-defined functions 
which can be evaluated as a real number, can be calculated with the command 

evalf (expr, n). (19.293) 
expr is the expression whose value should be determined; the argument n is optional, it is for evaluation 
to n digits accuracy. Default accuracy is set by the global variable Digits. 
W Prepare a table of values of the function y — f(x) = yz + Inz. 
First, the function is defined by the arrow operator: 

> f:=2—> sart(z)+In(z); 3 f:= 2S Vrt+Ing. 
Then the required values of the function can be got with the command evalf(f());, where a numerical 
value should be substituted for x. 
A table of values of the function with steps size 0.2 between 1 and 4 can be obtained by 

> for x from 1 by 0.2 to 4 do print(/f|v| — evalf(f(x),12)) od; 
Here, it is required to work with twelve digits. 
Maple gives the result in the form of a one-column table with elements in the form ft; — 2.95200519181. 


2. Operator evalhf (expr): Beside evalf there is the operator evalhf. It can be used in a similar 
way to evalf. Its argument is also an expression which has a real value. It evaluates the symbolic 
expression numerically, using the hardware floating-point double-precision calculations available on 
the computer. A Maple floating-point value is returned. Using evalhf speeds up your calculations in 
most cases, but you lose the definiable accuracy of using evalf and Digits together. For instance in 
the problem in 19.8.2, p. 1004, it can produce a considerable error. 


2. Numerical Solution of Equations 
By using Maple equations or systems of equations can be solved numerically in many cases. 
The command to do this is fsolve. It has the syntax 

fsolve(eqn, var, option). (19.294) 
This command determines real solutions. If eqn is in polynomial form, the result is all the real roots. 
If eqn is not in polynomial form, it is likely that fsolve will return only one solution. The available 
options are given in Table 19.8. 


Table 19.8 Options for the command fsolve 





complex determines a complex root (or all roots of a polynomial) 

maxsols =n determines at least the n roots (only for polynomial equations) 

fulldigits ensures that fsolve does not lower the number of digits used during computations 
intervall looks for roots in the given interval 
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E A: Determination of all solutions of a polynomial equation x9 + 32? — 5 — 0. With 

> eq:= z^64-3*2^2—5-0: 
the result is 

> fsolve(eq, £); — —1.074323739, 1.074323739 
Maple determined only the two real roots. With the option complex, also the complex roots are ob- 
tained: 

> fsolve(eq, x, complex); 

—1.074323739, —0.8672620244 — 1.1529220121, —0.8672620244 + 1.1529220121, 

0.8672620244 — 1.152922012T, 0.8672620244 + 1.152922012T, 1.074323739 

W B: Determination of both solutions of the transcendental equation e™ — 4a? = 0. 
After defining the equation 

> eq := exp(—x^3) — 4 * r^2 = 0 
the result is 

> fsolve(eq, x); — 0.4740623572 
as the positive solution. With 

> fsolve(eq, x, x — —2..0); — —0.5412548544 
Maple also determines the second (negative) root. 
3. Numerical Integration 
The determination of definite integrals is often possible only numerically. This is the case when the 
integrand is too complicated, or if the primitive function cannot be expressed by elementary functions. 
The command to determine a definite integral in Maple is evalf: 


evalf(int(f(r),x — a..b), n). (19.295) 
Maple calculates the integral by using an approximation formula. 


2 
W Calculation of the definite integral / e* dx. Since the primitive function is not known, for the 
J-2 


integral command the following answer is got 
2 

> int(exp(—x2"3), 2 = —2..2); --{ e^ dz. 
-2 


If 
> evalf(int(exp(—2x%3),x = —2..2), 15); 
is typed,then the answer is 277.745841695583. 
Maple used the built-in approximation method for numerical integration with 15 digits. 
In certain cases this method fails, especially if the integration interval is too large. Then, another 
approximation procedure can be tried with the call to a library 
readlib()evalf/int'): 
which applies an adaptive Newton method. 
E The input 
> evalf(int(exp(—x^2), x = —1000..1000)); 
results in an error message. With 
> readlib(`evalf/int`): 
> `evalf/int`(exp(—x^2), x — —1000..1000,10, NCrule); 
1.772453851 


the correct result is obtained. The third argument specifies the accuracy and the last one specifies the 
internal notation of the approximation method. 
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4. Numerical Solution of Differential Equations 
Ordinary differential equations can be solved with the Maple operation dsolve. However, in most cases 
it is not possible to determine the solution in closed form. In these cases, it can be solved numerically, 
where the corresponding initial conditions have to be given. 
In order to do this, the command dsolve is used in the form 

dsolve(deqn, var, numeric) (19.296) 
with the option numeric as a third argument. Here, the argument deqn contains the actual differential 
equation and the initial conditions. The result of this operation is a procedure, and if it is denoted, e.g., 
by f, for using the command f(t), the value of the solution function at the value t of the independent 
variable is returned. 
Maple applies the Runge-Kutta method to get this result (see 19.4.1.2, p. 969). The default accuracy for 
the relative and for the absolute error is 107P'sits+3, The user can modify these default error tolerances 
with the global symbols _RELERR and _ABSERR. If there are some problems during calculations, then 
Maple gives different error messages. 
W At solving the problem given in the Runge-Kutta methods in 19.4.1.2, p. 970, Maple gives: 

> r:= dsolve({diff(y(x), x) = (1/4) * (a2 + y(x)*2), y(0) = 0}, y(x), numeric); 

r := proc ‘dsolve/numeric/result2> (x, 1592392, [1]) end 

With 

> x(0.5); —> {x(.5) = 0.5000000000, (2) (.5) = 0.01041860472} 


we can determine the value of the solution, e.g., at x — 0.5. 


20 Computer AlgebraSystems- Example 
Mathematica 


20.1 Introduction 
20.1.1 Brief Characterization of Computer Algebra Systems 
20.1.1.1 General Purpose of Computer Algebra Systems 


The development of computers has made possible the introduction of computer algebra systems for 
“doing mathematics”. They are software systems able to perform mathematical operations formally. 
These systems, such as Macsyma, Reduce, Derive, Maple, Mathematica, Matlab, Sage, can also be used 
on relatively small computers (PC), and with their help, we can transform complicated expressions, 
calculate derivatives and integrals, solve systems of equations, represent functions of one and of sev- 
eral variables variables graphically, etc. They can manipulate mathematical expressions, i.e., they can 
transform and simplify mathematical expressions according to mathematical rules if this is possible in 
closed form. They also provide a wide range of numerical solutions to required accuracy, and they can 
represent functional dependence between data sets graphically. 

Most computer algebra systems can import and export data. Besides a basic offer of definitions and 
procedures which are activated at every start of the system, most systems provide a large variety of 
libraries and program packages from special fields of mathematics, which can be loaded and activated 
on request (see [20.15].[20.16]). Computer algebra systems allow users to build up their own packages 
(20.11]- [20.14]. 

However, the possibilities of computer algebra systems should not be overestimated. They spare us 
the trouble of boring, time-demanding, and mechanical computations and transformations, but they 
do not save us from thinking. 

For frequent errors see 19.8.2, p. 1004. 


20.1.1.2 Restriction to Mathematica 
The systems are under perpetual developing. Therefore, every concrete representation reflects only a 
momentary state. Here we introduce the basic idea and applications of these systems for the most im- 
portant fields of mathematics. This introduction will help for the first steps in working with computer 
algebra systems. In particular, we discuss Mathematica compatible until Version 10. This system seems 
to be very popular among users, and the basic structures of the other systems are similar. 
In this book, we do not discuss how computer algebra systems are installed on computers. It is assumed 
that the computer algebra system has already been started by a command, and it is ready to commu- 
nicate by command lines or in a Windows-like graphical environment. 
The input and output is always represented for Mathematica (see 19.8.4.2, 1., p. 1016) in rows which 
are distinguished from other text parts, e.g., in the form 

In[1] := Solve[3 z — 5 —— 0, x] (20.1) 
System specific symbols (commands, type notation, etc.) will be represented in typewriter style. 
In order to save space, we often write the input and the output in the same row in this book, and we 
separate them by the symbol —>. 


20.1.1.3 Two Introducing Examples of Basic Application Fields 


1. Manipulation of Formulas 

Formula manipulation means here the transformation of mathematical expressions in the widest sense, 
e.g., simplification or transformation into a useful form, representation of the solution of equations or 
systems of equations by algebraic expressions, differentiation of functions or determination of indefinite 
integrals, solution of differential equations, formation of infinite series, etc. 
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W Solution of the following quadratic equation: 


x? +ar+b=0 with a,bc R. (20.2a) 
In Mathematica, one types: 
Solveļz? +a x +b == 0, zx]. (20.2b) 


After pressing the corresponding input key/keys (ENTER or SHIFT+RETURN, depending on the 
operation system), Mathematica replaces this row by 


In[1] :— Solve|i? - a x - 0 —— 0, x] (20.2c) 
and starts the evaluation process. In a moment, the answer appears in a new row 


σαί [1] --{{α sol a — va? 4b) }, {x >5( a+ va? 4b) }}. (20.2d) 


Mathematica has solved the equation and both solutions are represented in the form of a list consisting 
of two sublists. 








2. Numerical Calculations 

Computer algebra systems provide many procedures to handle numerical problems of mathematics. 
These are solutions of algebraic equations, linear systems of equations, the solutions of transcendental 
equations, calculation of definite integrals, numerical solutions of differential equations, interpolation 
problems, etc. 


W Problem: Solution of the equation 

a5 — 2a? — 30a* 4- 362? -- 1902? — 36x — 150 — 0. (20.3a) 
Although this equation of degree six cannot be solved in closed form, it has six real roots, which are to 
be determined numerically. 


In Mathematica the input is: 

In[1] := NSolve[x® — 22° — 30x* -- 362? 4- 190x? — 36x — 150 == 0, 2] (20.3b) 
It results in the answer: 

Out[1] — ((x— —4.42228), (r— —2.14285), (v— —0.937347}, {a—> 0.972291}, 

(x— 3.35802), (x— 5.172171) (20.3c) 


This is a list of the six solutions with a certain accuracy which will be discussed later. 


20.2 Important Structure Elements of Mathematica 


Mathematica is a computer algebra system, developed by Wolfram Research Inc. A detailed description 
of Mathematica can be found in. [20.11]-[20.16]. For the current version 10 see the Virtual Book in the 
online Help. 


20.2.1 Basic Structure Elements of Mathematica 


In Mathematica the basic structure elements are called expressions. Their syntax is (it is emphasized 
again, that the current objects are given by their corresponding symbol, by their names): 


objy|obj,, objy,..., obj,,] (20.4) 
0bj, is called the head of the expression; the number 0 is assigned to it. The parts obj; (i — 1,...,m) 
are the elements or arguments of the expression, and one can refer to them by their numbers 1,...,n. 


In many cases the head of the expression is an operator or a function, the elements are the operands or 
variables on which the head acts. 

Also the head, as an element of an expression, can be an expression, too. Square brackets are reserved 
in Mathematica for the representation of an expression, and they can be applied only in this relation. 
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E Theterm z^2 4-2: -- 1, which can also be entered in this infix form (and also in the nicer, preferred 
form a? + 22 +1) in Mathematica, has the complete form (FullForm) 


Plus[l, Times[2, 1], Power (v, 2]] 
which is also an expression. Plus, Power and Times denote the corresponding arithmetical operations. 
'The example shows that all simple mathematical operators exist in prefix form in the internal repre- 
sentation, and the term notation is only a facility in Mathematica. 
Parts of expressions can be separated. This can be done with Part [expr, i], where i is the number of 
the corresponding element. In particular, 7 = 0 gives back the head of the expression. 
E If entering in Mathematica 

In[1]:— 2? 4 2x 4-1, 
then after the SHIFT and ENTER keys together are pressed, Mathematica answers 

Üut[1] = 1 +22 + x? 


Mathematica analyzed the input and returned it in mathematical standard form. If the input had been 
terminated by a semicolon, then the output would have been suppressed. 
If entering 


In[2]:— FullForm[] 
then the answer is 
Out [2] — Plus[l, Tines[2, z], Power v, 2]] 


The sign % in the square brackets tells Mathematica that the argument of this input is the last output. 
From this expression it is possible to get, e.g., the third element 

In[3] := Part[%,3] as Out[3] =x? 
which is again an expression in this case. 
Symbols in Mathematica are the notation of the basic objects; they can be any sequence of letters and 
numbers but they must not begin with a number. The special sign $ is also allowed. Upper-case and 
lower-case letters are distinguished. Reserved symbols begin either with a capital letter, or with the 
sign $, and in compound words also the second word begins with a capital letter, if it has a separate 
meaning. Users are advised to create their own symbols starting with lower-case letters. 


20.2.2 Types of Numbers in Mathematica 
20.2.2.1 Basic Types of Numbers 


Mathematica knows four types of numbers represented in Table 20.1. 


Table 20.1 Types of numbers in Mathematica 








Type of number | Head Characteristic Input 
Integers Integer | exact integer, arbitrarily long nnnnn 
Rational numbers | Rational | fraction of coprimes in form Integer/Integer pppp/qqqq 
Real numbers Real floating-point number, arbitrary given precision | nnnn.mmmm 
Complex numbers | Complex | complex number in the form number+number *1 

















Real numbers, i.e., floating-point numbers, can be arbitrarily long. If an integer nnn is written in the 
form nnn., then Mathematica considers it as a floating-point number, that is, of type Real. 

The type of a number x can be determined with the command Head[r]. Hence, In[1] := Head[51] 
results in Qut [1] = Integer, while In[2] := Head[51.] Out [2] — Real. The real and imaginary 
components of a complex number can belong to any type of numbers. A number such as 5.731 + 0T is 
considered as a Real type by Mathematica, while 5.731 + 0. Lis of type Complex, since 0. is considered 
as a floating-point approximation of 0. 
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There are some further operations, which give information about numbers. So, 

In[3] := NumberQ[51] results in Out [3] = True, (20.5a) 
Otherwise, if x is manifestly not a number, as e.g. « = 7, then the output is Qut [3] =False. However, 
NumericQ[7] gives True. Here, True and False are the symbols for Boolean constants. 

IntegerQ[r] tests if x is an integer, or not, so 

In[4] := IntegerQ|2.| —> Out[4] = False (20.5b) 
Similar tests can be performed for numbers with heads EvenQ, OddQ and PrimeQ. Their meanings are 
obvious. So, one gets 


In[5]:— PrimeQ[1075643] —95 Out[5] = True (20.5c) 
while 

In[6] := PrimeQ[1075641] —+ Out[6] = False (20.5d) 
These last tests belong to a group of test operators, called predicates or criteria, which all end by Q and 
always answer True or False in the sense of a logical test (including a type check). 
20.2.2.2 Special Numbers 


In Mathematica, there are some special numbers which are often needed, and they can be called with 


as the transformation 





T 
arbitrary accuracy. They include x with the symbol Pi, e with the symbol E, 180 


factor from degree measure into radian measure with the constant Degree, Infinity as the symbol for 
oo and the imaginary unit I. 


20.2.2.3 Representation and Conversion of Numbers 


Numbers can be represented in different forms which can be converted into each other. So, every real 
number z can be represented by a floating-point number N[z, n] with an n-digit precision. 
IN[1]:— N[E, 20] yields —» Out[1] = 2.7182818284590452354 (20.6α) 


With Rationalize|[r, dz], the number x with an accuracy dx can be converted into a rational number, 
i.e., into the fraction of two integers. 


1071 
In[2] := Rationalize[%, 10-5] —> Out[2] = 394 (20.6b) 
With 0 accuracy, Mathematica gives the possible best approximation of the number x by a rational 
number. 
Numbers of different number systems can be converted into each other. With BaseForn|z, b], the num- 
ber x given in the decimal system is converted into the corresponding number in the number system 


with base b < 36. If b > 10, then the consecutive letters of the alphabet a, b, c, ... are used for the 

further digits having a meaning greater than ten. 

W A: In[1] :— BaseForm|255, 16) — outfiJ=ffi (20.7a) 
In[2]:— BaseForm[N[E, 10, ]j  —9 Out[2] = 2.5576052131 (20.7b) 

"The reversed transformation can be performed by b^^mmmn. 

Bb B:In[1]:— 8^^735 —5 Out[1] = 477 (20.7c) 


Numbers can be represented with arbitrary precision (the default here is the hardware precision), and 
for large numbers so-called scientific form is used, i.e., the form n.mmmim10^ c qq. 





20.2.3 Important Operators 


Several basic operators can be written in infix form (as in the classical form in mathematics) 
« symb; op symbs >. However, in every case, the complete form of this simplified notation is the 
expression op[symb;, symb»]. The most often occurring operators and their complete form are col- 
lected in Table 20.2. Most symbols in Table 20.2 are obvious. For multiplication in the form a b, the 
space between the factors is very important. 
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Table 20.2 Important Operators in Mathematica 





a+b Plus|a, b] u-—- v Equal[u, v] 
aboraxb  Times|[a, b] wl=v  Unequal|w, v] 
a^borea^  Powerl|a, b] rod Greater[r, t] 

a/b Times|a, Power|b, —1]] r2t GreaterEqual[r, t] 
u—> v Ruleļu, v] s«t Less[s, t] 

r=s Set|r, s] st LessEqual|s, t] 











The expressions with the heads Rule and Set will be explained. Set assigns the value of the expression 
s on the right-hand side, e.g., a number, to the expression r on the left-hand side, e.g., a variable. 
From here on, r is represented by this value until this assignment is changed. The change can be done 
either by a new assignment or by x — . or by Clear[r], i.c., by releasing every assignment so far. The 
construction Rule should be considered as a transformation rule. It occurs together with /. which is 
the substitution operator. 


Replace|t, u — v] or t/. u—» v means that every occurance u in the expression t will be replaced by 
the expression v. 
B In[1]:— zy [.y— a b. —5 Out[1] = (a+b)? +x 
It is typical in the case of both operators that the right-hand side is evaluated immediately after the 
assignment or transformation rule. So, the left-hand side will be replaced by this evaluated right-hand 
side at every later call. 
Here, two further operators have to be mentioned with delayed evaluation. 
The FullFormofu:— vis SetDelayed[u,v| and (20.82) 
the FullFormofu: — v is RuleDelayed[u, v] (20.8b) 
The assignment or the transformation rule are also valid here until it is changed. Although the left- 
hand side is always replaced by the right side, the right-hand side is evaluated for the first time only at 
the moment when the left one is called. 
The expression u —- v or Equal[u, v] returns True if u and v are identical. Equal is used, e.g., in 
manipulation of equations. 


20.2.4 Lists 
20.2.4.1 Notions 


Lists are important tools in Mathematica for the manipulation of whole groups of quantities, which are 
important in higher-dimensional algebra and analysis. 
A list is a collection of several objects into a new object. In the list, each object is distinguished only 
by its place in the list. The construction of a list is made either by the command 

List[al, a2, a3, ...] or by (a1, a2, a3, ...} (20.9) 
if the elements can be simply enumerated. To explain the work with lists, a particular list is used, 
denoted by /1: 

In[1]:— ll — List[al, a2, a3, a4, a5, a6], —> Out[1] = {al, a2, a3, a4, a5, a6} (20.10) 
Mathematica applies a short form to the output of the list: It is enclosed in curly braces. 
'Table 20.3 represents commands which choose one or more elements from a list, and the output is a 
"sublist" . 
Wi For the list /1 in (20.9) one gets, ο.ρ., 

In[2]:—First(l|] —> Out[2]=al In[3] :=11[[3]] —> Out[3] = a3 

In[4] := l1[[{2, 4, 6}]] —> Outl4] = {a2, a4, a6} 

In[5] := Take[l1, 2} —> Out[5] = {al, a2} 
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Table 20.3 Commands for the choice of list elements 





First|l], Last (]] selects the first/last element 

Most[/], Rest [I| selects the elements except the last /first one 

Part|l, n] or l[[n]] selects the n-th element 

Part|l, [nl, n2, ...)] gives a list of the elements with the given numbers 
Uf[{n1, n2, ...}]] equivalent to the previous operation 

Take[l, rn) gives the list of the first m elements of | 

Take|l, (m, n3] gives the list of the elements from m through n 
Dropl|l, n] gives the list without the first n elements 

Drop|l, (m, n3] gives the list without the elements from m through n 











20.2.4.2 Nested Lists 


The elements of lists can also be lists, so nested lists can be obtained. If entering, e.g., for the elements 
of the previous list /1 (20.10) 


In[1] := al = (b11, 012, 013, b14, b15} 
In[2]:— a2 — (021, 022, 023, 024, 525) 
In[3] := a3 = (031, 032, 033, 094, 095} 
and analogously for a4, a5 and a6, then because of (20.10) a nested list (an array) is obtained which 
is not shown here explicitly. One can refer to the j-th element of the i-th sublist with the command 
Part[l, i, j]. The expression /|[;, j]] has the same result. In the above example (p. 1027), e.g., 
In[4]:— 1|[3, 4]] yields Out[A4] = b34 
Furthermore, Part|l, (i1, i2 ...), {g1, 72 ...}] or U[[{il, 22, ...}, {71, 72, ...}]] results in a list con- 
sisting of the elements numbered with j1, 72... from the lists numbered with i1, i2, . . .. 
W For the above example in 20.2.4.1, p. 1027 
In[1] := 11{[{3, 5}, {2, 3, 4}]] —> Out[1] = {{b32, b33, b34}, (552, 053, 054) 
The idea of nesting lists is obvious from these examples. It is easy to create lists of three or higher 
dimensions, and it is easy to refer to the corresponding elements. 
20.2.4.3 Operations with Lists 
Mathematica provides several further operations by which lists can be monitored, enlarged or shortened 


(Table 20.4). 


Table 20.4 Operations with lists 





Position|l, a] gives a list of the positions where a occurs in the list 
MemberQ|/, a] checks whether a is an element of the list 

Select|l, crit] picks out all elements of the list for which crit holds 
Cases|l, pattern] gives a list of elements which match the pattern 
FreeQ|I, a] checks if a does not occur in the list 

Prepend|, a] changes the list by adding a to the front 

Append|l, a] changes the list by appending a to the end 
Insert|l, a, i] inserts a at position { in the list 

Delete|l, (4, j, ...)]] delete the elements at positions 1, j, .. . from the list 
ReplacePart|l, a, i] replace the element at position i by a 








BW With Delete, the original list /1 (20.10) can be shortened by the term a6: 
In[1] := 12 = Delete(ll, 6] —95 QOut[1] — (a1, a2, a3, a4, a5}, 
where in the output the a? are shown by their values — they are lists themselves. 
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20.2.4.4 Tables 


In Mathematica, several operations are available to create lists. One of them, which often occurs in 
working with mathematical functions, is the command Table shown in Table 20.5. 


Table 20.5 Operation Table 





Table[f, {imaz}] creates a list with max values of f: (1), f(2),..., f (imax) 
Table|f, (i, imin, imax]] creates a list with values of f from imin to imax 
Table|f, (i, imin, imax, di] the same as the last one, but by steps di 








W Table of binomial coefficients for n = 7: 
In[1]:— Table[Binomial[T, i] (4, 0, 7}]] —9  Out[1] — (1, 7, 21, 35, 35, 21, 7, 1} 
With Table, also higher-dimensional arrays can be created. With the expression 
Table|/, (i, i1, i2), (5, 71, 72}, ...] 
a higher-dimensional, multiple nested table is obtained, i.e., entering 
In[2] :— Table[Binomial[i, j], (4, 1, 7), (j. 0, 1] 
the binomial coefficients are got up to degree 7: 
Out [2] = {{1, 1}, (1, 2, 1), (1, 3 3, 1}, {1, 4, 6, 4, 1}, 
{1, 5, 10, 10, 5, 1}, {1, 6, 15, 20, 15, 6, 1}, {1, 7, 21, 35, 35, 21, 7, 1}} 
The operation Range produces a list of consecutive numbers or equally spaced numbers: 
Range[n] yields the list {1, 2,..., n} 
Similarly, Range[n1, n2] and Range[n1, n2, dn] produce lists of numbers (arithmetic sequences) from 


n1 to n2 with step-size 1 or dn respectively. The command Array uses functions (as opposed to function 
5 


values used by Table) to create lists. Array [Exp ,5] yields {ο, εὖ, εἳ, εὖ, εδ} 


20.2.5 Vectors and Matrices as Lists 
20.2.5.1 Creating Appropriate Lists 


Several special (list) commands are available for defining vectors and matrices. A one-dimensional list 
of the form 

v = {vl, v2,..., un} (20.11) 
can always be considered as a vector in n-dimensional space with components v1, v2,...,un. The spe- 
cial operation Array[v, n] produces the list (the vector) (v[1], v[2], ... , v[n]]. Symbolic vector opera- 
tions can be performed with vectors defined in this way. 
The two-dimensional lists /1 (see 20.2.4.2, p. 1028) and /2 (see 20.2.4.3, p. 1028) can be considered as 
matrices with rows ¿į and columns j. In this case bij would be the element of the matrix in the i-th row 
and the j-th column. A rectangular matrix of type (6,5) is defined by /1, and a square matrix of type 
(5,5) by 12. 
With the operation Array[b, (n, m)] a matrix of type (n, rn) is generated, whose elements are denoted 
by b[i, j]. The rows are numbered by i, i changes from 1 to n; by j the columns are numbered from 1 
to m. In this symbolic form [1 can be created as 


11 = Array[b, (6, 5], (20.12a) where b[i j|] — bij (i—1,...,6; j— 1,...,5). (20.125) 


Summarizing, lists can be created either by enumeration or by using the functions Array, Range, 
Table. Note that lists are different from sets in mathematics. 
The operation IdentityMatrix[n] produces the n-dimensional unit matrix. 





With the operation DiagonalMatrix|list] a diagonal matrix is produced with the elements of the list 
in its main diagonal. 
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The operation Dimension|list] gives the size (number of rows, columns, ...) of a matrix, whose struc- 
ture is given by a list. Finally, with the command MatrixForn|list], one gets a matrix-type represen- 
tation of the list. A further possibility to define matrices is the following: Let f(i,j) be a function 
of integers i and j. Then, the operation Table[f[i j], (4, n), (j, m)] defines a matrix of type (n, m), 
whose elements are the corresponding f(i, j). 


20.2.5.2 Operations with Matrices and Vectors 


Mathematica allows formal manipulation of matrices and vectors. The operations given in Table 20.6 
can be applied. 


Table 20.6 Operations with matrices 











ca matrix a is multiplied by the scalar c 
a.b the product of matrices a and b 
Det[a] the determinant of matrix a 
Inverse|a| the inverse of matrix a 
Transpose|a| the transpose of matrix a 
MatrixExp|a| the exponential function of matrix a 
MatrixPower|a, n| the n-th power of matrix a 
Eigenvalues|a] the eigenvalues of matrix a 
Eigenvectors|a] the eigenvectors of matrix a 
E A: In[1]:=r = Ατταγία, {4,4}] —> Out[1] = { { afl, 1], a(1, 2], a[l, 3], a[1, 4] } 
1 aJ2, 1], a[2, 2], a[2, 3], a[2, 4] V, 
{ a[3, 1], a[3, 2], a[3, 3], a|3, 4] }, 
{ al4, 1], a|4, 2], a|4, 3], a|4, 4] } 
In[2] := Transpose[r] -- Uut[2] — ( ( a1, 1], α[2, 1, α[β, 1], α[4, 1] }, 
{ all, 2], a[2, 2], a[3, 2], αἰ4, 3] }, 
{ all, 3], a[2, 3], a[3, 3], a|4, 3] T. 
{ afl, 4], a[2, 4], a[3, 4], a[4, 4] } 


Here, the transpose matrix r! of r is produced. 
Let the general four-dimensional vector v be defined by 
In[3]:— v — Array|u, | —5 Out[3] = {u[l], u[2], u[3], u[4]} 


Now, the product of the matrix r and the vector v is again a vector (see Calculations with Matrices, 
4.1.4, p. 272). 





In[4]:— r.v. —5 Out [4] — ( a1, 1]u[1] - a[1, 2] u[2] 4- a|1, 3] u[3] + afl, 4] uf] , 
a|2, 1| u|1] 4- a2, 2| u[2] - a[2, 3| u[3] 4- a|2, 4| u[4] , 

a[3, 1| u|1] 4- aj3, 2| u[2] - a[3, 3| u[3] 4- a|3, 4] u[4] , 

a|4, 1| u|1] 4- a4, 2] u[2] - a[4, 3| u[3] 4- a|4, 4| u[4] ). 





There is no difference between row and column vectors in Mathematica. In general, matrix multiplica- 
tion is not commutative (see Calculations with Matrices 4.1.4, p. 272). The expression r. v corresponds 
to the product in linear algebra when a matrix is multiplied by a column vector from the right, while 
v.r means a multiplication by a row vector from the left. 


W D: In the section on Cramer's rule (4.5.2.3, p. 311) the linear system of equations pt — b is solved 
with the matrix 


2 13 
In[1] := MatrixForm|p = {{2, 1, 3}, {1, —2, 1}, {3, 2, 2}}] —  out[11 — (: -2 1 
3 22 


and vectors 
In[2] :=t = Array[r, 3] —> Out [2] = {2[1], x[2], 2[3]} 
In[3] := b= {9, —2,7} —> Out[3] = {9, —2, 7}. 
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Since in this case Det [p] --13 4 0 holds, the system can be solved by t = p~+b. This can be done by 
In[4]:— Inverse[p.b with the output of the solution vector Out [4] — (—1, 2, 3). 


Note that a b calculates the componentwise product and Exp [a] gives the matrix containing the exp 
of the components of the matrix a. 


20.2.6 Functions 
20.2.6.1 Standard Functions 


Mathematica knows several standard mathematical functions, which are listed in Table 20.7. 


Table 20.7 A few standard functions 





Exponential function Εχρ|χ] 
Logarithmic functions ^ Logl|x|, Log[b,x] 
Trigonometric functions | Sin|x], Cos|x], Tan|x], Cot[x], Sec[x], Csc[x| 


Arc functions ArcSin|x], ArcCos|x], ArcTan[x], ArcCot[x], ArcSec[x], ArcCsc[x] 
Hyperbolic functions Sinh|x|, Cosh|x|, Tanh|x], Coth|x], Sech|x], Csch|x] 
Area functions ArcSinh|x]|, ArcCosh[x], ArcTanh[x], ArcCoth[x|, ArcSech|x], ArcCsch[x] 











All these functions can be applied even with complex arguments. 


In every case must be considered the single-valuedness of the functions. For real functions one branch 
of the function has to be chosen (if it is needed); for functions with complex arguments the principal 
value (see 14.5, p. 758) should be chosen. 


20.2.6.2 Special Functions 


Mathematica knows several special functions, which are not standard functions. Table 20.8 lists some 
of these functions. 


Table 20.8 Special functions 





Bessel functions J,(z) and Y;,(z) BesselJ[n.z|, BesselY[n,z] 
Modified Bessel functions /;,(z) and K,(z) Bessell[u,z], BesselK|n,z| 
Legendre polynomials P, (a) LegendreP|[n.x] 

Spherical harmonic Y;™ (ð, φ) SphericalHarmonicyY|I,m, 6, ¢] 








Further functions can be loaded with the corresponding special packages of Mathematica 


20.2.6.3 Pure Functions 


Mathematica supports the use of so-called pure functions. A pure function is an anonymous function, 
an operation with no name assigned to it. They are denoted by Function[r, body]. The first argument 
specifies the formal parameters and the second one is the body of the function, i.e., body is an expression 
for the function of the variable x. 





In[1] := Function|[z, 2° +27] —+ Out[1] = Function|z, αὖ + 27] (20.13) 
and so 

In[2]:— Function[r, i? -a?||] gives Οι [2] -- ? 4 e. (20.14) 
We can use a simplified version of this command. It has the form body &, where the variable is denoted 


by # . Instead of the previous two rows one can also write 

Inf3] := (#8 + #)&[q out (37 =C +e. (20.15) 
It is also possible to define pure functions of several variables: 
Function| {21,22,...}, body] or in short form body &, where the variables in body are denoted by the 
elements #1, #2,.... The sign & is very important for closing the expression, since it can be seen from 
this sign that the previous expression should be considered as a pure function. Let us remark that the 
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pure function #& is nothing else than the identity function: to any argument x it assigns x. Similarly, 
#1& corresponds to the projection onto the first coordinate axis. 


20.2.7 Patterns 


Mathematica allows users to define their own functions and to use them in calculations. With the 
command 

In[1] := £[{x_] := Polynomial|[zr] (20.16) 
with Polynomial() as an arbitrary polynomial of variable «x, a special function is defined by the user. 
In the definition of the function f, there is no simple x, but x_ (pronounced x-blank) with a symbol _ 
for the blank. The symbol r means “something with the name x”. From here on, every time when 
the expression f [something] occurs, Mathematica replaces it by its definition given above. This type of 
definition is called a pattern. The symbol blank_ denotes the basic element of a pattern; y_ stands for 
y as a pattern. It is also possible to apply in the corresponding definition only a * ", that is y^ . 'This 
pattern stands for an arbitrary power of y with any exponent, thus, for an entire class of expressions 
with the same structure. 
The essence of a pattern is that it defines a structure. When Mathematica checks an expression with 
respect to a pattern, it compares the structure of the elements of the expression to the elements of 
the pattern, Mathematica does not check mathematical equality! This is important in the following 
example: Let | be the list 


In[2]:—l — (A, y, y^, y, (t[y'/*], 29) (20.17) 
If one writes 

In[3]:—l[.y^ — yes (20.18) 
then Mathematica returns the list 

Dut[3] — (1, y, yes, yes, (£[yes], 273) (20.19) 


Mathematica checked the elements of the list with respect to its structural identity to its pattern y^ - 
and in every case when it determined coincidence it replaced the corresponding element by yes. The 
elements 1 and y were not replaced, since they have not the given structure, even though y? — 1, y! — y 
holds. 


Remark: Pattern comparison always happens in FullForm. If 

In[4] := b/y/.y_—> yes isexamined then Οωἑ [4] = b yes (20.20) 
This is a consequence of the fact that FullForm of b/y is Times[b, Power[y, —1]|], and for structure 
comparison the second argument of Times is identified as the structure of the pattern. 


With the definition 





(20.21a) 

Mathematica replaces, corresponding to the given pattern, 

In[6] — f[r|] by Out[6] =r? etc. (20.21b) 

In[7] := [a] + f[a] yields Out[7] — à? + z? (20.21c) 
If however, 

In[8]:— f|r| :— z?, then for the same input In[9] := [a] + £[2] (20.21d) 
the output would be 

Out [9] = £[a] -- 3? (20.21e) 


In this case only the (fixed, single) input x corresponds to the definition. 


20.2.8 Functional Operations 


Functions operate on numbers and expressions. Mathematica can also perform operations with func- 
tions, since the names of functions are handled as expressions so they can be manipulated as expressions. 


20.2 Important Structure Elements of Mathematica 1033 





1. Inverse Function, Inverse Series The determination of the inverse function of a given function 
f can be made by the functional operation InverseFunction or InverseSeries. 

Bl A: In[1]:— InverseFunction|f]|[r] —> Out[1] = f~ [x] 

E B: In[1]:— InverseFunction[Exp —> Out[1] = Log 

HC: In[1] := InverseSeries|Series|(g[z], {x,0, 2}]] 

T- g(0] 9" [o] ΙΙ σ[ο|)” Olax πι] 

g'[0] 24109 
2. Differentiation Mathematica uses the possibility that the differentiation of functions can be 
considered as a mapping in the space of functions. In Mathematica, the differentiation operator is 
Derivative|l]|/] or in short form f’. If the function f is defined, then its derivative can be got by f’. 
B ^ In[1]:—f|r]:— Sin|r|Cos|v| With 

In[2]:— f' follows Out [2] = Cos[#1]? — Sin[#1]?&, 
hence f’ is represented as a pure function and it evaluates to 

In[3] :=%|x] —+ Out [3] = Cos|x]? — Sin|z]? 
3. Nest The command Nest[f, x, n] means that the function f nested n times into itself should be 
applied on z. The result is £[£|. .. £[a]] . . .]. 
4. NestList By NestList[f, v, n] a list (v, f[x], £[£|x]], ...) willbeshown, where finally f is nested 
n times. FoldList|/, x, list] iterates a two-variable function. 
5. FixedPoint For FixedPoint[f, x], the function is applied repeatedly until the result does not 
change. 
6. FixedPointList The functional operation FixedPointList[f, x] shows the continued list with 
the results after f is applied, until the value no longer changes. 








Out [1] 


W As an example for this type of functional operation the NestList operation will be used for the 
approximation of a root of an equation f(x) — 0 with Newton's method (see 19.1.1.2, p. 950). A root 
of the equation x cos x = sin z is needed in the neighborhood of 37/2: 


In[1]:— f|[r]:— v — Tanje] In[2]:— f'lUU — Out[2] =1 — Sec[z]? 

In[3] — g[r]:—- x — f[x]/f'[x] 

In[A4] :— NestList[g, 4.6, 4] —> Outl[4] = {4.6, 4.54573, 4.50615, 4.49417, 4.49341} 
In[5]:— FixedPoint[g, 4.6 —> Out[5] = 4.49341 


A higher precision of the result can also be achieved. 
7. Apply Let f be a function which is considered in connection with a list {a, b, c, ...}. Then 
Apply|/ fa, 6,0, ...)]] fla, b, ¢,...] (20.22) 
E In[1]:— hpply[Plus, [u, v, w)]] —> Out[1]=u+ v+ w 
In[2]:— Mpply[List, a 4- b-- c] —> Out[2] = {a, b, c} 
Here, the general scheme of how Mathematica handles expressions of expressions can be easily recog- 
nized. The FullForm of the last operation is: 
In[3] := FullForm|Apply|List, Plus|a, b, c]]] —+ Out[3] =Listla, b, c] 
The functional operation Apply obviously replaces the Head of the considered expression Plus by the 
required List. 
8. Map With a defined function f the operation Map gives: 
Maplf, {a, b, c.. J] — {flal, £0) fld,-.} (20.23) 
Map generates a list whose elements are the values when f is applied to the original list. 
Bl Let f be the function f(x) = x”. It is defined by 


In[1]:— f|r]:— x? With this f one gets 
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In[2] :=Map|f, {u, v, w}] —> Out l2] = {u?, v?, w*} 
Map can be applied for more general expressions: 
In[3] := Map|f, Plus{a, b, c]] —> Out[3] = a +b +e 


20.2.9 Programming 


Mathematica can handle the loop constructions known from other languages for procedural program- 
ming. The two basic commands are 


Dolexpr, {7, 11, 12, di}| and (20.24a) 
While[test, expr] (20.24b) 


The first command evaluates the expression expr, where i runs over the values from 71 to 72 in steps di. 
If di is omitted, the step size is one. If i1 is also missing, then it starts from 1. 
The second command evaluates the expression so far as test has the value True. 
Wl In order to determine an approximate value of e?, the series expansion of the exponential function 
is used: 
In[1] := sum = 1.0; 
Do[sum — sum 4- (2^i/i!), (4, 1, 10}]; 
sum 


Out [1] = 7.38899 


The Do loop evaluates its argument a previously given number of times, while the While loop evaluates 
as far as a previously given condition becomes false. 


(20.25) 


Among other things, Mathematica provides the possibility of defining and using local variables. This 
can be done by the command 
Module[{tl, ¢2,...}, procedure] (20.26) 
The variables or constants enclosed in the list are locally usable in the module; their values assigned 
here are not valid outside of this module. 
W A: A procedure is to be defined which calculates the sum of the square roots of the integers from 1 
ton. 
In[1] := sumg|n_] := 
Module[{sum = 1.}, 
Do[sum = sum + N [Sart [i|], (1, 2, n3]; 
sum |; 


The call sumq[30] results in 112.083. 


The real power of the programming capabilities of Mathematica is, first of all, the use of functional 
methods in programming, which are made possible by the operations Nest, NestWhile, Apply, Map, 
MapThread, Distribute and by some further ones. 


(20.27) 


W B: Example A can be written in a functional manner for the case when an accuracy of ten digits is 
required: 
sumg|n_] := N[Apply[Plus, Table[Sqrt [i], {7, 1, n}]], 10], 


sumq[30] results in 112.0828452. Total|,/N{Range[n], 10]] gives the same result without using an index, 
increasing its value continuously, without needing the variable sum and its initial value. 
For the details, see [20.16]. 
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20.2.10 Supplement about Syntax, Information, Messages 
20.2.10.1 Contexts, Attributes 


Mathematica must handle several symbols; among them there are those which are used in further pro- 
gram modules loaded on request. To avoid many-valuedness, the names of symbols in Mathematica 
consist of two parts, the context and the short name. 


Short names mean here the names (see 20.2, p. 1024) of heads and elements of the expressions. In 
addition, in order to name a symbol Mathematica needs the determination of the program part to which 
the symbol belongs. This is given by the contezt, which holds the name of the corresponding program 
part. The complete name of a symbol consists of the context and the short name, which are connected 
by the’ sign. 

When Mathematica starts, then there are always two contexts present: System’ and Global’. Informa- 
tion about other available program modules can be obtained by the command Contexts|]. 


All built-in functions of Mathematica belong to the context System’, while the functions defined by the 
user belong to the context Global’. 

If a context is actual, thus, the corresponding program part is loaded, then the symbols can be referred 
to by their short names. 

For the input of a further Mathematica program module by << NamePackage, the corresponding con- 
text is opened and introduced into the previous list. It can happen that a symbol has already been 
introduced with a certain name before this module is loaded, and in this newly opened context the 
same name occurs with another definition. In this case Mathematica gives a warning to the user. Then 
the previously defined name can be erased by the command Remove[Global‘name], or the complete 
name for the newly loaded symbol can be applied. 

Besides the properties that the symbols have per definition, it is possible to assign to them some other 
general properties, called attributes, like Orderless, i.e., unordered or commutative, Protected, i.e., 
values cannot be changed, or Locked, i.e, attributes cannot be changed, etc. Informations about the 
already existing attributes of the considered object can be obtained by Attributes[/]. 


Some symbols can be protected by Protect[somesymbol]; then no other definition can be introduced 
for this symbol. This attribute can be erased with the command Unprotect. 


20.2.10.2 Information 


Information can be obtained about the fundamental properties of objects by the commands 

?symbol ^ information about the object given by the name symbol, 

??symbol | detailed information about the object, 

? B» information about all Mathematica objects, whose names begin with B. 

It is also possible to get information about special operators, e.g., by ? := about the SetDelay operator. 
However, the most useful possibility is to put the cursor anywhere in the cell containing the symbol of 
the object in question and press the key F1. 


20.2.10.3 Messages 


Mathematica has a message system which can be activated and used for different reasons. The messages 
are generated and shown during the calculations. Their presentation has a uniform form: symbol :: 
tag, providing the possibility to refer to them later. (Such messages can also be created by the user.) 
Consider the following examples as illustrations. 


E A: In[1] := f|x] := 1/x; In[2] := f[0] 
Power: :infy:Infinite expression Ü encountered. 
Out [2] = ComplexInfinity 

ΒΒ: In[1] :— Log|3, 16, 25] 


Log: :argt:Log called with 3 arguments; 1 or 2 arguments are expected. 
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Out [1] —Log(3, 16, 25] 

In example A, Mathematica warns us that during the evaluation of an expression it got the value oo. The 
calculation itself can be performed. In example B the call of logarithm contains three arguments, which 
is not allowed according to the definition. Calculations cannot be performed. Mathematica cannot do 
anything with the expression. The user can switch off a message with Off[s :: tag]. With On the 
message will appear again. Quiet switches off all the messages. 

With Messages|[syrnbol] all messages associated to the symbol with the name symbol can be recalled. 


20.3 Important Applications with Mathematica 

This section describes how to handle mathematical problems with computer algebra systems. The 
choice of the considered problems is organized according to their frequency in practice and also accord- 
ing to the possibilities of solving them with a computer algebra system. Examples will be given for 
functions, commands, operations and supplementary syntax. When it is important, the corresponding 
special package is also discussed briefly. 


20.3.1 Manipulation of Algebraic Expressions 


In practice, further operations must usually be performed with the occurring algebraic expressions (see 
1.1.5, p. 10) such as differentiation, integration, series representation, limiting or numerical evaluation, 
transformations, etc. In general, these expressions are considered over the ring of integers (see 5.3.7, 
p. 361) or over the field (see 5.3.7.1, 2., p. 361) of real numbers. Computer algebra systems can handle, 
e.g., polynomials also over finite fields or over extension fields (see 5.3.7.1, 3., p. 362) of the rational 
numbers. Interested people should study the special literature. The algebraic operations with polyno- 
mials over the field of rational numbers have special importance. Mathematica provides the functions 
and operations represented in Table 20.9 for transformation of algebraic expressions. See also the 
menu item Palettes|Other|Algebraic Manipulation. 


Table 20.9 Commands for manipulation of algebraic expressions 





Expand|[p] expands the powers and products in a polynomial p by multiplication 
Expandlp, r] multiplies only the parts in p, which contain r 

PowerExpand|a] expands also the powers of products and powers of powers 
Εαοζοτ[ρ] factorizes a polynomial completely 

Collect |p, x] orders the polynomial with respect the powers of x 
Collect|p,{z,y,...}] the same as the previous one, with several variables 
ExpandNumerator[r| expands only the numerator of a rational expression 
ExpandDenominator[r| expands only the denominator 

Εχρϑπάλ11 [τ] expands both numerator and denominator completely 
Together|r] combines the terms in the expression over a common denominator 
Apart|[r] represents the expression in partial fractions 

Cancel|r] cancels the common factors in the fraction 








20.3.1.1 Multiplication of Expressions 
The operation of multiplication of algebraic expressions can always be performed. The coefficients can 
also be undefined expressions. 
E 77/1] : = Expand((r--y —z)!] gives 

Dut [1] = 1t + 4r? y +62? yY? +4ry? +yt-— Ax z — 1222yz — 129? z — Ay? z 

T622;2-- 12m y z2 6? z2 Ay z2 Ay + zt 

Similarly, 

In[3]: — Expand((a x + by?) (c x? — d y?)] 

Out [3] =aca*—adaxy? +bea y? — bdy* 
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20.3.1.2 Factorization of Polynomials 
Mathematica performs factorization over the integer or rational numbers if it is possible. Otherwise the 
original expression is returned. 
Wl In[1]:— p — x9 4- 7a? -- 12x* + 6x? — 25a? — 30x — 25; 

In[2] := Factor|p], gives 

Out [2] = (5 + £) (L+ x +2?) (—5 + x? + αὖ) 
Mathematica decomposes the polynomial into three factors which are irreducible over the rational num- 
bers. 





If a polynomial can be completely decomposed over the complex rational numbers, then this can be 
obtained by the option GaussianIntegers. 


Wb 1n[1]:— Factor[i? 2: 4-5] — Out[1] — b— 2x 3? , but 
In[2]:— FactorGaussianIntegers—- True] 
Out [2] = ((—1 — 21) + z)((—1 + 21) + x) 


20.3.1.3 Operations with Polynomials 


Table 20.10 contains a collection of operations by which polynomials can be algebraically manipulated 
over the field of rational numbers. 


Table 20.10 Algebraic polynomial operations 





PolynomialGCD[pl, p2] determines the greatest common divisor of pl and p2 
PolynomialLCM[pl, p2) determines the least common multiple of pl and p2 
PolynomialQuotient|pl,p2,2] ^ divides p1 (as a function of x) by p2, the residue 

is omitted 
PolynomialRemainder[pl,p2,r| determines the residue on dividing p1 by p2 
MonomialList[p] gives the list of all monomials in the polynomial p 








W Two polynomials are defined: 
In[1]:— p — zê + 725 + 1224 + 6x3 — 25a? — 30x — 25; 
q = x^ -- z? — 62? — 7m — 7; 
With these polynomials the following operations are performed: 
In[2]:— PolynomialGCD[p,g] — Out[2] = 1 + x +2? 
In[3] := PolynomialLCM[p, q]//Factor 
Out [3] = (5 + £)(—7 + 2?) (1 + x - 32)(—5 + x? + x°) 
In[4]:— PolynomialQuotient[p,q,r| —5 Out [4] = 12 + 6x + x? 
In[5]:— PolynomialRemainder[p,q,z| —9 Out[5] — 59 4- 96x 4- 962? -- 372? 
With the two last results one gets 
xÊ +725 + 12x4 + 6x? — 25x? — 30x — 25 F ia 37x + 96a? + 96a +59 
zt + 23 — 622 — 7x —7 πα μη x* a3 — 63? — Tr — T' 
20.3.1.4 Partial Fraction Decomposition 


Mathematica can decompose a fraction of two polynomials into partial fractions, of course, over the 
field of rational numbers. The degree of the numerator of any part is always less than the degree of the 
denominator. 














W Using the polynomials p and q from the previous example one gets 


6 —55 --11z 4-62? 
InfiJ:=A t » Dut[1] — t 
LLI = Apartla/pl u=- ΕΠ 
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20.3.1.5 Manipulation of Non-Polynomial Expressions 

Complicated expressions, not necessarily polynomials, can often be simplified by the help of the com- 
mand Simplify. Mathematica will always try to manipulate algebraic expressions, independently of 
the nature of the symbolic quantities. Here, certain built-in knowledge is used. Mathematica knows the 
rules of powers (see 1.1.4.1, p. 7): 

In[1]:— Simplify[a"/a")] —5 Out[1] = a "*^ (20.28) 
With the option Trig — True, the commands Expand and Factor can express powers of trigono- 
metric functions by trigonometric functions with multiple arguments, and conversely. Alternatively, 
TrigExpand, TrigFactor, TrigFactorlist, TrigReduce, ExpToTrig, TrigToExp can be applied. 


E  In[1]:— TrigExpand[Sin [2x]Cos [2y1] 


Out [1] = 2Cos[x]Cos[y] ?8in[x]-2Cos [x] Sin [x] Sin [y] 2 

In[2] := Factor[Sin [4x], Trig—> True] — 8Cos[x]*Sin[x] + 4Cos[x] Sin[x]) 

Out [2] = 0 

In[3] := Factor[Cos [5x], Trig—> True] 

Out [3] = Cos[x] (1 — 2Cos[2x] + 2Cos[4x]). 
Remark: The command ComplexExpand[expr] assumes areal variable expr, while in the command 
ComplexExpand|[ezpr, (x1, x2, . ..)] the variables x are supposed to be complex. 


In[1] :— ComplexExpand[Sin[2 χ],{χ}} 


n Üut[1] — Cosh[2Im[x]] Sin[2Re[x]]-*I Cos[2Re[x]] Sinh[2 Im[x]] 


20.3.2 Solution of Equations and Systems of Equations 


Computer algebra systems know procedures to solve equations and systems of equations. If the equa- 
tion can be solved explicitly in the domain of algebraic numbers, then the solution will be represented 
with the help of radicals. If it is not possible to give the solution in closed form, then at least numerical 
solutions can be found with a given accuracy. In the following some basic commands will be introduced. 
The solution of systems of linear equations (see 4.5.2, p. 308) is discussed here in a special section (see 


20.3.2.4, p. 1040). 
20.3.2.1 Equations as Logical Expressions 


Mathematica allows the manipulation and solution of equations within a wide range. In Mathematica, 
an equation is considered as a logical expression. If one writes 


In[1] := g = z? + 22 — 9 == 0, (20.29a) 
then Mathematica considers it as a definition of a function with Boolean values. Giving the input 
In[2] := %/. x—> 2, yields Out [2] = False, (20.29b) 


since with this value of x the left-hand side and right-hand side are not equal. 
The command Roots{g, x] transforms the above identity into a form which contains x explicitly. Math- 
ematica represents the result with the help of the logical OR in the form of a logical statement: 

1η [3] --Βοοτο[ο,α] —9 Out[3] = x == —1 — v10||x —— —1 4- V10 (20.29c) 
In this sense, logical operations can be performed with equations. 
With the operation ToRules, the last logical type equations can be transformed as follows: 


In[4] : — (ToRules[X]) 
Out [4] = ((x — —1— v10), (x — —1 + v10} (20.29d) 
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20.3.2.2 Solution of Polynomial Equations 
Mathematica provides the command Solve to solve equations. In a certain sense, Solve perform the 
operations Roots and ToRules after each other. 
Mathematica solves polynomial equations in symbolic form up to fourth degree, since for these equations 
solutions can be given in the form of algebraic expressions. However, if equations of higher degree can be 
transformed into a simpler form by algebraic transformations, such as factorization, then Mathematica 
provides symbolic solutions. In these cases, Solve tries to apply the built-in operations Expand and 
Decompose. 
In Mathematica numerical solutions are also available. 
W The general solution of an equation of third degree: 

In[1] := Solve|x? +a x? +b x+ c== 0, <] 
Mathematica gives 


Out [1] ={{x—> = 
21/3 (—a? +3b) 
; 1/3 
3 (ος +9ab-— 27c + 33/2 γ- (a2? b2) +40 +4a3c— 18abc+ 272) 
i/3 
( 2a) -9ab — 27 c 4- 332 γ (a? 02) 4- 405 -- 4a3c — 18abc 27 2) 
| 3213 } 
dj 
The solution list here shows only the first term explicitly because of the length of their terms. If an 


equation with given coefficients a, b, c has to be solved, then it is better to handle the equation itself 
with Solve than to substitute a, b, c into the solution formula. 


ΒΛ: For the cubic equation (see 1.6.2.3, p. 40) z? -- 6x 4- 2 — 0 one gets: 
In[1] := Solveļ|z? +6 x +2 == 0,1] 


; : 1-1γ3 1-1IV3 1-I/3 1-13 
Out [1] — ([z— 23 — 27/9}, {a> 21/8 23/8 his xy t m η 


W B: Solution of an equation of sixth degree: 

















᾽ 





In[2] : = Solve|x® — 62° + 6x4 — da? + 652? — 38a — 120 == 0, 2] 

Out [2] = {{x—> —-1}, {a> -1- 21}, {a> -1+4 21}, {x= 2}, {a> 3}, {x 4}} 
Mathematica succeeded in factorizing the equation in B with internal tools; then it is solved without 
difficulty. 


If numerical solutions are required, then the command NSolve can be used. 





W Tho following equation is solved by NSolve: 
In[3] : = NSolve[x® — 4a? -- 6z* — 5a? -- 33? — Ay -- 2 == 0, z] 
Out [3] = {{x—> —0.379567 — 0.76948 I), (z—9 —0.379567 -- 0.76948 1], 
(x— 0.641445), (x— 1. — 1. I, (z— 1. - 1. 1), (x— 2.117693) 


20.3.2.3 Solution of Transcendental Equations | 
Mathematica can solve transcendental equations, as well. In general, this is not possible symbolically, 
and these equations often have infinitely many solutions. In these cases, an estimate of the domain 
should be given, where Mathematica has to find the solutions. This is possible with the command 
FindRoot|g, {x,s}], where x, is the initial value for the search of the root. 
WB = In([1] := FindRoot|x + ArcCoth[z] — 4 == 0, {x,1.1}] 
Out [1] = {x—> 1.00502} and 
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In[2] : = FindRoot|x + ArcCoth[z] — 4 —— 0, (2,5)] — Out [2] — (x— 3.72478} 


20.3.2.4 Solution of Systems of Equations 


Mathematica can solve simultaneous equations. The operations, built-in for this purpose, are repre- 
sented in Table 20.11, and they present the symbolical solutions, not the numerical ones. 

Similarly to the case of one unknown, the command NSolve gives the numerical solution(s). The solu- 
tion of systems of linear equations is discussed in 20.3.3, p. 1040. 


Table 20.11 Operations to solve systems of equations 





Solve[{l,; == 11, lz, ==1re,...},vars] solves the given system of equations with respect to 
vars 

Eliminate[{l; ==11,...}, vars] eliminates vars from the system of equations 

Reduce[{l, == r1,...}, vars] simplifies the system of equations and gives the possi- 
ble solutions 

FindInstance[erpr, vars] finds an instance of vars that make expr true 








20.8.3 Linear Systems of Equations and Eigenvalue Problems 


In 20.2.4, p. 1027, the notion of matrix and several operations with matrices were defined on the basis 
of lists. Mathematica applies these notions in the theory of systems of linear equations. In the following 
command m,n denote given integers, not variables. 


P = Array|p, {m,n}] (20.30) 
defines a matrix of type (m, n) with elements p;; — p[[i, j]]. Furthermore 
X = Array([x,{n}] und B = Array|b, {m}] (20.31) 


are n- or m-dimensional vectors. With these definitions the general system of linear homogeneous or 
inhomogeneous equations can be written in the form (see 4.5.2, p. 308) 


P.X == Py X= or Thread[P.X —— B] Thread[P.X —— 0] (20.32) 
1. Special Case n — m, det P Z0 


In the special case n = m, detP 7 0, the system of inhomogeneous equations has a unique solution, 
which can be determined directly by 

X — Inverse|P]. B (20.33) 
Mathematica can handle such systems of up to ca. 5000 unknowns in a reasonable time, depending on 
the computer system. An equivalent, but much faster solution is obtained by LinearSolve|P, B]. 
2. General Case 
With the commands LinearSolve and NullSpace, all the possible cases can be handled as discussed 
in 4.5.2, p. 308, i.e., it can be determined first if any solution exists, and if it does, then it is calculated. 
Now, some of the examples from 4.5.2, p. 308ff. will be discussed. 








W A: The example in 4.5.2.1, 2., p. 310, is a system of homogeneous equations 
πι -- απο 5r4 — x4—0 
πι Ἑ X3 — 2133 - 3234 — 0 
32, — t2+ 8%34+ x40 
zi 329 — 9x3 + Tr, = 0 








which has non-trivial solutions. These solutions are the linear combinations of the basis vectors of the 
null space of matrix p. It is the subspace of the n-dimensional vector space which is mapped into the 
zero by the transformation p. A basis for this space can be generated by the command Nul1Spacelp]. 
With the input 


Inf1]:— p - (5 7,5, 15, (5,1, 2,3), (3, 1,8,1), (1,3, 9, 7)] 
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a matrix, whose determinant is actually zero is defined (check it by calculating Det[p]). Now 
In[2]:— NullSpace[p| and Owt[2] — {{—1, —2,0, 1}, {—3,7, 2,0}} 

is displayed, a list of two linearly independent vectors of four-dimensional space, which form a basis for 

the two-dimensional null-space of matrix p. An arbitrary linear combination of these vectors is also in 

the null-space, so it is a solution of the system of homogeneous equations. This solution coincides with 

the solution found in 4.5.2.1, 2., p. 310. 

W B: Consider the example A in 4.5.2.1, 2., p. 309, 





1} -- 220 + 323 — r4 0 2:5 —2 
31} — x3 4 5rg — 334 — 2:5 —6 
211 1- x3 -- 2:73 — 2134 — 9235 — 8 


with matrix m1 of type (3,5), and vector b1 
In[1]:— ml — ((1, —2,3, -1, 2}, {3, -1, 5, -3, —1}, (2,1,2, -2, -3) 5 
In[2] := bl = {2,6,8}; 





For the command 
In[3] := LinearSolve[ml1, b1] the response is 
LinearSolve : : nosol: Linear equation encountered which has no solution. 
The input appears as output. 
W C: According to example B from 4.5.2.1, 1., p. 309, 
πι - πο 2. 1 
rj — 223 — t= 2 
32, — %2+5%3= 3 
—2z1 + 2%_ + 323 = —4 





the input is 

In[1] := m2 = {{1, -1, 2}, {1, -2, -1}, {3, -1, 5}, {-2, 2, 3}}; 

In[2] := b2 = {1,2,3, —4}; 
To learn how many equations have independent left-hand sides, one calls 

In[3] := RowReduce|[m2];—> Out [3] = {{1,0,0}, {0, 1,0}, {0, 0, 1}, {0,0, O}} 
Then the input is 








10 1 2 
In[4] := LinearSolve[m2,02]; — Out [4] = συ τν =z} 
The answer is the known solution. 
3. Eigenvalues and Eigenvectors 
Eigenvalues and eigenvectors of matrices are defined in 4.6, p. 314. Mathematica provides the possibility 
of determining eigenvalues and eigenvectors by special commands. So, the command Eigenvalues|m] 
produces a list of eigenvalues of a square matrix m, Eigenvectors|[m] creates a list of the eigenvectors of 
m, whereas Eigensystem|[m] gives both. If N[m] is substituted instead of m, then one gets the numerical 
eigenvalues. In general, if the order of the matrix is greater than four (n > 4), then no algebraic 
expression can be obtained, since the characteristic polynomial has degree higher than four. In this 
case, one should ask for numerical values. 
E In[1] :=h=Table[1/(i+ 9 — 1), {i, 5}, {7, 5} 
This generates a five-dimensional so-called Hilbert matrix. 


1111..11111 LOE d. T. d 4.34 [1X4 


Out [1] = [bar ip gl DIDIA EIR Lt q An g- Li AER TR biter ars R att 
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With the command 
In[2] := Eigenvalues|h| 


the answer (which may be not useful) is 


{Root [-1 + 307505 #1 - 1022881200 #17 + ...} 
But with the command 


In[3] := Eigenvalues [N[h]] one gets 


Out [3] = {1.56705, 0.208534, 0.0114075, 0.000305898, 3.28793 x 107°} 


20.3.4 Differential and Integral Calculus 

The notation of the derivative as a functional operator was introduced in 20.2.8, p. 1032. Mathematica 
provides several possibilities to apply the operations of analysis, e.g., determination of the derivative 
of arbitrarily high order, of partial derivatives, of the complete differential, determination of indefinite 
and definite integrals, series expansion of functions, and also solutions of differential equations. 


20.3.4.1 Calculation of Derivatives 


1. Differentiation Operator 

The differentiation operator (see 20.2.8, p. 1032) is Derivative. Its complete form is 
Derivative[n;, ns,...] (20.34) 

'The arguments say how many times the function is to be differentiated with respect to the current 

variables. In this sense, it is an operator of partial differentiation. Mathematica tries to represent the 

result as a pure function. 

2. Differentiation of Functions 


The differentiation of a given function can be performed in a simplified manner with the operator D. 
With D[/ [x], x], the derivative of the function f at the argument x will be determined. 
D belongs to a group of differential operations, which are enumerated in Table 20.12. 


Table 20.12 Operations of differentiation 





D[f [x]. (x. n] yields the n-th derivative of function f(x) with respect to x 
Dif, {21, m1}, {v2, n2},---] multiple derivatives, n;-th derivative with respect tox; (7 = 1,2,---) 
Dt[f] the complete differential of the function f 
d 
Dt|[f, x] the complete differential a of the function f 





απ 
Dt[/, 21, 22, ...] the complete differential of a function of several variables 





WA: In[1] :— D[Sqrt[z? Exp[4v] Sin(a]], x] 
E** 3? Cos[x] + 3E** a*Sin|[x] + 4E** x? Sin[v] 








Out [1] = 
2 JE*? 3 Sin[x] 
MB: Inf1] :=D{(2x+1)*,a] —> Outl1] = (14+22)°* (s x + 3Log[1+2 x) 


The command Dt results in the complete derivative or complete differential. 
MC: Inf1] :=Dt[x? +y°] —> Out [1] = 327Dt[x] + 3y? Dtly] 


E D: Τη[1]:--Ρτ[ο) 1 γϑ,α] --» Out [1] = 3a? + 3y°Dtly, x] 
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In this last example, Mathematica supposes y to be a function of x, which is not known, so it writes the 
second part of the derivative in a symbolic way. The preferable forms of writing are: D[y[£?] 4- y[t]?, t] 
and D[z? 4- y[z]?, z] showing explicitly the independent variables. 

If Mathematica finds a symbolic function while calculating a derivative, it leaves it in this general form, 
and expresses its derivative by f’. 

WB E: In[1]:—D|r f[z, | —. Out[1] — f|x]? + 3xf[x} f [x] 

Mathematica knows the rules for differentiation of products and quotients, it knows the chain rule, and 
it can apply these rules formally: 


E F: In[1]:—D[f[ux], x] — Out[17 — £'[u[z]] u'[x] 


wir] α[α] ν']α] 


viz] ν[ο]2 


B G: In[1]:—D[u[r]/v(x], x] — ^ Out[1] = 





20.3.4.2 Indefinite Integrals 


With the command Integrate|, x], Mathematica tries to determine the indefinite integral 1 f(a) da. 


If Mathematica knows the integral, it gives it without the integration constant. Mathematica supposes 
that every expression not containing the integration variable does not depend on it. 

In general, Mathematica finds an indefinite integral, if there exists one which can be expressed in closed 
form by elementary functions, such as rational functions, exponential and logarithmic functions, trigono- 
metric and their inverse functions, etc. If Mathematica cannot find the integral, then it returns the orig- 
inal input. Mathematica knows some special functions which are defined by non-elementary integrals, 
such as the elliptic functions, and some others. 


To demonstrate the possibilities of Mathematica, some examples will be shown, which are discussed in 


8.1, p. 480ff. 


1. Integration of Rational Functions 
(see also 8.1.3.3, p. 485ff.) 


E A: In[1] :— Integrate[(2x + 3)/(x? + 3? — 22), x] 


Out [1] = Sica Ἔσ]-- Stoele] .. l ogp +z] 
3 2 6 
E B: 7n[1] :— Integrate|(a? 4- 1)/(x(x — 1)9), a] 
Οωτ [1] -- 1 : F 2Log[-1 + x] — Log[x] (20.35) 





(-l4+2)? -l+2 


On the monitor can be seen in the left corner of the next cell a plus sign. Clicking on it one may choose 
either the free-from input or the Wolfram-Alpha query. If one types the integral into one of these then 
there is given the possibility to have a look at all the details of the process of integration. 


2. Integration of Trigonometric Functions 
(see also 8.1.5, p. 491ff.) 
W A: The example A in 8.1.5.2, p. 492, with the integral / sin? x cos? x dx is calculated (substitution 
is done by the program automatically, if needed): 
In[1]: — Integrate[Sin[r|?Cos[x]?, x] 
5 Sin|[z| 1 k 
64 192 





Out [1] = Sin[3 2] z5 Sin[5 z] ΠΠ 
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W B: The example B in 8.1.5.2, p. 492, with the integral / ποστ dx is calculated: 


In[1]:— Integrate[Sin|r]/Sqrt|Cos[r],x] — Out[1] — —2y Cos[r] 
Remark: In the case of non-elementary integrals Mathematica may do nothing. 


E 77/1] = f wax — Out[1] SE 


20.3.4.3 Definite Integrals and Multiple Integrals 


1. Definite Integrals 
With the command Integrate|/, (x, 4, ve}, Mathematica can evaluate the definite integral of the 
function f(x) with a lower limit £a and upper limit ze. 

2 - vm 
W A: In[1]:— Integrate[Exp|-277], (z, 0, Infinity)]] —5 Out[1] = a 
(see Table 21.8, p. 1098, No. 25 for a = 1). 


W B: Ifthe input is 


1 
In[1] := Integrate[—,, {x, —1,1}] one gets 
x 


1 » 
Out [1] = Integrate::idiv: ” Integral of E does not converge on {-1,1}. ' 


In the calculation of definite integrals one should be careful. If the properties of the integrand are not 
known, then it is recommended to ask for a graphical representation of the function in the considered 
domain before integration. 

2. Multiple Integrals 

Definite double integrals can be called by the command 


Integrate|f[r, y], (x, τα, ze] (y, Va; ve] (20.36) 
The evaluation is performed from right to left, so, first the integration is evaluated with respect to y. 
The limits ya and ye can be functions of x, which are substituted into the primitive function. Then the 
integral is evaluated with respect to x. 
W For the integral A, which calculates the area between a parabola and a line intersecting it twice, in 
8.4.1.2, p. 524, one gets 3 

In[1] := Integrateļ|x y?, {x,0, 2}, {y, x°, 2£}] — Out[1] = δι 
Also in this case, it is important to be careful with the discontinuities of the integrand. The domain of in- 
tegration can also be specified with inequalities: Integrate [Boole [x?+y? < 1, {x,—-1,1},{y,-1,1}]] 
gives 7. 


20.3.4.4 Solution of Differential Equations 


Mathematica can handle ordinary differential equations symbolically if the solution can be given in 
closed form. In this case, Mathematica gives the solution in general. The commands discussed here are 
listed in Table 20.13. 

The solutions (see 9.1, p. 540) are represented as general solutions with the arbitrary constants Ci]. 
Initial values and boundary conditions can be introduced in the part of the list which contains the equa- 
tion or equations. In this case a special solution is returned. As examples, two differential equations 
are solved here from 9.1.1.2, p. 542. 

W A: The solution of the differential equation y'(r) — y(x) tan x = cos z is to be determined. 


In[1] := DSolve[y'[r] — y|v] Tan[(v] —— Cos[x], y, x] 
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Mathematica solves this equation, and gives the solution as a pure function with the integrations con- 
stant C1]. 


Table 20.13 Commands to solve differential equations 





DSolve|deq, y[x], x] solves the differential equation for y[x] (if it is possible); y[x] 
may be given in implicit form 
DSolve|deq, y, x] gives the solution of the differential equation in the form of a 


pure function 
DSolve|{deq, deqz,...},y,2] solves a system of ordinary differential equations 








Out [1] = {{ν — Function[x, C[1] Sec[x] * Sec[x] E T ων xi 
If it is required to get the solution value y[r], then Mathematica gives 
In[2]:— y|x]/. 6&1 —5 Owt[2] — {Ο[ἠ5οε[ο) {-86ε[α] 5 + isin? 3j! 


One also could make the substitution for other quantities, e.g., for y'[r] or y[1]. The advantage of using 
pure functions is obvious here. 
Bl B: The solution of the differential equation y'(z)z(x — y(a)) 4- y?(x) — 0 (see 9.1.1.2, 2., p. 542) is 
to be determined. 

In[1] := DSolvely' [x] x(a — y[x}) + y[x]2 == 0, y[2], x] 





E-O 
Out [1] = { {ule —> -x ProductLog|— 7 i} 


Here ProductLog[z] gives the principal solution for w in z — we". The solution of this differential 
equation was given in implicit form (see 9.1.1.2, 2., p. 542). 

If Mathematica cannot solve a differential equation it returns the input without any comment. In such 
cases, or also, if the symbolic solution is too complicated, the solutions can be found by numerical 
solutions (see 19.8.4.2, 5., p. 1018). Also in the case of symbolic solutions of differential equations, like 
in the evaluation of indefinite integrals, the efficiency of Mathematica should not be overestimated. If 
the result cannot be expressed as an algebraic expression of elementary functions, the only way is to 
find a numerical solution. 

Remark: Mathematica can solve some partial differential equations both symbolically and numerically, 


as well, even on complicated multidimensional domains. 


20.4 Graphics with Mathematica 

By providing routines for graphical representation of mathematical relations such as the graphs of func- 
tions, space curves, and surfaces in three-dimensional space, modern computer algebra systems provide 
extensive possibilities for combining and manipulating formulas, especially in analysis, vector calculus, 
and differential geometry, and they provide immeasurable help in engineering designing. Graphics is a 
special strength of Mathematica. 





20.4.1 Basic Elements of Graphics 


Mathematica builds graphical objects from built-in graphics primitives. These are objects such as points 
(Point), lines (Line) and polygons (Polygon) and properties of these objects such as thickness and 
color. 

Mathematica has several options to specify the environment for graphics and how the graphical objects 
should be represented. 
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With the command Graphics|list], where list is a list of graphics primitives, Mathematica is called 
to generate a graphic from the listed objects. The object list can follow a list of options about the 
appearance of the representation. 
With the input 
In[1] := g = Graphics|{Line[{{0, 0}, (5, 5), {10, 3}}], Circle[{5, 5}, 4], (20.37a) 
Text[Style|[“Example” , “Helvetica” ,Bold,25], {5, 6}]}, AspectRatio— Automatic] (20.37b) 
a graphic is built from the following elements: 
a) Broken line of two line segments starting at the point (0,0) through 
the point (5,5) to the point (10, 3). 
example b) Circle with the center at (5, 5) and radius 4. 
c) Text with the content "Example", written in Helvetica font, boldface 
(the text appears centered with respect to the reference point (5, 6)). 
With the call Show|g], Mathematica displays the figure (Fig. 20.1). 
Certain options might be previously specified. Here the option 
Figure 20.1 AspectRatio is set to Automatic. 


By default Mathematica makes the ratio of the height to the width of the graph 1 : GoldenRatio (sce 
e.g. 3.5.2.3,3., p. 194). It corresponds to a relation between the extension in the x direction to the one 
in1:1/1.618 — 1 : 0.618. With this option the circle would be deformed into an ellipse. The value of 
the option Automatic ensures that the representation is not deformed. 


20.4.2 Graphics Primitives 


Mathematica provides the two-dimensional graphic objects enumerated in Table 20.14. 

Besides these objects Mathematica provides further primitives to control the appearance of the rep- 
resentation, the graphics commands. They specify how graphic objects should be represented. The 
commands are listed in Table 20.15. 

There is a wide scale of colors to choose from but their definitions are not discussed here. 


Table 20.14 Two-dimensional graphic objects 





Point|{x, y}] point at position x, y 

Line[{{21, y1}, {2, yo}, ..-}] broken line through the given points 

Rectangle|{21u, Yiu}, {2ro; Yro}] shaded rectangle with the given coordinates left-down, right-up 
Polygon|{{x1, y1},{x2, yo},...}] ^ shaded polygon with the given vertices 

Circle|(z, y), r] circle with radius r around the center c, y 

Circlel|{x, y},r, {a1, a2}] circular arc with the given angles as limits 

Circle|{x, y}, {a, b}| ellipse with half-axes a and b 

Circle|{x, y}, {a, b}, {a1, a2}] elliptic are 

Disk[{x, y},7r], Disk[{x, y},{a,b}] shaded circle or ellipse 

Text|text, {x, y}] writes text centered to the point x, y 








Table 20.15 Graphics commands 





PointSize|a] a dot is drawn with radius a as a fraction of the total picture 

AbsolutePointSize[b) denotes the absolute radius b of the dot (measured in American 
pt (0.3515 mm)) 

Thickness|a] draws lines with relative thickness a 

AbsoluteThickness[b) draws lines with absolute thickness b (also in pt) 

Dashing|{a1, a2, a3,...}] draws a line as a sequence of stripes with the given length (in 


relative measure) 
AbsoluteDashing[(bi,b9,...)]] the same as the previous one but in absolute measure 
GrayLevel[p| specifies the level of shade (p — 0 is for black, p — 1is for white) 








20.4 Graphics with Mathematica 1047 





20.4.3 Graphical Options 


Mathematica provides several graphical options which have an influence on the appearance of the entire 
picture. Table 20.16 gives a selection of the most important commands. For a detailed explanation, 


see [20.16]. 


Table 20.16 Some graphical options 





AspectRatio — tw 


Axes —> True 

Axes —> False 

Axes —> {True, False} 

Frame —> True 

GridLines — Automatic 
AxesLabel — {Z symbol» Ysymbol } 
Ticks —> Automatic 


Ticks > {{21,29,...},{y1, y2,---}} 


sets the ratio w of height and width. Automatic determines 
w from the absolute coordinates; the default setting is 

w =1:GoldenRatio 

draws coordinate axes 

does not draw coordinate axes 

shows only the x-axis 

shows frames 

shows grid lines 

denotes axes with the given symbols 

denotes scaling marks automatically; with None they can be 
suppressed 

scaling marks are placed at the given nodes 








20.4.4 Syntax of Graphical Representation 
20.4.4.1 Building Graphic Objects 


If a graphic object is to be built from primitives, then first a list of the corresponding objects with their 
global definition should be given in the form 


(object, objecta, . . .), 


(20.382) 


where the objects themselves can be lists of graphic objects. Let object1 be, e.g., 
In[1] := 01 = {Circle[{5, 5}, {5,3}], Line[{{0, 5}, {10,5} }]} 


and corresponding to it 


In[2] := 02 = {Circle[{5, 5}, 3]} 


as in Fig.20.1. If a graphic object, e.g., 02, is to be provided with certain graphical commands, then 
it should be written into one list with the corresponding command 

In[3] := 03 = {Thickness[0.01], 02} 
This command is valid for all objects in the corresponding braces, and also for nested ones, but not for 
the objects outside of the braces of the list. 
From the generated objects two different graphic lists are defined: 


In[4] := g1 = Graphics|{ol, 02}] 


; g2 = Graphics|{ol, 03}] 


which differs only in the second object by the thickness of the circle. The call 

Show[gl| and Show[g2, Axes — True] (20.38b) 
givs the pictures represented in Fig. 20.2. 
In the call of the picture in Fig. 20.2b, the option Axes —> True was activated. This results in the 
representation of the axes with marks on them chosen by Mathematica and with the corresponding 


scaling. 


20.4.4.2 Graphical Representation of Functions 


Mathematica has special commands for the graphical representation of functions. With 


Plot[f|[r], (2, tin; mac] 


(20.39) 


the function f is represented graphically in the domain between £ = £min and £ = max. Mathematica 
produces a function table by internal algorithms and reproduces the graphics following from this table 


by graphics primitives. 
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a) b 2 4 6 8 10 


Figure 20.3 


Figure 20.2 
H ifthe function z — sin2z is to be graphically represented in the domain between —27 and 27, then 
the input is 
In[1] := Plot|Sin[2x], (x, —2Pi,2P1]]. 
Mathematica produces the curve shown in Fig. 20.3. 


It is obvious that Mathematica uses certain default graphical options in the representation as men- 
tioned in 20.4.1, p. 1045. So, the axes are automatically drawn, they are scaled and denoted by the 
corresponding x and y values. In this example, the influence of the default AspectRatio can be seen. 
The ratio of the total width to the total height is 1 : 0.618. 


With the command InputForm|%] the whole representation of the graphic objects can be shown. For 
the previous example one gets: 
Graphics[{{{}, {}, {Directive [Opacity[1.], RGBColor [0.368417, 0.506779, 0.709798] , 


AbsoluteThickness[1.6]], Line[[[-6.283185050723043, 2.5645654335783057* '), 
..., (6.283185050723043, -2.5645654335783057* ") 1], (DisplayFunction -> Identity, 


AspectRatio -» GoldenRatio\)), Axes -> {True, True}, AxesLabel -> {None, None}, 
AxesOrigin -> (0, 0}, DisplayFunction :> Identity, 

Frame -> {{False, False}, {False, False}}, FrameLabel -> {{None, None}, 

{None, None}}, FrameTicks -> {{Automatic, Automatic}, {Automatic, Automatic}}, 
GridLines -> {None, None}, GridLinesStyle -» Directive[GrayLevel[0.5, 0.4]], 
Method -» ("DefaultBoundaryStyle" -» Automatic, "ScalingFunctions" -» None], 
PlotRange -> [[-2*Pi, 2«Pi], (-0.9999996654606427 , 0.9999993654113022]] , 
PlotRangeClipping -» True, PlotRangePadding -» ([Scaled[0.02], Scaled[0.02]], 
{Scaled[0.05], Scaled[0.05]}}, Ticks -> {Automatic, Automatic}}] 


Consequently, the graphic object consists of a few sublists. The first one contains the graphics primitive 
Line (slightly modified), with which the internal algorithm connects the calculated points of the curve 
by lines. The second sublist contains the options needed by the given graphic. These are the default 
options. If the picture is to be altered at certain positions, then the new settings in the Plot command 
must be set after the main input. With 

In[2] := Plot{Sin[2z], {x, —2Pi, 2Pi}, AspectRatio—> 1] (20.40) 
the representation would be done with equal length of axes x and y. 
It is possible to give several options at the same time after each other.With the input 


PlotH filz], falz], - --}, {2£, min, Umac}] (20.41) 
several functions are shown in the same graphic. With the command 
Show[plot, options] (20.42) 
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an earlier picture can be renewed with other options.With 

Show[GraphicsArray[list]], (20.43) 
(with list as lists of graphic objects) pictures can be placed next to each other, under each other, or 
they can be arranged in matrix form. 


20.4.5 Two-Dimensional Curves 


A series of curves from the chapter on functions and their representations (see 2.1, p. 48ff.) is shown as 
examples. 
20.4.5.1 Exponential Functions 


A family of curves with several exponential functions (see 2.6.1, p. 72) is generated by Mathematica 
(Fig. 20.4a) with the following input: 


In[1] := £[x_] := 2x; g[x_] := 1042; 
In[2]:— h|zx ] :— (1/2)^z; j[x. ] :— (1/E)^v;k[r ] :— (1/10)^z; 


These are the definitions of the considered functions. There is no need to define the function e”, since 
it is built into Mathematica. In the second step the following graphics are generated: 


In[3] := pl = Plot|{f[x], h|z]}, {x,—4, 4}, PlotStyle—> Dashing|{0.01, 0.02}]] 

In[4] := p2 = Plot|(Exp[z], 3[x]), (v, —4, 4])] 

In[5] :— p3 — Plot|(g[z], k[x] ), (x, —4, 4), PlotStyle—- Dashing|(0.005, 0.02, 0.01, 0.02] ]] 
The whole picture (Fig. 20.4a) can be obtained by: 

In[6] := Show|{p1, p2, p3}, PlotRange—> {0,18}, AspectRatio—> 1.2] 


The question of how to write text on the curves is not discussed here. This is possible with the graphics 
primitive Text. 





a) b) 


Figure 20.4 
20.4.5.2 Function y = æ + Arcoth x 


Considering the properties of the function Arcoth: discussed in 2.10, p. 93, the function y = x + 
Arcoth can be graphically represented in the following way: 


In[1] := f1 = Plot|x + ArcCoth|[x], (x, 1.000000000005, 7}] 
In[2] := f2 = Plot|x + ArcCoth|[z], {a, —7, —1.000000000005}] 
In[3] := Show|{f1, f2},PlotRange—> {—10, 10}, AspectRatio—> 1.2, Ticks— 
{{{—6, 6}, {-1, —1}, {1, 1}, {6, 6} }, {{2.5, 2.5}, {10, 10}}}, AxesOrigin— > 0,0] 


The high precision of the x values in the close neighborhood of 1 and —1 was chosen to get sufficiently 
large function values for the required domain of y. The result is shown in Fig. 20.4b. 
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20.4.5.3 Bessel Functions (sce 9.1.2.6, 2., p. 562) 
With the calls 
In[1] := bj0 = Plot|{BesselJ[0, z], Besse1J|2, z], Besse1J|4, z])]. (2,0, 10), PlotLabel—> 
TraditionalForm[{BesselJ[0, z], BesselJ[2, z], BesselJ[4, 2]}} (20.44a) 
In[2] := bj1 = Plot|{BesselJ[1, z], Besse1J(3, z], Bessel J[5, z]}, {z,0, 10}, PlotLabel—> 
TraditionalForm[{BesselJ[1, z], BesselJ[3, z], BesselJ[5, z]}]]  (20.44b) 
the graphics of the Bessel function J,,(z) for n = 0,2,4 and n = 1,3,5 are generated, which are then 
represented by the call 
In[3] := GraphicsRow|{bj0, bj1}]] 
next to each other in Fig. 20.5. 


1 J(n,z) n=0,2,A 0.6 J(n,z) n=1,3,5 





b) 
Figure 20.5 


20.4.6 Parametric Representation of Curves 
Mathematica has a special graphics command, with which curves given in parametric form can be graph- 
ically represented. This command is: 

ParametricPlot|(f.(t), f(£) . (t.t, 09]. (20.45) 
It provides the possibility of showing several curves in one graphic. A list of several curves must be 
given in the command. With the option AspectRatio—> Automatic, Mathematica shows the curves 
in their natural forms. 
The parametric curves in Fig. 20.6 are the Archimedean spiral (see 2.14.1, p. 105) and the logarithmic 
spiral (see 2.14.3, p. 106). They are represented with the input 

In[1] := ParametricPlot|{t Cos[t],¢ Sin[t]}, {¢,0, 3Pi}, AspectRatio— Automatic] 
and 

In[2] := ParametricPlot[{Exp|0.1t] Cos[t],Exp[0.1¢] Sin[t]}, {t,0, 3Pi}, 

AspectRatio—> Automatic] 

With 

In[3] := ParametricPlot|{t — 2 Sin[t], 1 — 2 Cos[t]}, {t, -Pi, 11Pi}, AspectRatio— 0.3] 
a trochoid (see 2.13.2, p. 102) is generated (Fig. 20.7). 
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Figure 20.7 


Figure 20.6 
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20.4.7 Representation of Surfaces and Space Curves 


Mathematica provides the possibility of representing three-dimensional graphics primitives. 

Similarly to the two-dimensional case, three-dimensional graphics can be generated by applying differ- 
ent options. The objects can be represented and observed from different viewpoints and from different 
perspectives. Also the representation of curved surfaces in three-dimensional space, i.e., the graphical 
representation of functions of two variables, is possible. Furthermore it is possible to represent curves 
in three-dimensional space, e.g., if they are given in parametric form. For a detailed description of 
three-dimensional graphics primitives see [20.5], [20.16]. The introduction of these representations is 
similar to the two-dimensional case. 


20.4.7.1 Graphical Representation of Surfaces 


The command P10t3D in its basic form requires the definition of a function of two variables and the 
domain of these two variables: 


Plot3D[f(r, y], (x, a, ve], (y, Va ve] (20.46) 
ΑΙ options have the default setting. 
E For the function z — z? 4- y?, with the input 

In[1] := Plot3D[x? + y?, {a, —5, 5), (y, —5, 5), PLotRange— (0, 25)] 
we get Fig. 20.8a, while Fig. 20.8b is generated by the command 

In[2] :— Plot3D|(1— Sin[r]) (2 — Cos[2 y]), (x, —2, 2}, {y, —2, 2}] 
For the paraboloid, the option PlotRange is given with the required z values, because the solid is cut 
at z = 25. 
20.4.7.2 Options for 3D Graphics 


The number of options for 3D graphics is large. In Table 20.17, only a few are enumerated, where 
options known from 2D graphics are not included. They can be applied in a similar sense. The option 
ViewPoint has special importance, by which very different observational perspectives can be chosen. 








Figure 20.8 


20.4.7.3  Three-Dimensional Objects in Parametric Representation [6 


Similarly to 2D graphics, three-dimensional objects given in parametric representation can also be 
represented. With 


ParametricPlot3D|{ f,[¢, ul, fylt, ul, felt, ul}, {t, ta, te}, {u, wa, Ue] (20.47) 
a parametrically given surface is represented, with 
ParametricPlot3D| f, |t], fylt), felt]}, {t, ta, ted] (20.48) 
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a three-dimensional curve is generated parametrically. 


Table 20.17 Options for 3D graphics 





Boxed default setting is True; it draws a three-dimensional frame around the surface 

HiddenSurface sets the non-transparency of the surface; default setting is True 

ViewPoint specifies the point (x, y, z) in space, from where the surface is observed. De- 
fault values are {1.3, —2.4, 2} 

Shading default setting is True; the surface is shaded; False yields white surfaces 

PlotRange (za; ze], (a, £e}, {Ya Ye}, {Za Ze} } can be chosen for the values All. De- 
fault is Automatic 
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Figure 20.9 


W Thoeobjects in Fig. 20.9a and Fig. 20.9b are represented with the commands 
In[3] := ParametricPlot3D[(Cos|!] Cos[u], Sin|t] Cos[u], Sin[u]], (t, 0, 2Pi) 
{u, —Pi/2, Pi/2}] (20.49a) 
In[4] :— ParametricPlot3D[(Cos|t], Sin|t], 1/4], (t, 0, 20)] (20.49b) 


Mathematica provides further commands by which density, and contour diagrams, bar charts and sector 
diagrams, and also a combination of different types of diagrams, can be generated. 


W The representation of the Lorenz attractor (see 17.2.4.3, p. 887) can easily be generated by Mathe- 
matica. 

There is a series of recent developments most of which are not to be shown in a book. One can eas- 
ily build a GUI (graphical user interface) to utilize interactive properties of the program. Most of 
the calculations are parallelized automatically, but functions such as Parallelize and ParallelMap 
provides the user to create his/her own parallel programs. The extremely fast graphic cards can be 
programmed at a very high level (as opposed to other languages) using such functions as CUDALink, 
OpenCLFunctionLoad etc. An extremely useful example of dynamic interactivity tool is Manipulate 
which in the simplest case shows you the parameter dependence of a family of curves. Working in the 
cloud or using the computer Raspberry Pi (which comes a free Mathematica license) should also not be 
unmentioned. 
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21.1 Frequently Used Mathematical Constants 


























T 3,141592654... | Ludolf constant (7) 1°/, | 0,01 percent 
e 2,718281828... | Euler constant (e) 1?/,, | 0,001 per mil 
C 0,577215665... | Euler constant (C) V2 1,414 2136... 
lge = M | 0,434294482... | In10 = M— — 2,302585093... | V3 | 1,7320508... 
lg2 0,01030... | m2 = 0,693 1472... VTO | 31622777... 





21.2 Important Natural Constants 


This table contains values of constants, recommended in [21.19], [21.20], [21.21]. In parenthesis is given 
the standard uncertainty of the last two digits. The note (fixed) indicates that this value is fixed by 


definition. 





Fundamental constants 





velocity of light in vacuum Ὁ 


Avogadro constant NA 


— 299 792 458m/s (fixed) 


= 6,022 141 29(27) -10% /mol 


— 6,626 069 57(29) -10-9*Js — 4,135 667 516(91)-10- PeVs 


= 6,582 119 28(15)-10-1eVs 


gravitation constant G  —6,67384 (80)-10- m? /(kg s?) 
fundamental electric charge € = 1,602 176 565(35)-10-!9 C 

fine structure constant a — jococ? / (2h) — 7,297 352 5698(24) -1073 
Sommerfeld constant at = 137,035 999 074(44) 

Planck constant h 

Planck quantum h/(27) h = 1,054 571 726(47)-107°*Js 








Electromagnetic constants 


permeability of free space 
permittivity of vacuum 
quantum of magnetic flux 
Josephson constant 

v. Klitzing constant 

quantum of conductance 
character. impedance (vacuum) 
Faraday constant 


spec. fund. electric charge —e/me 


= —1,758820088(39)-10" Ckg 


jo — —4m-10-7 N/A? — 12, 566370 614- 10-7 Vs/Am (fixed) 
Eo = 1/ (foc?) = 8,854 187817-10- As/Vm (fixed) 

o =h/(2e) = 2,067 833 758(46)-10- Wb 
Ky = 2e/h = 483597,870(11)-109 Hz/V 


Rg = h/e? = 25812,807 4434(84) Q 


Go  —2e/h — 1,748091 7346(25) 10-5 S 
Zo  =376,730313461 2 (fixed) 
F =eN, = 96485,3365(21) As/mol 








Boltzmann constant 


universal gas constant, molar 
molar volume of inert gas 
(To =273,15 K, pp =100 kPa 


molar volume of inert gas 


Loschmidt constant (Zo, po) 
Loschmidt constant (To, pı) 
standard acceleration of gravity 


jl) 





45? geographic latitude, sea leve 


Constants in physical chemistry, thermodynamics, mechanics 
k = Ro/Na = 1,380 6488(13) -10 7J/K 


= 8,617 3324(78)-105eV /K 
Ro = Nak =8,3144621(75) J/(mol K) 
Ving = RoTo/po = 22,710953(21) 107? m?/mol 


Ving RoTo/po — 22,413 968(20)-107? m?/mol 


(T, —273,15 K, py —101,325 kPa 
no = Na/Vany = 2,651 6462(24) +1025 /m? 
no. — Na/Va, — 2,686 7805(24)-1025/m* 

(earth, gn =9,80665 ms? (fixed) 
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Atomic electron shell and atomic nucleus 





atomic mass unit u Mau = (107kg/mol)/Na = mao (C) 

= 1,660 538 921(73)-10-2" ke 
quantum of circulation (electron) 5 = h/(2me) = 3,636 947 5520(24)10~4 m?/s 
Bohr radius ao = hi? /(Eo(e)e?) = re/a? = 0,529 177 210 92(17) -107 m 
classical electron radius fà = a?ao = 2,817 940 3267 (27) -10715 m 
Thomson cross-section σο = 8rr2/3 = 0,665 245 8734(13) 10-78 m? 
Bohr magneton μι — eh/ (2m) — 927,400 968 (20)-10?6 J/T 

— 5,788 381 8066(38) -10-?eV/T 
nuclear magneton m — eh/(2mj) — 5,050 78353(11)-107?" J/'T 

= 3,152 451 2605(22)-10-8 eV/T 
nuclear radius R — rg AV; ry — (1,2...1,4) fm; 1 € A < 250: 


9fm2 R2 ro 
rest energy 









































atomic mass unit Eo(u) = 931,494061(21) MeV 
electron Eo(e) = 0,510998 928(11) MeV 
proton Eo(p) = 938,272 046(21) MeV 
neutron Eo(n) = 939,565 379(21) MeV 
rest mass 
electron Me — 9,109 382 91(40) -107?! kg — 5,485 799 0946(22) -107* u 
proton Mp = 1,672 621 71(29)-10-?” kg = 1 836,152 672 61(85) me 
= 1,007 276 466 812(90) u 
neutron My = 1,674 927 351(74)-10-?” ke = 1 838,683 659 8(13) me 
= 1,008 664915 60(55) u 
magnetic moment 
electron με = —1,001 159 652 1859(41) jig 
— —928,476 412(80) -10?9 J/T 
proton m = +2,792 847 356(23) jy, = 1,410 606 71(12)-10-78 J/T 
neutron μα — —1,913042 72(45) jn, — 0,966 236 47(23)-107?6 J/T 
21.3 Metric Prefixes 
| Prefix Factor Abbrevation || Prefix Factor Abbrevation 
Yocto 107 y Deka 101 da 
Zepto 10771 z Hekto 10? h 
Atto 10-15 a Kilo 10? k 
Femto 105 f Mega 105 Μ 
Pico 1013 p Giga 10? G 
Nano 10-9 n Tera 1055 T 
Mikro 1076 μ Peta 1015 P 
Milli 10-3 m Exa 1015 E 
Zenti 107? c Zetta 1021 Ζ 
Dezi 107! d Yotta 1024 Υ 
B 10? —1000. M 107? — 0,001. B 105m — 1 km Bl 1m — 109m. BI 1m — 107? m. 
Remark: The metric system is built up by adding prefixes which are the same for every kind of mea- 








sure. These prefixes should be used in steps of powers with base 10 and exponent +3: milli-, micro-, 
nano-; ruther than in the smaller steps hecto-, deca-, deci-. The British system, unlike the metric one, 
is not built up in 10's e. g.: 11b = 16 oz = 7000 grains. 
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21.4 International System of Physical Units (SI Units) 


Further information about physical units see [21.8], [21.4], [21.15]. 


















































SI Base Units 
length m meter 
time 8 second 
mass kg kilogram 
thermodynamic temperature | K kelvin 
electric current A ampere 
amount of substance mol | (1mol =N, particles, Na = AVOGADRO-constant) 
luminous intensity cd candela 
Additional SI Units 
plain angle rad | radian o —l/r,lrad —1m/1m 
solid angle Sr steradian Q = S/r?, 1sr = 1 m?/1 m? 
Examples of SI derived units with special names and symbols 
frequency Hz Hertz 1 Hertz = 1/s 
force N Newton 1N = 1kg m/s3? 
pressure, tension Pa Pascal 1Pa = 1 N/m? = 1kg/(ms?) 
energy, work, J Joule 1J=1Nm=1kem?/s? 
quantity of heat kWh | kilowatt hour | 1kWh = 3,6-10°J 
power W | Watt 1W -1Nm/s —1J/s 2 1 kgm?/s? 
electric charge C Coulomb 1C=1As 
electric voltage ν Volt 1V =1W/A = 1kgm?/(A sè) 
electric capacitance F Farad 1F—1C/V —1A?s2/J — 1 A?s!/(kg m?) 
electric resistance Q Ohm 10 2 1V/A — Lkgm?/(A? s?) 
electric conductance S Siemens 1821/0 — 1 A? s?/(kgm?) 
magnetic flux Wb | Weber 1Wb-i1Vs-lkgm?/(As?) 
magnetic flux density T Tesla 1T = 1 Wb/m? = 1 kg/(A 2?) 
inductance H Henry 1H =1Wb/A = 1lkgm?/(A?s?) 
luminous flux lm Lumen 1 µη -- 1 ο 5 
illuminance Ix Lux 1 lx = 1 cd sr/m? 
Further derived SI units without special names 
speed, velocity m/s acceleration m/s? 
angular velocity rad/s | angular acceleration rad/s? 
momentum kgm/s | angular momentum kg m?/s 
torque Nm moment of inertia kg m? 
action Js energy Ws 
area n? volume m? 
density kg/m? | particle number density | m^? 
electric fieldstrength | V/m magnetic fieldstrength | A/m 
heat capacity J/K specific heat capacity J/(Kkg) 
entropy J/K enthalpy J 
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activity 


dose equivalent 
absorbed dose 


Further derived SI units with special names and symbols 
Bq | Becquerel | Bq = 1s7t 
Sv | Sievert Sv — Jkg ! 
Gy | Gray Gy —Jkg ! 




















Some units outside the SI accepted for use with the SI 





area 
area 
volume 
velocity 
mass 


energy 


focal power 
pressure 


plain angle 


time 


ar 
b 

1 
km/h 
u 

t 

eV 


dpt 
bar 
mmHg 
grad 
minute 
second 
min 

h 

d 


a 


Ar 
barn 
litre 


unified atomic mass unit 
metric ton 
electronvolt 


diopter 

Bar 

mmHg column (Torr) 

1° = 7/180 rad 

1’ = (1/60)° = 7/108 00 rad 
1” = (1/60)' = 7/648 000 rad 
minute 

hour 

day 

year 


ar = 100 m? 

barn = 10-78 m? 

1= 10-3 m3 

km/h = 0,277 778 m/s 

u = 1,660 5655- 107” kg 
t = 1000 kg 

eV = 1,602 176 565(35) 


-10719 Nm 


dpt = 1/m 
bar — 10? Pa 
mmHg — 133,322 Pa 


° = 0,017 453 293 ... rad 
’ = 000 290 888 ... rad 
" — 000004848 ... rad 


min — 60s 

h = 3,6-10?s 

d — 8,64-10* s 

a = 365d = 8760h 





Some units 


outside 


the SI currently accepted for 


use with the SI 





length 


volume 
plain angle 


velocity 
energy 

pressure 
activity 








ua 
pe 
ly 
A 
sm 
bbl 
gon 


kn 
cal 
atm 


Ci 





astronomical unit 
parsec 

light year 

Angstrom 

nautical (intern.) mile 
U.S. barrel petroleum 
gon 

gon minute 

gon second 

knot 

calory 

standard atmosphere 
Curie 





AE — 149,597 870-10? m 
pe = 30,857-1045 m 

Lj = 9,460447 63-10? m 
À — 10719 m 

sm — 1852m 

bbl — 0,158 988 m? 

8 = 0,57 -107? rad 

e = 0,5r-1074 rad 

æ — (,57-10- 8 rad 

kn = 1sm/h — 0,5144 m/s 
cal — 4,1868 J 

atm = 1,013 25-105 Pa 
Ci = 3,7- 10! Bq 





Remark: 1 Pa =1Nm~? = 10~® bar = 7,52 - 107? Torr — 9,86923 - 10-9 atm ; 
latm = 760 Torr = 101325 Pa; 1 Torr = 133,32Pa — 1mmHg; 
About units accepted only in some EU states see [21.15]. 





bar = 0,987 atm = 760,06 Torr . 
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r? 15 xz gent 
Artanh x tea ta tetet DEI 
1 1 1 1 1 
nome x Y 3x? | δα | Ta? Ὃν (2η. ἠ- 1) απ uet 
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21.6 Fourier Series 





1. y— rzfor0« r « 2x 














sinr  sin2xr  sin3r 
υ-π-2λ 


|t. πα 





2. y—xfor0E zm 
y = 2r — x for n < x < 2r 


y 
ZNZNZN 


-n θπ2π Απ 


π 4 cos3r | cosbr 
COS X I 


3 5? 





3. y—-5xfor--«r«m 





il 3 * 8 








(= x sin2dx sin3x ) 
y=2 UT 








4 (s 
y—-—|sinz— 
T 





sin3r sinr 
3. 5. 





5. y—aforüc rcm 


y — —aform « z « 2n 








4a /. singz sinr 
y= = (sine + + ee) 
π 9 


5 
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6. y=O0for0<a2<aandforn-a<a2<a7+aand2r7-a<a2< 27 
y—aforacr«m-—a 
y =—aforrtr+a<r< r-a 


1 
y= — (cos o Sin  4- — cos 3a sin 3x 
T 3 


1 
+s cos 5a sin 5a +--+ ) 








ax 
7. y=— for-a<a<a 
a 


y-aforaczzzm-a 











4a 1 
a(r—z y = —— [sino sin z 4 — sin3a sin3r 
“ΕΞ... mea y zal tg 
Q Ý 
y=—aforn+a<ax<2r7-a +35 sin5asin 5a +++) 
5 
π 6V3a 1 1 1 
Especially, for a = — holds: y= : (sn € = sin 5x + z sin 7s sinlla+-- ) 
BP 3 ΣΣ 5; n 12 





8. y—a?for-n X v Xm 








T? cosr  cos2r cos3z 
. M 


y 1 22 32 








-π On 2r3r4rE 

















T? cos2r | cosáv | cosÓm 
y a2 po to tog E 


10. y—z(r—z)for0€ € m« 
y = (n — x)(2r — x) for t < x < 2r 
1 1 
y= - (sin a+ Ej sin 3x + ρα sin 5g +-+- ) 
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ll. y=sinzfor0<a<7 

















2 4 e e cos4x | cos6r ) 
p παπι n\ 13 35 50 
-π 0 m m 
12. y=cosxfor0<a<7 
_4 ea 4sin4x | 6sin6r ) 
VUA d» ^ 3 πρ 











13. y=singzfr0<r< r 


y =Q forr < x < 2r 























1 i 3 2 (= 2x m cos 4x 
y=—+=sing—-—— 
Y 2 n\ 1-3 3-5 
cos Ôx P" ) 
5-7 
14. y —cosurfor—z <r <r 
2u sin ut | 1 cost  cos2r σοβϑα , 
we Stays 
d π 2u9 42—1 u-4 w—9! 
(u arbitrary, but not integer number) 
15. y —sinurfor—c- «zr «7 
2sin ut ( sinc 2sin2r | 3sin3r | ) 
y= Rs 
Y T 1—u?  4—u? 9—u? 





(u arbitrary, but not integer number) 





16. y= zcosrfor-7 « r «v 


4sin2r  6sin3r 


2-1 3? 








S 
= —- Snr 
riu 





Τε. 


8sin4r 


1 





17. y —1n (2sin 5) forO<a<a7 


1 1 
y —cosz 4 5 90824 + ἃ 90899 }-».' 





18. y — ln (2s 5) forO<a<7 





1 1 
y = cos x — = cos 2% + + cos 3x — -+ 
2 3 





1 T 
19. y-5ln cot 5 forü c zm 


1 1 
y = cos T + 4 cos de + = cos bx ++: 
5 
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21.7 Indefinite Integrals 


(For instructions on using these tables see 8.1.1.2, 2., p. 482). 
21.7.1 Integral Rational Functions 
21.7.1.1 Integrals with X = ax + b 

Notation: X = ax + b 














1 
1. [ra= mat (n Z —1); (for n — —1 see No. 2). 
2. i -imx. 
1 b 
23. [ox dx = xr m nl 
2 £ a?(n +2) a?(n 4- 1) 


(n Z —1,z —2) (forn — —1,— —2 see No. 5 und 6). 


MYN Jo 1 myn 1 
4. Je X de = a [X -0 X” dX (n #—1,# —2,...,# —m). 


The integral is used for m < n or for integer m and fractional n; in these cases (X — b)™ is expanded 
by the binomial theorem (see 1.1.6.4, p. 12). 





























5 E 
J X a a? 
x dx b 1 
6. Ux? 7 gX + qa nx. 
αν 1 1 - b 
X3 qi Xx 2X1] 
8 zdr 1 ze | b i: pnm 
, X"  a?l(n—2)X7? ' (n—1)X"1 nei, : 
x? dr 1.1: : 5 
ο ο. 2X+ mX). 
2 
10. E πᾳ m 
X 
z?dz 1 . 02b Be 
11. ]x In X 4 zal 
ax? da: 1 2b p 
12. = j — 
J Xn 5 [zm -3903  (n-2)3"3  (n- a (n ,#2,#3) 
a? dr 1 (x3 i " 
13. J z -3(5-3 30X^ | ax —UnX). 
a? da: Xx? à b 
14. x2 = ( 5 3b X + 3b° In X 4 xl 
zdz 1 3a op 
15. πας a(x 3bln X X E i 
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16. 


IT: 


18. 


1 


20. 


© 


2 


22. 


23. 


24. 


25. 


σι 


26. 


e 


27. 
28. 


29. 


30. es = C pm pmtn-1 X 


© 


= 





x dx 1 3b 3b? 09 
a are (mx | + ) : 


a? dm 1 -1 3b 3p? | p? 
l —4)X"34  (n—-3)X"3  (n—-2)X"3 ' (n—1)X" 
(n Z 1,2 2,2: 3, Z 4). 


X^ at 





I dx — 1 ( X + x) 

Jax? BA xz X] 

/ dr ι i X x 2ar ar? 
Jax 7 DA - x 2X1] 


dr 1 X TH fm — 1 (—a)'st 
a -5 |j 
/ αχ. dm |n eo ( i ) ixi ie 




















i=l 
dx 1 ας xX 
n T hr B p? n 
dx 1 1 2. X 
[μπητ lpx * az hz] 
dx 1 2 1 3. X 
s τος zx " bx t ab?z 5 i ] ` 





dx 1 2 [n\ (afr! X X 
Io μυ... | Σ (5 (i— 1)X*1 Tum n 2 (n2 2). 


d 

















/ dr 1], l X 2X | X? 
DX B| a x 21313 
/ dx 1 303] Χ ax x X?  3aX 
J z3X? wq ΠΤ Χ a m 
dx 1 2 X 4r afr? X? 4aX 
/ : 6a^In H — 4 ' 
J z3X3 b5 x X 2X? 22? x 





i dx 1 Y n-1Y (-aya? 3 aX? (n+1)aXx 
q3X" p2 i J(i—-2)X:? 212 x 


i=3 


1 ™2?lmtn—2\ Xa) 
= i (m — i — 1)ym-ii" 


κ If the denominators of the terms behind the sum vanish, then such terms should be replaced by 


m -2 X 
n d ) (—a)" 1n —. 


m-—1 T 
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Notation: A = bf — ag 
ax +b ar A 
31. fet dx = Ft pnt +9). 
da 1 [09 
iz / (ax+b)(fe+g) Δ n ax +b (A70): 
x dx 1 |b g 
33. B In(ax 4- b) — = In(fa A I 
/ (ar --b)(fz--g) Δ E απο f 20 E Pn 
απ d 1 f. fe+g 
m πη xit An H pur 
rdr b a a+r 
35. b). 
/ (a4 z)(b-- x)  (a—b)b--zx) (α-ὐ) T (α΄) 
x? dx b? a? i? — 2ab 
36. E + x) + ln(b+ zr b). 
/ (αἠ-α)(ρἠ-α)λ (b—a)(b+a T (b— a)? nata) (b — a)? neta) πο 
da: - 1 1 2 a+r 
37. = H b). 
/ (a4-z)*(b-- x)? (α-- 0) ( Tag ses) (a — 0)? "bz par 
x dx 1 a b a+b a+r 
38. = 1 b). 
J (a+r)? (b+r) (α-- 0) (. +r b+ -) (a — 0)? 2 Fr ere) 
x? dx -1 a? b? 2ab a+r 
^ / (a + z)?(b 4- x)? ~ (a — 0)? e z iz) ΠΣ κ τσ. (70): 
21.7.1.2 Integrals with X = az? + bx +c 
Notation: X = az? + bz +c; A = 4ac — b? 
40. [&- x arctan TE . for A » 0), 
E SUE Artanh a for A < 0), 
og COE. for A < 0). 
vV—A  2ax ae ae 
dr | 2ar4b dx 
41. X?^ AX + | X I (see No. 40). 
dr 2ax+b/ 1 3a θα”. f dx 
42. x3 y (za | Ax) FA (see No. 40). 
GH v poe 2 T 
43. dx 2ax +b , (2n — 3)2a dx 








Xn 


(n — 1)AX"- 


D^ (n—-l)A J Xn-v 
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xd b lx 
ἂν [E ee [m (see No. 40). 
X 2a 72a] X 
x dx br42c b fdr 
45. x AX AX (see No. 40). 
46 x da bx + 2c b(2n — 3) dx 
i X^ (n—1)AX"-!  (n—1)A J Xn 
zdr z b i? —2ac [ dx 
4T. xX a In X 4 Faz X (see No. 40). 
z?dx (b —2ac)r bc | 2c [dx 
48. x aAX | AÍX (see No. 40). 
x? dx =g 2 dr — 2)b απ απ 
νο : E bnc ΕΙ (see No. 43 and 46). 
X^ (2n—3)aX"-!  (2n—3)aJ] X^  (2n—3)aJ X" 
50 am dx c gm (m — 1)c a” da 
i Xn (2n — m — l)jaXn-1 ' x --πι-- Ίαν X” 
Qn mom) τ - y^ (m z 2n — 1); (for m — 2n — 1 see No. 51). 
αλ]. 1 pa” 3dr gens ᾖ. gn dx 
" ΞΕ: 
: X^ Qo Kal a X^ a. X^ 
dx 1,2 b [dax 
52. |Ξ-τ nT] X (see No. 40). 
dx 1 b [dx 1 dx 
53. / = p= / ᾷ 
J zrX^  2c(n—1)X"! 2c) X” εὐ αΧπὶ 
dx b, X 1 P ^a dx 
54. { l sce No. 40). 
J2PX 2°. ce T (zs 2) J X (bee No 40) 
dx 1 (2n 4- m — 3)a da: 
55. f = / 
am Xn (m — 1)exm-1Xn-1 (m — 1)c gm-2Xn 
(n 4- m — 2)b / dx 
r e T: 
(πι-- 1ο J απ. Χη pad 
απ 1 T 3 
56. / z = fin (fx +9) 
J e+ oX” Nef - dif + ga) X 
2ga — bf dx 
re ae see No. 40). 
onm regu] x (00549) 
21.7.1.3 Integrals with X — αἲ -- 2? 











x, 





Notation: X = a? + 
T 
arctan — for the 


a 





£ 1 
Y — 4 Artanh rm In 
a 2 





ατα 





Arcoth nl In m 
a 2 





“+” sign, 
T e » 1 

for the *—" sign and |v| « a, 
a « » a 

for the “—” sign and |a| > a. 


If there is a double sign in a formula, then the upper one belongs to X = a? +2, 
and the lower one to X — a? — z?, 


a » 0. 
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5T. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


71. 


72. 


73. 


TA. 


75. 


76. 






























































dr 1. 
Lx. 
dx x 1 
J KXT WX 203^" 
dx x 3r 3 
XS CX? Bx C Re 
απ 2 2n —1 f dz 
/ X"H CC 2ggqiX" m 2na? ] X" 
τ E ας In X. 
de 1 
] x? '2x 
τα. 1 
J X TTX 
iu ~ TES μη). 
a? dx 
/ X sarp ay. 
gdt _ x 1 
xi Tay tx 
gdr _ z x 1 Y 
X3 4X? 8a2X  8a3 ` 
x? dx a 1 dx 
/ KAF m) xe (£09 
gu E πο — Pu 
X TR 2 
a dx a 1 
X? πχ + 5 1n X. 
a dx 1 a? 
χι αχ Ταν 
x dx 1 a 
X"i-^-^3m-ij-i*zs (2D 
> = ES In a" 
πΧ 2a? X 
dx 1 1 x 
[557 ax aa se 
dx 1 1 1 x 
| oe ax ae on 
απ la- i 
/ aX —— dg 
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TT. 


78. 


79. 


80. 


81. 


82. 


i 
/ 
/ 
| 
/ 
I 





















































dx 1 üt 3 
aX? aiy  3adiX ' 245 
dr 1 x Τα 15 Y 
r? X3 αν ' 4a^X? ' 8a9X ` 8a ` 
dz 2o 1. zd uw 
wx 2a?:32 2α X 
d PEE IN T : lw 
r3 X2 2a'a3?  2a^X a6 X 
dr 1 - 1 B 1 B 3 In x 
x3 X3 2a9:? aX  4a4X?  2a8 
dx [ 








(b+ cx) X 7 gdrp 





cln(b + cx) SmX by ; 





21.7.1.4 Integrals with X = a? + 2 


83. 


84. 


85. 


86. 


87. 


88. 


89. 


90. 


91. 


92 


93. 


94. 


/ 


/ 


sr 


/ 
/ 











Notation: a? + zì = X; if there is a double sign in a formula, the 


upper sign belongs to X = a? + x°, the lower one to X = a® — 2°. 

























































































dx 1 In (ata)? 1 esie rca 
X 63 aparte?  q23 av3 ' 
dx © 2 dx 
ΧΙ 3aix 38) X 
rdr 1 in a? Fax + 2x? 1 ο 20 Fa 
X θα (ata)  αγΏ av3 ` 
sdr c a? 1 x dx 
O? ~ 3a8x * 303 Χ 
a a = +1 In X. 

X 3 
x? dx d 

X? "3X 
x? dx co cq fde 

X = τι a f X 
a? dr | 9 , leds 

x ~*3x*3/ x 

dx 1 a? 
zX ^35 ny 

da _ l qo d jis a? 
rX?  3a3X 3a X` 

dr — 1 _1 frdr 
PX dr dj] X 

dr 1 x 4 x dx 
z2X? ax © 3a6X ' 345) Χ 








(see No. 


(see No. 


(see No. 


(see No. 


(see No. 


(see No. 


83). 


85). 


83). 


83). 


85). 


85). 
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dx 





Ες 


απ D 
95. [a = me 1 
x 5 


m | ax axe πο. * 8a$X ' 3a6 
21.7.1.5 Integrals with X = af + zt 


dx 



































dax 1 QU = arv? + a? 1 can? 
97. / | ar tan- ] 
4404 4aM/À α)- ααγϑ-αἲ αῦγΏ — 2? 
a dx 1 z 
98. Ia TA = 2 arctan a 
r? dx 1 a? E axv/2 4- a a 
99. [a i^ In arctan — z: 
a+r 4a v2 — ax V2 + a? xg -r 
z? dx 


100. X 


21.7.1.6 Integrals with X = a‘ — «4 


ae η 1 afat t x^). 


(see No. 83). 


(see No. 83). 








101 / dx 1 i a+r d i T 
' =- iy arctan 
J at— rt Δα a-r 2a a 
α απ 1 a? + 22 
102. jas Τα αλ a? 
x? da 1 £ x 
103. j Tor In oe arctan 
a^—z* 4a a-z α α 
-ᾱ : 
x? dx 1 4A 
104. / et ο. — z^). 


21.7.1.7 Some Cases of Partial Fraction Decomposition 




















, where it holds 


105 : d b 2 
` (a+bx)\(f+gx) fb—ag\a+br f+grj)` 
106. ! Ξ B I —, where it holds 
(c+a)(a+b\(e@+c) e+a «+b ate 
1 1 1 
ΑΞ , B jm s 
(b — a)(c — a)' (a — b)(c — b) e (a — c)(b — c) 
um 1 zo Fe, 
"(r-cayrdbr-cc(r-d) rca rb rcc rd 
1 1 
ο ατα πα ο της ας ο 
1 il b g 
108. = à 
(a + ba?)(f + ga?) fb—ag \at+ba? f+ ga? 
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21.7.2 Integrals of Irrational Functions 
21.7.2.1 Integrals with yz and a? + bx 








Notation: 
bx 
arctan bve for the sign “+”, 
a 


1, a+byr 


= 25 RE 
a— br 
If there is a double i in a formula, then the upper one 
belongs to X — a? -- l?z, the lower one to X — a? — b?x. 


X =@+br, Y = 





for the sign “—”. 























r dr 2 
109. [ae PE ey 
































X p tp 
2375 9 J/ 73 2 
m ΘΕ 
Χ 30 bt b5 
ra νο 1 
111. =F 57 ct —Y. 
PX abè 
$48 / ο mE 2vz?  3a^/z δαν, 
| X PX υχ b 
da 2 
113. | x =—Y. 
Xx b 


114 i dr _ 2 -2y 
` Χντ αντ ' αὖ ᾿ 


us. f da: = ve iy 
X?/r  a?X αὖ) 


Up p 2h52. / 
i = _ 2 3r δι, 




















aX yr aX œ 





21.7.2.2 Other Integrals with yx 








di 1 x v 2x a 1 να 
117. ve ον In ae | arctan B £ ; 
Az 2avV2  r—av2r-a?  ay2 a?-—-z 
dr 1 v 2x 4- a? 1 V 2x 
118. f i 3 In TEM 2 + arctan . r ; 
J (a+r Ίνα 2a3/2  r—av2r--a?  a*y2 a^ — 





Vid: vidre — ln a4 Vr 
a^—a? 2a a 


a+ ναι 
120. = In 
/ (a4 — x?) x 25 τα- ET 


1 
arctan 





gj ar ctan 
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21.7.2.3 Integrals with Vax +b 
Notation: X = ax + b 














121. [ VXax = ZVÝ. 


a 


ην... às 3 
122. [xa AT 











15a? 
2(15a?z? — 12abx -- 8 )v X 
SY di 
123. Js X dx 10503 ; 
124. dx _ 2vx 
VX a 
xdr 2(ax-— 2b) 
125. Ss = — vi. 
VX 3a? 
τοῦ J x? da _ 2(3a2x? — dabx + 8b?) /X 
of SX Ὁ 15a? ᾿ 
2 X 1η νΧ -νὖ 
. —— Arcoth for b > 0, 
127. f dr Z] vb E ONEEN 
ay E forb <0 
7a an | or b « 0. 





VX dx 
128. f ase axes 
2 ry X 


129 / dx VX a dx 
i VX ba 2b ην 

















130. [aa = OVX a dx 
g F VX 
131 / da _ VX (2n — 
. an X (n. -- l)ban-i (2n m = ; AE m 


5 
im. [ via, E 


90, 


133. [vx dx = a (5v X7 — τον Χ5) ; 
: 35a? 














9 T 2 v5 
194. [NE= clum. ae | 
9 a 9 7 5 
a/ 3 MX κ 
135. / s dg WX" enVx «v f a 
T xv X 


13 


e 


x dx 2 b 
PETAR 


(see No. 127). 
(see No. 127). 


(see No. 127). 


(see No. 127). B 
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x 2 
137. Im UE NS 
π( τ να σε) 
dx 2 1 dx 
138. [= +5 [ see No. 127). 
tVX38 ὑνΧ bJ) VX ene ) 
dx 1 3a 3a dx 
139. | - / see No. 127). 
AJ uX PVX GR IX l ) 
B os 
140. J Xar = L. 
a(2 X n) 
2 X (4tn)/2 bX @tn)/2 
141. jae dz = = ————— |. 
: a2 \ 4n 2n 





i / το 7 2 = 2b X (4+n)/2 A] 
i T T= I : 


T 
aè 








6n 4n 2 τι 





143. dx. 





XP? da 2Χ"2 Xon 
fce 


zx TL 





dx 
144. Í = ' E 
xXn? E» 2/7 p] gxo-272 


145 η ο. 1 na dx 
` I 2 Χπ/2 bxXoG-2/ 90 xXnf?' 


21.7.2.4 Integrals with vax + band y fz +g 


Notation: X = az +b, Y = fxr +g, A =bf — ag 


2 | fX 
-Jaf arctan e for af « 0, 


dx 
146. ΨΧΥ = ure E for af 7 0, 
In (v aY + FX) for af » 0. 
F f 


xrdr — vXY ag+bf απ 

















a 

















147. = — see No. 146). 
VXY af 2af J XY ( ) 
148. dx 2/X 


VXVYS AVY 


2/0 en JV X 
us / di . SEAT arctan MENT for Af « 0, 
`J YVX 1 pX vA 


VAT” VX + VAF 


A +2aY A? dx 
150. | VXY dr = ————vV XY — — see No. 146). 
J dx taf Baf J| JXY (see No. 146) 


for Af » 0. 
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A dx 
151. ye-i VxY - = [ see No. 146). 
hix dx δα) JXY (see No. 146) 
VXdx 2X A dx 
152. = πο see No. 149). 
J Y 5 Y/X (s ) 








Y* dz 2 Y”! dg 
153. = al AY? ^] z i 
VX (2n —- 1)a ( " x-) 


da 1 VX 3 dx 
154. Sai ale (n 5) a / : 
J /XyYn (n — 1)A LY" 2/ J /XyYnaà 


155. j VXY" dr = (vx XY" 4A fe | (see No. 153). 














στο 


156. 





pe 1 (X a dx ) 


n-Df4 Yvi*2J VXyva 


21.7.2.5 Integrals with Va? — x? 


Notation: X — a? — a? 


1 zr 
157. [ xa ο («vx + a? arcsin z) : 
} a 














158. [xa = -VIS 


m 2 f 
159. [ 2X ds = -7V X34 - («vx + a? arcsin z) - 
. 4 a 














: y X5 VX’ 
160. [NX i=- è ; 
5 3 
VX VX 
161. [Sa = v X —aln ο . 
q x 
VX VX T 
162. [Gav = ——— — arcsin E 
J g x 
VX VX 1 VX 
163. [Su ————-—ln (ΞΕ : 
z3 23? 2a x 
isa, [een 
` —— = arcsin —. 
VX a 
xdr 
165. — =-vX. 
J VX 
x? dx x 2 T 
166. / UE = ~5Vx + S arcsin Z, 





adr | X3 
167. / LETS ax. 
Za 
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168. 


169. 


170. 


171. 


172. 


173. 


174. 


175. 


176. 


177. 


178. 


179. 


180. 


181. 


182. 


183. 


184. 


/ 
J 








dx 


ην X 


dx 


eh 


aX 


i 


dx 
VX 





a+ τα 
α 
VX 
atx” 
VX 1 


2a?3?  2a3 


[8a ies 


1 
[VX dx = -gVX. 


v Y. 2 3 Av / X 6 : 
2 ια vv X5 . αν Χ ata Q9 . 9 
fe v X3dr 6 | 31 + 16 - 16 arcsin x 
SXT 2,/ X5 
[ών dx = 2E REV : 
3 T 5 
ΨᾺΞ gi 3 X 
j dx a?V/ X —a ln er VE 
J c 1 
X x m z 
/ dx -- 3 ME — 2 arcsin = 
J x x 2 2 a 
v X3 VX3 3VX 3a a+VX 
j dx = +> ln : 
z3 2x2 2 2 z 
drt c 
VR WX 
x dx 1 
JKX X 
x? dx in © 
—— — arcsin — 
VX τα a` 
xv dx 
— 
JX x 
/ dx 1 1 in” + VX 
J VX? ὢνψχ à? n 
/ dr 1l VX d g 
J 72X83 at x νΧ)᾽ 
/ du NE: 3 2t VX 
Jo gy X3 2a?532/X δαν 2αὔ xoc 


n 





zr 





a4 vVX 


3a?x 3a* x 
E DX + 51 arcsin Ž i 
D 2 a 
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21.7.2.6 Integrals with V2? + a? 
Notation: X = a? + a? 
1 2 EE 
185. [VX u= 5 (ων a^ Arsinh z) +C 
E x [sv X 4- a?ln (x + ΨΧ)] TG. 





186. [xa = PV. 


187. [ VX de = Ίνα" = (vx ee a^ Arsinh =) +C 
f 2 


τν να 


VX® ανχὸ 


5 3 











188. [NX dr = 






































189. [Zi cux 
190. s: i E X) +0. 
191. paco ο y 
x Qx? 2a x 
dr 
192. Ix = Arsinh = C la(s +v X) +C. 
x dx 
193. —— =VX 
VX 
αν x a? 
194. VX © Arsinh 2 2 c= ΣΧ In (x | VX) +C. 
J 4X 2 2 
adr VX? 
195. j = -eX. 
Z g 
196. / BS Eg m 
zy X a z 
197. / dx _ NX 
X atx 
198. = EN E PENS 
13 Χ 24211. 2ᾱ9 x 
3a?x 3a^ FN 
199. [vx 3 dz = av X3 : VX J Arsinh FC 
a 











i 
2, 4 
= I ο PE. VK + ΠΩ) TG. 
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1 
200. [αν X?’ dz = s X^ 


q/ X85 2, 3 Αχ 
[ ?/6a- zv X* dr X?  a'zV X  a* Arsinh +C 










































































201. 
6 24 16 16 a 
αγ Χδ  απνχὸ απνχ αν 
= - In (a | V X) 4 Ch. 
6 24 16 16 
SXT 2,/ X5 
202. is X3 dz = i =£ EE : 
5 
VX _ yX X3 X 
203. / a@ VX Emt tZ. 
© 3 m 
/ X3 / 3 a 
204. I x dz = — x + S ME + a Arsinh +C 
J x£ x 2 2 a 
VX3 
Se + AX + 39 In (a + vX) + Ci. 
T 
/X3 X3 
205. / x dx à 3 X ain PEUT 
J! 2x2 2 
dx x 
206. = ; 
J VX avVX 
tdr 1 
207. = ———. 
J VX? VX 
x2 dr PA 
208. C X + Arsinh τ = C= Xx | In ( | vX) + Cy 
x da a? 
209. SNK p 
VX? VX 
210. = = M dy RAE 
rV/X9 dX αὖ t 
211. { EL esl ώς 
a2 X3 a4 X x VX 
da 1 3 3, ac-VX 
212. j - = I In ; 
a9 / X3 9011912 Χ | 2a4/X — 2a5 T 


21.7.2.7 Integrals with V2? — a? 
Notation: X = a? — a? 
213. [νΧα = 5 (ον — a? Arcosh z) +C 
; a 
= [sv x — a?ln (υ + Vx)] +6. 














214. [X à = d 
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215. J? X dz - 2 vx τετ (ενχ —a ? Arcosh 7) +C 
; 2 
= 349 T. Ave 
1X "τς [sx a? ln (a 4 vx)|4 Οι. 
/ X5 2,/X3 
216. J? X dz = τ κ: 2: 
5 3 
217. [Ziv = VX — a arccos = 
x 
VX VX V2 
218. [Ga = —~= + Arcosh = +O = -Y* bm(z +v X) TG. 
Jom x 
VX VX 1 a 
219. us d == f gg 910008 T 
dx v 
220. I: = Arcosh = + C — In (« + VX) +C. 
απ 
221. —— =VX 
VX 
redr x a? z 
222. VX 4 Arcosh — + C 5vx qu nre v X) +0. 
VX 2 2 a 
rdr VX? 
223. / ον αντ 
Vx 3 +a 
224 (= ee a 
A pde arccos =. 
dx ν 
225. / Á—— 
22 X a?a 
dx VX 1 a 
226. E. πρ T qs 8recos x 
1 _ 3a T 3a* x 
227. [vx 3dr— 4 xv X -g VA tg Arohi +C 
a 
1 3a?x 3a* 
=F a ο. an (εντ) +C. 


1 
228. ΠΕ; E gs X^ 








u/ X5 v3 A. C 
229. [ 28 E z ας c T E Arcosh = a 
rV X5 p AS iy X 6 
rv a? v α αν Χ e In (« vx) μα. 























6 24 16 
/ΧΤ 2 /X5 
230. EI 7 P A. 
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231. 


232. 


233. 


234. 


235. 


236. 


237. 


238. 


239. 


240. 






































ee 


ln (« + vX) +C. 








J X3 \/X3 
i x dx = = a? VX + a? arccos E 
à E 3 z 
VX? J/X3 
| Ae ——p c. X - λα ? Arcosh 2 
ΐ ^s 
meo Layee In δε 
= m πρ a σε 2: arccos Č 
x i 22 2 2° X 
dx ο db 
ΜΧ:. qH/X- 
x dx = 1 
VÆ νΧ 
x dx £ 
= F Arcosh 2 τ. e 
v X3 vx V X 
x da 
-JXx- 9. 
V X3 T 
/ dr 1 Ie ou 
Mi ax a arccos =. 
J d: sl VX a m 
DS4X) — adam WX} 
/ = : 2 : ccos = 
— arccos —. 
J a/X39  2a?32/X 2atVX — 2a5 z 


21.7.2.8 Integrals with vVar? + bx + c 


241. 


242. 


243. 


244. 








Notation: X — aa? 


+br +c, 


4a 
A=4ac— 8, k= — 
ac A A 











a In (2v aX 4 2az 4 b) +C fora > 0, 
1 . _ 2az +b 
J2- momia το fora>0, A>0, 
J VX αν fora>0, A=0, 
να 
-= arcsin A forac0, A «0. 
/ dr  — 2(2ax 4) 
JI XVX AVX 
i dx = 2(2ax + b) ( 1 à 2k) 
` i 3AVX ΑΧ 
i 2(2ax + b) 42 n — 1) s 
πΏππῃ )/2 ~ (Qn — DAXC (2n— pat 2n—1 xev 1)/2° 
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245. 


246. 


247. 


248. 


249. 


250. 


251. 


252. 


253. 


254. 


255. 


256. 


257. 


258. 


259. 


260. 


























ο ας Γὐν Χ V X dx 
[VX ae = 4n + — x Vx 
(2aa + b)/X ( 3 ) da: 

XVX dz = X4 t 

/ vx de 8a 2k VX 
9 .0 (2ax - b) X ( 2 5X ) 5 da: 

[ X°VX de = 12a Aot t wa) 168) X 
[ xem o (2aa + b)X 2r+0/2 2n 4-1 [| xe 3s. 
: 4a(n + 1) 2k(n 4 1). 

ταν VX b da 

JX α 2a J yX 
i xdx —— 2(br + 2c) 
J XVX A/X 

x dx 1 

X(2n+1)/2 (2n = 1)aX 9n-0/2 2a / X (2n+1)/2 

x? dx 2 3b 30? — Λας : 
/ Tx 7 (z-i2) H Vx 
/ adx _ (2b? — 4ac)x + 2be d 1 f dx 
Jl XX aA^V X aJ yX 

XVX  b(2ax 4 b) dx 
αν Χ dz = 

fay rds - Fd 4akJ VX 
[ 2XVX de = E esf XVX dz 
1 a 
[oxo dz = ο ^ αν (2n-+1)/2 dx 
: Qn+3)a 2a 





[eVX dr = (- 3) XVX | SU — dac 


/ 





dx 





VX 


e 





4a 16a? 


1 2V cX 





AC 











X b / dx 
cz 36 αγΧ 


LG sia dx le 
rw X 





ΠΕΣ 


ο. +C forc>0, 
forc>0, A>0O, 


forc>0, A=0, 


forc<, A<0. 


(see No. 241). 


(see No. 241). 


(see No. 241). 


(see No. 241). 


(see No. 244). 


(see No. 241). 


(see No. 241). 


(see No. 241). 
(see No. 246). 


(see No. 248). 








(see No. 245). 


(see No. 258). m 


(see No. 241 and 258). 
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VX dx VX 11; b d: 
261. es +a Zi i : (see No. 241 and 258). 
x? x VX 2J yX 
X(27+1)/ X(n+)/2 b XQn-10/2 
262. [^—— =- +> pen dx + ef — — — dx (see No. 248 and 260). 
x 2n 1 2 2 
263 / τ 2 amu 
' — = vVar dq. 
J gyar? + br bx 
264 dx “a r—a 
: —— = arcsin 
J y2az — a? a 
x dx r—a 
265. —————, — —V2az — a? -F a arcsin d 
V2ax — x? a 
r—a a? r—a 
266. fv Qax — x? dx = 2 V2ax — x? + 2 arcsin : 
a 


267 / dx = : arctan 29 — M 
` J (ax? +b) frz +g νύψας- Ὁ VbV fa? F g 
1 Vb fa? 4 g 4- x /bf — ag 
= In 
Wbh/bf—ag Vb /fax? 34 g — x bf — ag 





(ag — bf » 0), 





(ag — bf « 0). 








21.7.2.9 Integrals with other Irrational Expressions 


268. j Vax + bdx = ee Vax +b. 





n4 1)a 
269 / dz — (ας) 1 
: Var-c-b | (n—1)a Yar+b 
370 / da: 2 n T ova" + a? 
2 —M——mw = — — 1n ———————. 
TVX” +a? na Var 
dx 2 a 
271. i at mot auci. 
J gyr” —a? na ' καπ 


ντα: 32... (5y 
272. παπα z arcsin a 


21.7.2.10 Recursion Formulas for an Integral with Binomial Differential 


273. [e +b) dx 








1 
= ——_ nan" +b} + npb f a” lax” ο. ds] ; 
m+np+1 
1 
EX) = πρ τη | -ο"”"(ασ" E D)? 4 (m 4 n4 np 4 1) / 1" (ax" -- pP*! da! ; 


1 
(m+ 1)b 





απ (ax + b)P*! — a(m +n + np +1) ] s +b)” da! ; 
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1 
= μπι πα πε, ppt — . Do fa TR DP da 
a(m 4- np 4- 1) s (aa™ + b) (m — n4 1) (ax" 4- b)? da: 


21.7.3 Integrals of Trigonometric Functions 


Integrals of functions also containing sin x and cos x together with hyperbolic and exponential functions 
are in the table of integrals of other transcendental functions (see 21.7.4, p. 1092). 


21.7.3.1 Integrals with Sine Function 


1 
274. fsa ax dx = — = cos az. 
a 


1 1 
275. [s ax dx = =x — — sin 2az. 
2 4a 








z 1 
276. fi ax dx = —— cosaxz + — cos? ax. 
a 3a 
; 3 1. ες 
277. [sin ax dx = =x — — sin 2ax + — sin 4az. 
A 8 4a 32a 
:Qn-l 
. sin” "arcosar n— 1 m 2 
278. [su ax da — ————————— 4 —— [su 2 aa dx (n integer numbers, > 0). 
i na moa 
; sinar xcosaxr 
279. / xsinax dx = z : 
j α a 
. 22, x 
280. [5 sin az dx = —; sinax — | — — — ] cosaz. 
1 α a a 
ατα 3a? 6)\. x? 6a 
281. [ο sin ax dx = —z — az] Sinar — | — — — | cosax. 
; a a a a 
yn 
ο x n "m- 
282. |z"sinardr — —— cosax ui — [| z" ^cosaz dx (n » 0). 
3 a 


ns 


(ax)? y 
3-3 ^ 5-8  T-Tl 





283. | E OF de = ax 
3 £ 


x 
sint 
The definite integral J x dt is called the sine integral (see 8.2.5, 1., p. 513) and it is denoted si(x). 
0 


For the calculation of the integral see 14.4.3.2, 2., p. 756. The power series expansion is 














p" n at 
= S et 5 
(ο) ξσ-- 3- 3-3 5-8 7717+ ; see 8.2.5, 1., p. 513. 
284. / πο _ _ sin ax PE / cos ax dx (see No. 322). 
J g d i 
285. / sin t ar _ 1 sinar wy cos az ᾿ da (see No. 324). 
g” π--1 gne t n—1J απ 





1 qo gw 
286. [<= B “fom ax dx = —Intan = = = In(cosec ax cot ax). 
sin ax a 2 a 





287. fe Ξ- ---- - ax. 
sin? ax 
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COS aX 1 ax 
288. [<3 —3 | Intan 
sin? ax 2asin ^ar 2a 





1 cos ax n—2 dx 


289. 1.5 - ! - n» I). 
sin” ax a(n—1)sin™ tar) n—1J sin”? ax ( ) 


mU 15 m 
290. / x dx 1 ac + (ax) " T(ax) £ 31(ax) 
a? 

















sin ax 3-3! 3-5-5! 3-7-7! 
127(ax)9 X93 ed) esi 
tasa t t np πα + 
B,, denote the Bernoulli numbers (see 7.2.4.2, p. 465). 
xv dx 1 
291. f= Ξ-- = cot ax + — Insinaz. 
sin? ax a? 
gdr COS ax 1 n—2 ax dx 
292. | = δ], 
sin" ax (n—1)asin" ax — (n—1)(n—2)a?sin"?ar | n—1/J sin"? ax psu 
293 απ 1 T z E 
` l+sinar a "TAa 27 
dx x 
294. = — — tan ( + 2) y 
l — sinag 4 2 





295 xdr Lm (5 =) E 2 i G =) 
; = a cos ; 
1+ sin ax a NA 2 g Na 2 
x da a zx 2 x 
296. / 2 B I cot ( -) | Insin 6 =) ; 
J| l—sinar a 4 2 a? 4 2 
sin az dx π. απ 


1 
297. T———— = +r + -tan ( T =) ; 
3 α 


l-csinax 4 2 


dx 1 T ax 1 ax 
298. j - - tan ( T ) + —Intan 
sinaz(l+sinar) a 4 2 a 2 


















































dx 1 "T ax 1 απ ax 
299. / - = tan ( ) tan ( ) : 
l-sinaxz)? 2a 4 2 6a 4 2 
300 7 dx 1 i ( “) x 1 e z =) 
i cot co i 
l—sinavr)? 2a 4 2 6a 4 2 
301 / sin az dx 1 t G =) 1 tan? ( =) 
; = an t an i 
1+ sin aa)? 2a 4 2 6a 4 2 
302 / sin az dx 1 " ( =) 1 ‘3 e =) 
i = co I CO 7 
J (1—sinax)? 2a 4 2 θα 4 2 
dx _ (3sin? ax —1 
303. j —3 = arcsin —3 x 
+ sin* ax 2 /2a sin‘ ax +1 
απ dx 1 
304. Llc =f > = — tana. 
— sin^ ax coaz a 
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sin(a —b)x sin(a + b)x 
2(a — b) 2(a + b) 





305. E sin ba dx = (jal A |b]; for |a| = |b| see No.275). 
































dx 2 btanaz/2+ c 
306. / = SS aett = for b? > c?), 
ποσο np m za or b? » c?), 
1 btanaz/2-4-c— yc? — t? ee 
= In for b? < ὦ). 
avc —l?  btanaz/2-- c4 Vc — 2 
sin az dx r b dx 
a ee re (see No. 306). 
J b+csnax c ον b+csinar 
dx 1 ar c dx 
308. f =—Inta =f see No. 306). 
J sinaz(b+csinax) αὖ aD bd b+ csinaz P ) 
da CCOS ax b dx 
809. { - / see No. 306). 
(b+ csinaz)? a(b? — @)(b + csinax) b?— cJ b+ csinar poena ) 
310 / sin az dx bcosax c / dx (see No. 306) 
“J (b+csinaxr)? ~ a(2—b)(b+csinar) © 2-2) b+csinax mm ` 
dx 1 V + 2 tan ax 
311. I = I= arctan μα... (b > 0). 
J P-osmar ab +e b 
dx 1 Vb? — e tanar 
aa = arcta P>, b 
p daa gya δα 5 (b? > &, b> 0), 


1 In νο — D? tan ax -- b 
= 1 
2aby/c —1? = Vc? — b tanax — b 


21.7.3.2 Integrals with Cosine Function 





(ὦ » b?, b> 0). 


1 
313. fos az dx = -sin ar. 
a 


1 1 
314. n ardr - —x-- — sin2azr. 
2 4a 

















3 l i 28 
315. cos” ax dx = — sinax — — sin” ax. 
i 3a 
3 i. 3 Ἵν... 
916. fo az dx = -x + — sin 2ax + —— sin4az. 
3 8 4α 32a 
-1 : 
cos" "arsinar mn-—1 E 
317. [ cos" ax dx = ————————— + [ cos" 2 ax da. 
‘ na Ἡ 
cosar | rsinax 
318. x cos ax dx = zy + 
d a a 
2r x f 
319. Je cos ax dx = —z cosax + | — — ~ } sinaz. 
6 α a a 
: 3a? 6 a3?  6rY 
320. z? cosas dz = | = — — | cosax 4- | — — — | sinaz. 
2 4 3 
à a a a a 
z"smar n 
321. fo cos az dx = ———— — — fo sin az dz. 
‘ a a. 
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(ar)! (ax) T 


(aa)? x 
2. 2! 4-4! 6-6! 





322. [a — In(ax) 
x 


T cost 
The definite integral — / - dt is called the cosine integral (see 14.4.3.2, p. 756) and it is denoted by 


zx 
4 6 


μὰ τ jl 
Ci(r). The power series expansion is Ci(a) = C +Ina — peu E 


2.201 44l 6-6! 
C denotes the Euler constant (see 8.2.5, 2., p. 513). 





+--+ see 8.2.5, 2., p. 513; 





























323. / μα _ _ Cosar 6 af sin ax dx (see No. 283). 
T rz gz 
cos ax cos ax a sin ax dx 
324. I pe tte eet S / n (n 1) (see No. 285). 
11; 1 1 a 1 
325. | E c Artanh(sin az) = — In tan e + z) = - In(sec az 4- tan ax). 
J/ cosar a a 2 4 a 
da: 1 
326. y - = —tanaz. 
J costar a 
327 / dr sinax T int e =) 
i = ntan | . 
J cosax —2acos?ar ' 2a 4 2 
328. / dr 1 sin ax n—2 dx (n> 1). 
J coar  a(n—1)cos ar  n—1J cos" ?ax 


QE. (ax)?nt? 


Yn +2)(an)) | - 


329 | adr 1 ((ax}? , (ax)! , 5(ar)9 , 61(ar)* , 1385(ax)? , 
|J coar a? \ 2 4.20 6.4 — 8-0 ^ 10.8 — 


E,, denote the Euler numbers (see 7.2, p. 466). 





rdr 2 1 
330. J — —tanaz-t-3 In cos ax. 
α α 














cos? ax 
331 J da xvsinax 1 ,n-2 xdr (n » 2) 
coax  (n— l)acos"-lar  (n— 1)(n—2)a?cos"-?ar | n—1J cos” ax 
dx 1 ax 
332. -----------ᾶη--.. 
1l+cosax a 2 
dx 1 d 
333. Lou — —-— cot τσ 
1 — cosax a 2 
x dx x J 2 x 
334. Ee = tan = + — ln cos “a 
J 1+cosax a 2 dg 2 
335 α απ T i ar 2 insi ax 
y = co nsin —. 
J 1—cosax a 2' @ 2 
cos ax dx 1 ax 
336. —————— —gy-—-tan—. 
14-cosax a 2 
33T. Eid d E Ξ--α- E cot ou. 
1— cosax a 2 





dx 1 "T ax 1 ax 
338. I — —]ntan ( | ) — tan —. 
J cosar(l--cosar) a 4 2 a 2 
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dx 1 "T ax 1 ax 

339. / — —]ntan ( + ) cot —. 

J cosar(l— cosaxr) a 4 2 a 2 
340. / i tan τ t 1 tan? am 

1+cosar)? 2a 2 6a 
dx 1 ax 1 ax 

341. | = t gm 

J (1— cosax)? 2a 7*9 6a SS es 
342. / Sog αν = tan 1 tanë Gi 

J (1+cosar)? 2a α 

cos az da 1 ax 1 3 aa 
343. / cot cot 
l—cosax)? 2a 6a 2 
m E VPN 

344. I dr s arcsin ΕΓ 

J| l-ccosg?ax — 2y2a 1+ cos? ax 

dx dx 1 

345. [i— = i - - — —- cota. 

J| l—cosar J sin? ar a 

si —b) si b)x 
346. f cosax cosbz dx — A " er (la| # |b]); (for |a| — |b| see No. 314). 
dx 2 b — c) tanax/2 
347. ‘<< = — == arctan joues (for b? > c?) 
b+ecosax — ay? —c MEE 


1 i (c — b) tanax/2 4- Ve — b? 
= n 
ave — (c— 6) tanar/2 — Vc? — b 


sax dx xc b da 
348: |. ο ο. zJ poe (see No. 347). 
b+ccosax c ο) b+ccosaxr 





(for b? < c*). 






































a i cos xu cos ax) = 3 om (5 " 7) b+ oa ax ας 
τ / (b+ s ax) α(ώ- PL cosar) ὦ : b2 / b+ = ax (sce Nb: 347: 
αι / (b τη, a(b? — D cosax) b? É ο / b+ = ax (ece Npr gar; 
κ / b2 + ud az ab p n TM RS ved 

353. J ἘΞ Δ w m =; arctan πα (Pe >, b>0), 


1 i btanax — yc? — D? 
n 
2ab/c —0?  btanar -- / c — 0? 





(2 >, b>0). 





21.7.3.3 Integrals with Sine and Cosine Function 


; 1. .- 
354. [sn ax cosax dx = 2a sin? ax. 
‘ a 
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ni 2 x sindax 
355. sin’ ax cos* ax dx = — 





8 32a 
1 
356. [5 Y' ax cos az dz = ——— sin"*! ax —1). 
] sin" az cosaz dz n in” ax (n zZ —1) 
1 
357. [sn ax cos" ax dz — ——— —— cos" ax (n z —1). 
} a(n 4- 1) 
sin”! ax cos™*4 zi 
358. [swt az cos™ ax dx = Sa + — [577 az cos™ ax dx 
j a(n +m) nom. 


(lowering the exponent n; m and n > 0), 


santi m-—1 
sin" ^ ax cos ar πι-1 i S 
m ————————— ——— / sin" ax cos"? ax dx 
a(n +m) n-cm 


(lowering the exponent m ; m and n » 0). 


d 
359. — In tan ax. 
a 


sin az cos ax 


1 T απ 1 
= [m tan ( | ) - | ] 
sin^arcosar a 4 2 sinax 


1 ax 1 
(n tan + ) : 
a 


2 COS Gx 


360. 





361. 3 
sin ax cos? aa 


1 1 
= — | Intanaxz —- —,— }. 
sin'arcosar a 2 sin* ax 


1 1 
=-=- (mtan ax + mm : 


sin az cos? ax a 2 cos? ax 


cere 
Iw 
/ 
ο 
| Tuai 
datas 
læ 
J 
J 
J 


362. 


363. 




















l[ sinax 1 3 T L 
365. | - + — In tan ( + =) . 
sin? ax ro ar al2cos*ax αἴας 2 4 2 
1 1 cos ax 3 T 
366. = ( - τε Intan Z) ; 
sin? ax s ar  aNcosar 2siar 2 2 
1 dx : 
367. = / (n #1) (see No.361 and 363). 
sinarcos?ar X a(n— l)cos"-!ax sin az cos?-? ax 
368. : J Hm (n # 1)(see No. 360 and 362) 
. t - n see No. anc ] 
sin” ax cos ax a(n — 1) sin”! ax sin”? ax cos ax 
369. _ 1 1 τα dx 
sin" ax cos" ax: a(n—1) sin" axcos"-lar ^ n-—1 sin"? az cos" aa; 


(lowering the exponent n; m > 0, n > 1), 


1 1 ,ntm-2 / da: 
= t . Res T . 
a(m —1) sin"-lax cos"-! az π-- 1 sin" ax cos"? aa 


(lowering the exponent m; n » 0, m » 1). 
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1 1 d 
370. PRI sin de da = — sec ax. 
os? ax acosar a 
d. 1 1 
371. j ο... EC — — tan? ax 4- Cj. 
cos? ax 2a cos? ax 2a 
372. [um sin ax dx _ 1 
cosar a(n — 1) cos" ax’ 
d. 1 1 2 
373. J sin? az de sinax + —Intan ( + -) ; 
cos ax a a 4 2 
374. ge ax dx - 1 sinax ο iuum G a =) 
cos? ax al2cos?ax 2 4 2 
1: sin ax 
375. f Da e / (n#1) — (see No. 325, 326, 328). 
cos” ax a(n—1l)cos!ar  n—1/J cos"? ar 
376. (222 axdx _ NE: (= ax m η 
cos ax a 2 
sin? ax dx 1 
377. E i == (cosax + J 
cos? ax a cos ax 
d. 1 1 1 
378. o See, (n Z1, nz 3). 
cos” ax a t(n—1)cos"ax  (n— 3) cos” ax 
379. / sin" ax sin”! ax / sin”? ax dx ity 
cos ax a(n — 1) cos ax 
sin" ax a sin"*! ax n—m--2 sin” ax 
380. / == / da: (m # 1), 
cos™ a a a(m — 1) cos"-! aa πι-- 1 cos"? qa 
sin" ax n—1 fsin"? azdx 
B i (m £z n), 
a(n—m)cos"-lar n-m cos™ ax 
sin”! ax n—1 sin" azdx (m £ 1). 
a(m —1)cos"-lar  m-—14J  cos"-?azr 
cos ax dr /— 1 1 
381. ---- = —— cosec ax. 
sin? ax a sin ax a 
cosax da _ 1 cot? ax 
382. 7 , = t 1: 
sin? ax 2a sin* ax 2a 


cos ax dx il 


lua 
/ 
sss [T μπα 
I 
/ 


cos? ax dr _ 





1 ax 
384. E «(ros ax + ln tan z) ] 
a 2 


sin ax 


385. 





cos? ax da: 1 /cosax ax 
= ( In tan 5 ) ; 


sin? ax 2a \sin? ax 
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386. 


387. 


388. 


389. 


390. 


391. 


392. 


393. 


394. 


395. 


399. 


400. 


401. 


402. 


/ 


aa 
lue 
/ 
/ 
/ 


/ 

































































2 
cos* ax dx 1 cos ax dr 
a = -ni / ΠΒ (n #1) (see No. 289). 
sin” ax (n— 1) \asin"™~ ax J sin”? ar 
cosardz 1 (cos? ax ; 
—— +Insinax }. 
sinax a 2 
cos? az dx 17. 
—  ——|sinaz --— s 
sin? ax α sin ax 
cosjardr 1 1 1 
= aS ay (n Z 1,n z 3). 
sin" ax a|(n—3)sin""axr  (n—1)sin" ax 
cos" ax cos" qa; » F cos”? ax dx 1) 
sin ax a(n — 1) sin ax ` 
cos” ax dx cos"*! as n—-m+2 / cos" ax da: ( 1) 
= — m 
sin™ a(m — 1) τ lax m-—1 J sin™ ax ? 
cos"! qa; , n—1 pcos” aa da ), 
= —m- | Ξ (m Zn 
a(n— m)sin" "lazy m—1 sin" ax 
αι ην σα αρ] 
cos"! ax n—1 fcos" *axdx (mg 1) 
= —— — m P 
a(m —1)sin"ar  m-—14J  sin"-?^a; 
dr 1 . 1 {αὶ ax 
: =+ + ntan 
sin ax(1 + cos ax) 2a(lccosar) 2a 2 
dx 1 val Int ( 2 
g cT Ξ t ntan t j 
cos az(1 X sinax) 2a(lcsinar) 2a 4 2 
sin az dx 1 1 1 + cosas 
—--ln—————. 
cosax(ld-cosar) a cos ax 
cos ax dx 1 sinar 
sin az(1 + sin ax) a sinaz ` 
sin ax dx 1 1 Int (5-5) 
- = - + ntan . 
cosaz(l+sinax) 2a(1 sinar) 2a 4 2 
cos ax dx iL 1 Intan ££ 
E c ntan 
sin aaz(1 + cos ax) 2a(1 + cos ax 2a 2 
sin ax dx τ 





sinax+cosax 2 


cos ax dx T 














1 
F — In(sin ax + cos az). 
a 














sinag +cosar 2 
dx 


sinax + cos ax 





1 
av/2 


dr 


m 
2a 


In(sin ax + cos ax). 





αν T 
ln tan ( + ) A 
2 8 




















1+cosax + sinax 
dx 


bsin ax + ccos ax 


1 


1 ax 
In{1+ta i 
a n( an 7 ) 


x 0 a 
In tan with sinf = and tan@ = 5 


c 
Pre 
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sin ax dx 1 
dig. | ESL SLE In(b 4- c cos ax). 
b+ ccosax ac 
cos ax dx 1 
dix EE RI In(b 4- csin ax). 
b--csimar ac 
0 
d|r-4— 
dx ee a 
405. i Ξ I. 
b 4- ccos az 4- f sinax b+ c v f? sin(ax 4- 0) 
with sin = 
406. A nd E t ) 
= arctan ( — tan ax ] . 
l?cosar--c?sim?ar — abc b 
407. dx 1 ctanax +b 





— E 1 : 
b2cosar—c?si? ar — 2abc ^ ctanar — b 


cos(a+b)a ^ cos(a — b)r 


(a? 


and tan@ = F (see No. 306). 


b); fora=b 





408. [sin ax cos ba dx = 2(a 4-0) 2(a — 5) 


21.7.3.4 Integrals with Tangent Function 
409. fia ax dx = = In cos ax. 
a 


tan ax 





=f. 


410. [ tan? ax dx = 
i α 


1 1 
411. / tan? ar dz — — tan? ax 4- - Incos az. 
j 2a a 


412. [ow ax dx = SA 
a(n — 1) 


3 a3? 


tan"! ag — / tan"? ax dx. 


ax 2a?z' = 17aTx? 


(see No. 354). 


22n (22n _ 1) B, a2n-152n1 





413. [5 tanax dx = I t t bee. 


3 15 105 2835 


B, denote the Bernoulli numbers (see 7.2.4.2, p. 465). 


(2n 4- 1)! 











ς os eS 15 e 2n(o92n _ a4 2n-1 
Alá. = ardrz — "me (ax)? 2(αα) κ 17(ax) d ΧΗ 1) B, (ax) 
: x 75 2205 (2n — 1)(2n!) 
an" ax 1 
415. = —— tan"! ag . #—1). 
[: cos? ax a(n + 1) ee ων. 














In(sin aa + cos ax). 


16. [ — Ed 
tanar+1 `~ 2 2a 


tanar dx T 
417. QE se 
tanar-d1 2 








1 
— In(sin ax + cos ax). 
2a 





21.7.3.5 Integrals with Cotangent Function 


1 
418. / cot ax dx = — ln sin av. 
j a 
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cot ax 
419. n arder = -— 2 _ y, 
a 


1 1 
420. n ax dx = ο cot? ar — — Insin ax. 
a 








1 
421. fo ax dx = ———— cot”! ax — Jar ax dx (n Z 1). 
a(n — 1) 
i23 / POE r ar? arx? 2 Bear gee 
. | xcotardx = aes 
μας 9 295 (2n - 1)! 
B, denote the Bernoulli numbers (sce 7.2.4.2, p. 465). 
423 / cotardr 1 axr (ax)?  2(ax)? 2?" B, (ag) 2^1 
e g ax 3 135 4725 (2n — 1) (2n)! 
t” 1 
424. Puta OF de = ——_——— cot" ax (n#-1). 
sin? az ' a(n + 1) 
8 11 
425. Ix E Lo [I = (see No. 417). 
+ cot ax tanar 1 








21.7.4 Integrals ofother Transcendental Functions 
21.7.4.1 Integrals with Hyperbolic Functions 


1 
426. [sin ax dx — — cosh ax. 
3 α 
l. 
427. / cosh ax dx — — sinh ax. 
4 a 
T 1... 1 
428. ES ax dx = — sinhax coshax — =x. 
] 2α 2 
9 l. 1 
429. [ cosh ax dx = — sinhaxcoshax + =2. 
à 2a 2 


430. J sink” ax dx 








1 -1 
— — sinh"-! az coshaz — ——— [sim ax dx (for n > 0), 
an n. 
+2 
= ——— sinh" ax cosh az — ~ [smn ax dx (for n « 0) (n Z —1). 
a(n 4- 1) n+l. 


431. n ax da: 


I. E n—l1 e 
— — sinh az cosh"^! ax -- ——— [ 2 ax dx (for n > 0), 
n 


(n2 
σα sinh ax cosh"*! ax 4- ΝΠ [ o? ax da (for n « 0) (n Z —1). 





432. / ει = T danh e 
α 2 


sinh ax 





dx 2 
433. / = — arctan e™” 
coshar a 
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1 1 
434. / x sinhax dx = —x coshax — -> sinhax. 
à α 


a2 


1 1 
435. [5 cosh ax dr — —rsinh ax — > cosh az. 
4 | a 


436. [tanh ax dx = Ὃ cosh ax. 
i a 


437. / coth az dx = 1 In sinh az. 
a 














438. ns ax dx = x — un = 
439. | coth? ax dx — x — ος 
440. J sinh az sinh bz dz = dB (a sinh bx cosh ax — b cosh bx sinh ax) (a? £ 0). 
441. / cosh ax cosh ba dx = a (a sinh az cosh ba — b sinh bx cosh ax) (a? 4 93). 
442. / cosh ax sinh ba dx = BB (a sinh bx sinh ax — b cosh bx cosh ax) (a? £ b). 


; : 1 : : 
443. j sinh az sin az dx = ση (9088 ax sin az — sinh ax cos az). 
a 


E ; 
444. / cosh az cos ax dx = S (sinh ax cos ax + cosh ax sin az). 
a 





; 1 : x 
445. / sinh ax cos ax dx = πα (cosh ax cos az + sinh ax sin ax). 
a 


: d i 
446. / cosh az sin ag dx = z E ax sin az — cosh ax cos az). 
3 a 


21.7.4.2 Integrals with Exponential Functions 


1 
447. Je dz = =e, 
i α 


x 


448. / πο απ -- πίον —1). 
i r? 2g 
449. jee dz — e (= - = 
a a 


2 
tes 


1 , 
450. Jerr dx = =r" e — 5 [antea dx. 
a 


a 


av) , (ax? 








451. JE dx luz 4 εἶ 


x 1! 


2.20 ' 3.3! 
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T 
et 
The definite integral / * dt is called the exponential function integral (see 8.2.5, 4., p. 514) and it is 
—oo 


denoted by Ei(r). For x 7 0 the integrand is divergent at t — 0; in this case we consider the principal 
value of the improper integral Ei(a) (see 8.2.5, 4., p. 514). 


[ξα-σε πας a 
A TOTA νι πα σε πω nen 


C denotes the Euler constant (see 8.2.5, 2., p. 513). 








pot 1 Har pad 
452. [x dx = (-S + af M) (n z 1). 
x” n-1\ gr! gn 
453. T = n n s > 
lore a 1+e% 
dx a 1 
454 ÁN E In(b + ce**). 





`J b+ce b ab 





207 dy 1 
455. {= -- — In(b 4- ce*?). 
ac 


"Foe"? 


dx 1 m 
456. lerem T arctan ( B (bc 0), 
il "m + e% —bc 





= —— ln —_— be < 0). 
2αν--ὑο | c— e'* / —bc ( ) 
ze™ dx em 
457. [ ~ GIL 
(1 απ) a?(1+azx) 
οσα Ty η» 1 paa 
458. [eo ied [= dr; (see No. 451). 
j a aJ α 
459. J” sin bx dx = arp sin ba — bcos ba). 
i a 
460. / €** cos bx dx = EE (a cos bx J- bsin bx). 
} α 
AT ginn l m 
461. few sin” x dx = τος sinz — ncosz) 
a +n 
2—1 
. €** sin"? z dz ; (see No. 447 and 459). 
+n. 
aT οι p 
462. [-- cos” «dx = (a cosg + nsin z) 
; αἲ 
n(n στ 1) aa τι--2 
GIA E cos" *z da; (see No. 447 and 460). 
a? +n? . 
463. fae sin bx dx = To j (asin ba — bcos ba) ( T py [(a? — 13) sin bx — 2ab cos bx]. 
3 a a 
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ρα 


πα, py (a? — 09) cos br -- 2ab sin ba]. 





464. / xe"? cos bx dz — zy m (a cos bx 4- bsin ba:) 


21.7.4.3 Integrals with Logarithmic Functions 
465. f» rdx —zlnz-—z. 


466. [m x)? dx = z(In z)? — 2r ln x + 2x. 
467. fanz) dx = x(Ina)* — 3a(Inx)? + 62 Ina — 6x. 


468. [m x)" dx = x(In z)" — nf (ina) dx (n z —1). 


(nz)? (mnz)? 
2.2! t 3.3! 








469. = =lnlng +lngz + 


dt 
The definite integral j ini is called the logarithm integral (see 8.2.5, p. 513) and it is denoted by Li(z). 
nt 
Ὁ 


For x > 1 the integrand is divergent at t = 1. In this case we consider the principal value of the 
improper integral Li(x) (see 8.2.5, p. 513). 

The relation between the logarithm integral and the exponential function integral (see 8.2.5, p. 514) is 
Li(r) — Ei(ln x). 


dx x 1 dx f 
: / (Ina)" ~ n- l)mz)-! 'n-1 / (In z)n-1 (rds (550:0: 469); 
lng al 
m+1  (m+1)? 
απ (In x)” 


m+1 "ml 


yin minti 
473. fore = (nz) 








471. [ων ln zdz = g”! | (m. 1). 





472. few x)" dx = Jah z)" dx (m£-1,mnz —1; (see No. 470). 




















x n+1 
Ing Ina 1 
ara. [de = m #1). 
Jam (m-—1)r"-! (m — 1)? m-1 red) 
ln z)” ln r)” (I n— Wt 
475. f pu ea)” 4 oy n τ (m3 1); (see No. 474). 
qm (m—1)r"-! m-—1 
qz" da i 
476. T = -JZ — dy with y 2 —(m 4 1)Inx ; (see No. 451). 
13; 
ax?" da: gm m-41 gt ds 
a77. f 1). 
(In z)^ (n — 1)(In z)n-1 t n—1/J (lng) [η 


dx 
478. f= =InIng. 
vlna 


i — Q4 s —- Mfg 23 
479. / an — Inlnz — (n — 1)Inz 4 ie ney a Tee 
2.2! 3.3! 


a Ina 
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dx EX 
480. / z iii 
J z(Inz)  (n— l1)(Inz)n-! (n #1) 
dx 1 pad da 
ası. f E 
J (lng)  af(n—l)lz)"  n—1/ af(lz)! (#1) 
i a 15 g2n-l pant 
482. J lnsin z dx = x ln s — x B oo MN aan il 


B,, denote the Bernoulli numbers (see 7.2.4.2, p. 465). 








r? 15 xT 22n-1(92n m 1)B 
483. f: T " n dne 
MS 60 315 n(ini1) ^? 
a? — Ta? 22n(22n-1 — 1) B 





484. [stan e de rilir—rz4 9 t 150 Pees a n(2n 4 1)! 


485. [sin Ine de = 5 (sin In z — cosln x). 


486. fos ln zdz = 5 (sin In x 4- cosn z). 


md 


487. Γ΄. ln zdz = laz lng — i [ £e 
4 α aJ x 


21.7.4.4 Integrals with Inverse Trigonometric Functions 
488. n Z dz = z arcsin Ž + Va — z?. 
a a 





























n ntl |... 


(see No. 451). 














" 2 2 mi x 
489. [ varesin "de = (2 9 " aresin = + = Va? — z. 
à a 2 4 a 4 
2 20 αὖ QUE Ἵνα 2 
490. [5 arcsin — dy — — arcsin = + -(9΄ 1 2α΄)ναξ -- 22. 
} α 9 α 9 
arcsin dz — , l z 1-3 αὖ 13:5. αἲ 
491. I a i z+ - 
i T a 2-3.3a3 2-4-.5.5a5 2-4-6-T-Ta* 
arcsin = dx ; 2. 3 
1 í 1 — í 
492. / = arcsin 1n? = = 
i x x a a z 
493. n T de = z arccos  — va? — 23. 
α α 
494 I T a3? αἳ X^ To z 
. | zarccos - dr — arccos a? — q?. 
a 2 4 a 4 
2 © z? E E 2 3 3 
495. / x° arccos = dx arccos —(x* 4-2a^)v a? — z?. 
a 3 a 9 
αι 
arccos = dx τ 3 a) -, 
496. / a Tar x 1 r 1-3 c 1-3-5 g 
i a 2 a 3:83:34 2-4-5-5a5 2-4-6-7-7a™ 
x 
arccos — dx A 2. 72 
1 $ 1 γα; -- 
497. / E arccos + -In IMMUNE : 
i x x a a © 
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g T dà 
498. [arctan Č dx = x arctan — — = In(a? + 2°). 
a a 2 


x 1 r^ ar 
499. nS Z dz = 3 + a”) arctar : B > 


9 T az? a3 


x i ; 
500. {9 arctan - τάν τ arctan = 6 H g eo + a”). 





ntl T pl à 
501. Js arctan Š dx = Ž— arctan ~ e a : (n T 
nal a ntl 















































x 
Eni arctan a dx oaz "E αὔ a 
I x a 3a Ba Pat c (els al: 
arctan 7 — de $ a? 4 a? 
503. / arctan In i 
T a 2a x? 
504. iren 25 = arctan ~ + —“ T (n z 1) 
(n — WT ^" a n=1) (a? + a2) I 
μα 
505. fut Z de = z arccot Č Pa In(a? + 2”). 
a 
506 fe t τά δώ ο 7 
. | xarcco E arccot = + —. 
n v — (rr a)ar URS 
23 a 22 3 
507. [? arccot - τά T arccot- | -- i In(a? + 2°). 
3 a 6 6 
n+l . Qn i 
508. fo n arccot - = de = arccot ^. 4 le sa (n 1). 
n+1 a n+l a? 
— / arccot ~ — T Y on 5 ù E x n a 
; ng 
32a? 524 Tal 
arccot ^ — dz 1 p 1. +r 
510. J — — arccot — 4 In 
x a 2a x? 
arccot 5 dz: 
1 qz a dx 
511, J a arccot n #1). 
g^ (n — 1)z"-1 a a m—1J απ τα» -- zr?) (n ) 


21.7.4.5 Integrals with Inverse Hyperbolic Functions 
512. n Z dz = x Arsinh 5 — Vr? +a. 
a a 


513. J Arcosh Z dy — s Arcosh ~ — V2? — a2. 
j a a 
2 X a 2 2 
514. J Artanh — dx — x Artanh = + 5 In(a* — a”). 
1 a a 


515. [ ^t Z dy — z Arcoth ~ + 5 In? — a2). 
a a 
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21.8 Definite Integrals 


21.8.1 Definite Integrals of Trigonometric Functions 


For natural numbers m, n: 


27 27 27 
1. [ nrdr —0. (211) 2. [o nrdr —0. (212) 3. [sin nrcosmzczdr —0. (31.3) 
0 0 0 
2π 2π 
. . 0 form Z n, 0 form Zn, 
4. [ ng sin mg dx = (21.4) 5. [ cos ng cos mg dr = (21.5) 
3 7 form — n. à mw for m=n. 








2468 n-1 
S 3579 for n odd, 
6. [Μα rdrz—-(| 235$ πι 5 (n 2 2). (21.6) 
0 2548 z for n even 


π/2 
1 (α-Ε 1) 1 1 
τα. / sin? t! g cosh +! g dr = mb = σθία +1,8 +1). (21.7a) 


I'(x)I(y) 
B(z,y)2 —-——*. 

(2,9) I(a+y) 
gamma function or the Euler integral of the second kind (see 8.2.5, 6., p. 514). 


denotes the beta function or the Euler integral of the first kind, I(x) denotes the 


The formula (21.7a) is valid for arbitrary a and 8; we use it, e.g., to determine the integrals 


71/2 71/2 π/2 


dx 
1 vsin z dz, / V'sin x dr, ] Wu etc. 
1 i cos T 


For positive integer α, β: 


π/2 








ΙΙ 
τρ. / sin2“t x cos?4+1 x dr = LAMPES ie . (21.7b 
{ 2(α 4- B 4- 1)! 
π 
AN T$ 
^ Í sinat iy 2 ora > 0 (21.8 
j 2 cm fora < 0. 
1: 
: = SOSOTE (a arbitrary). (21.9 
Ὁ 
π 
LONE TS — fora O0, 
10. jm 2 2 i 
^ x E for a « 0. 
πια ος cos ba: PERI (21.11 
a 
0 
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οὐ, 5 for |a| « 1, 
sin x cos ax 
12. — — — dr -—4 x = 21.12 
/ 4 for |a| — 1, ( 
0 for |a| 7 1. 
7 sina T cosa T 
13. ie = [| SF ae = ds 2 
/ J dx Ja a 2 (21.13 
S assin Dg 
14. / DET dg — 4. e-hab (the sign is the same as the sign of b) . (21.14 
^ a + a? 2 
7 cos ax T 
15. de= el, 15 
/ 1 -- αὖ War PS 
0 
7 sin? ax --- 
16. / = de = Td (21.16 
0 
+00 +00 
^ π 
1Τ. J sin(z?) dx = / cos(x“) dz = a (21.17 
"P^ sinzd 1 1+k 
sin x dx ; 
1. [ SS = frd <1. 21.18 
p vis AM*sn?r 2k 1-k |x| ( 
τη Dod 1 
19. p κο. for |k| « 1. (21.19 
vVi-K?smr k 
d. 1 
jj me sin? ode = —(K — E) for|é| <1. (21.20 
4 vi- Vi= k?si?s k 
Here, and in the following, E and K mean complete elliptic integrals (see 8.1.4.3, 2., p. 490): 
E-E Ὁ 5) Em (&. 5) (see also the table of elliptic integrals 21.9, p. 1103). 
7/2 2 
cos“ x dx 1 
2. | -2 = pIE- (1 κ. 21.21 
4 V1— ksin?x jl ( )K] ( ) 
22. pz - EPOD παν red (21.22) 
= ——., for integer a f : 
| I= boss + 1-0 Se 


21.8.2 Definite Integrals of Exponential Functions 
(partially combined with algebraic, trigonometric, and logarithmic functions) 


icr E 
23. Jre dx = Pos for a > 0, n > —1, (21.23a) 


anti 
0 


n! 


E fora>0, n=0,1,2,.... (21.23b) 


qni 
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I (n) denotes the gamma function (see 8.2.5, 6., p. 514); see also the table of the gamma function 21.10, 
p. 1105). 




















(οἱ 
24. f a" e" dy — Cr) fora>0, n>-—l, (21.24a 
/ που 
1- 2k —1 
= ve for n 22k (k =1,2,...), a>0, (21.24b 
k! 
--Ἕ-με for n 2 2k4-1 (k—0,1,2,...), a» 0. (21.24c 
a 
25. [εν dx = = for a > 0. (21.25 
; a 
26. Jre dx = VT for a > 0. (21.26 
4a 
0 
27. pee cosbede = VT eP? ter aso (21.27 
à 2a 
T mdr T? 
28. f =^, 21.28 
emm 6 ( 
0 
"ET T? 
29. / --. 21.29 
46751 12 ( 
1 
30. = Zn £ SMT de = arccota =arctan— fora> 0. (21.30 
J a 
31. "E "Inzdz — —C ts —0,5772 (21.31 
0 


C denotes the Euler constant (see 8.2.5, 2., p. 513). 


21.8.3 Definite Integrals of Logarithmic Functions 


(combined with algebraic and trigonometric functions) 








1 
32. fin | na | dx = —C = —0,5772 (reduced to Nr. 21.31). (21.32) 
0 
C is the Euler constant (see 8.2.5, 2., p. 513). 
lng T? 
33. 1 ;de =" (reduced to Nr. 21.28). (21.33) 
π-- 
+ Inge τὸ 
34. / alt= Fy (reduced to Nr. 21.29). (21.34) 
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Ina T 

35. ] 3-1 dx = *' (21.35) 
f In(1 4-2) 

36. | > dz - 512. (21.36) 
ὁ 3 +1 8 
1 Ίνα 

37. / (9 dx =I(a+1) for(—1 < a « oo). (21.37) 
0 


T(x) denotes the gamma function (see 8.2.5, 6., p. 514; see also the table of the gamma function 21.10, 
p. 1105). 














π/2 7/2 
38. / lnsin z dz = J ln cos x dx = -5 In2. (21.38 
0 Ὁ 
f ?ln2 
39. fe Insinzdz — —— i 3 (21.39 
0 
71/2 
40. / sin zlnsinz dx — n2 — 1. (21.40 
0 
^ /aà — i 
41. fha + bcos x) dz = rln LU fora > b. (21.41 
0 
7. 5 j 2π]πα for (a 7 b » 0), 
42. f n — 2abcosz -- b^) dx — (21.42 
| 2rlnb for (b>a> 0). 
71/2 
43. / ln tan x dx = 0. (21.43 
0 
π/4 
44. / In(1 + tan x) dx = a n2. (21.44 
Ὁ 
21.8.4 Definite Integrals of Algebraic Functions 
7 F Γ(α -Ε 1)Γ(β 1-1) 
45. fra —a)’ dx = afea - 27 dy = —_- ον 
1 1 Γία 1- β 1-2) 
= B(a+1, 8+1), (reduced to Nr. 21.7a). (21.45) 
Γ(ω)Γ 
B(z,y) = WU denotes the beta function (see 21.8.1, p. 1098) or the Euler integral of the first | 
τν 
kind, /'(v) denotes the gamma function (see 8.2.5, 6., p. 514) or the Euler integral of the second kind. 








T dz 
46 [.—— un fora <i. (21.46) 
1 (l--z)r^ sinar 
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Ῥ la 
4T. hu — —mcotam fora «1. (21.47) 
(1— x)z* 
0 
48 / ο. m forüca«b (21.48) 
] —— —— OU — — Sar r a . á 
4 i+ z^ bsin 
b 
I 1 
dx vir (2) (21.49) 
49. [σπττα m VA. i 
J /i— za ar (522) 
a 


T(x) denotes the gamma function (see 8.2.5, 6., p. 514; see also the table of the gamma function 21.10, 
p. 1105). 


1 
dx a T 
50. = O0<a<—). 21.50 
/ l-2rcosa--a? 2sina ( ss z) pan 
T dx a π 
51. | (o< <7). 21.51 
J 14+2rcosa+a2? sing 4 2 prout) 
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21.9 Elliptic Integrals 
21.9.1 Elliptic Integral of the First Kind F (y,k), k = sina 





e /? eum 





0 10 20 30 40 50 60 70 80 90 





0 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 

10 | 0.1745 | 0.1746 | 0.1746 | 0.1748 | 0.1749 | 0.1751 | 0.1752 | 0.1753 | 0.1754 | 0.1754 
20 | 0.3491 | 0.3493 | 0.3499 | 0.3508 | 0.3520 | 0.3533 | 0.3545 | 0.3555 | 0.3561 | 0.3564 
30 | 0.5236 | 0.5243 | 0.5263 | 0.5294 | 0.5334 | 0.5379 | 0.5422 | 0.5459 | 0.5484 | 0.5493 
40 | 0.6981 | 0.6997 | 0.7043 | 0.7116 | 0.7213 | 0.7323 | 0.7436 | 0.7535 | 0.7604 | 0.7629 
50 | 0.8727 | 0.8756 | 0.8842 | 0.8982 | 0.9173 | 0.9401 | 0.9647 | 0.9876 | 1.0044 | 1.0107 
60 | 1.0472 | 1.0519 | 1.0660 | 1.0896 | 1.1226 | 1.1643 | 1.2126 | 1.2619 | 1.3014 | 1.3170 
70 | 1.2217 | 1.2286 | 1.2495 | 1.2853 | 1.3372 | 1.4068 | 1.4944 | 1.5959 | 1.6918 | 1.7354 
80 | 1.3963 | 1.4056 | 1.4344 | 1.4846 | 1.5597 | 1.6660 | 1.8125 | 2.0119 | 2.2653 | 2.4362 
90 | 1.5708 | 1.5828 | 1.6200 | 1.6858 | 1.7868 | 1.9356 | 2.1565 | 2.5046 | 3.1534 oo 
































21.9.2 Elliptic Integral of the Second Kind E (o, k) , k — sino 





e /? Sp 





0 10 20 30 40 50 60 70 80 90 





0 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 
10 | 0.1745 | 0.1745 | 0.1744 | 0.1743 | 0.1742 | 0.1740 | 0.1739 | 0.1738 | 0.1737 | 0.1736 
20 | 0.3491 | 0.3489 | 0.3483 | 0.3473 | 0.3462 | 0.3450 | 0.3438 | 0.3429 | 0.3422 | 0.3420 
30 | 0.5236 | 0.5229 | 0.5209 | 0.5179 | 0.5141 | 0.5100 | 0.5061 | 0.5029 | 0.5007 | 0.5000 
40 | 0.6981 | 0.6966 | 0.6921 | 0.6851 | 0.6763 | 0.6667 | 0.6575 | 0.6497 | 0.6446 | 0.6428 
50 | 0.8727 | 0.8698 | 0.8614 | 0.8483 | 0.8317 | 0.8134 | 0.7954 | 0.7801 | 0.7697 | 0.7660 
60 | 1.0472 | 1.0426 | 1.0290 | 1.0076 | 0.9801 | 0.9493 | 0.9184 | 0.8914 | 0.8728 | 0.8660 
70 | 1.2217 | 1.2149 | 1.1949 | 1.1632 | 1.1221 | 1.0750 | 1.0266 | 0.9830 | 0.9514 | 0.9397 
80 | 1.3963 | 1.3870 | 1.3597 | 1.3161 | 1.2590 | 1.1926 | 1.1225 | 1.0565 | 1.0054 | 0.9848 
90 | 1.5708 | 1.5589 | 1.5238 | 1.4675 | 1.3931 | 1.3055 | 1.2111 | 1.1184 | 1.0401 | 1.0000 
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21.9.3 Complete Elliptic Integral, k = sina 















































a /° K E a /? K E a /° K E 
0 1.5708 | 1.5708 | 30 | 1.6858 | 1.4675 | 60 | 2.1565 | 1.2111 
1 1.5709 | 1.5707 | 31 | 1.6941 | 1.4608 61 | 2.1842 | 1.2015 
2 1.5713 | 1.5703 || 32 | 1.7028 | 1.4539 62 | 2.2132 | 1.1920 
3 1.5719 | 1.5697 | 33 | 1.7119 | 1.4469 | 63 | 2.2435 | 1.1826 
4 1.5727 | 1.5689 | 34 | 1.7214 | 1.4397 | 64 | 2.2754 | 1.1732 
5 | 1.5738 | 1.5678 | 35 | 1.7312 | 1.4323 | 65 | 2.3088 | 1.1638 
6 1.5751 | 1.5665 | 36 | 1.7415 | 1.4248 || 66 | 2.3439 | 1.1545 
7 1.5767 | 1.5649 | 37 | 1.7522 | 1.4171 | 67 | 2.3809 | 1.1453 
8 1.5785 | 1.5632 | 38 | 1.7633 | 1.4092 | 68 | 2.4198 | 1.1362 
9 1.5805 | 1.5611 | 39 | 1.7748 | 1.4013 | 69 | 2.4610 | 1.1272 
10 | 1.5828 | 1.5589 || 40 | 1.7868 | 1.3931 || 70 | 2.5046 | 1.1184 
11 | 1.5854 | 1.5564 || 41 | 1.7992 | 1.3849 | 71 | 2.5507 | 1.1096 
12 | 1.5882 | 1.5537 42 | 1.8122 | 1.3765 72 | 2.5998 | 1.1011 
13 | 1.5913 | 1.5507 49 | 1.8256 | 1.3680 || 73 | 2.6521 | 1.0927 
14 | 1.5946 | 1.5476 || 44 | 1.8396 | 1.3594 || 74 | 2.7081 | 1.0844 
15 | 1.5981 | 1.5442 || 45 | 1.8541 | 1.3506 || 75 | 2.7681 | 1.0764 
16 | 1.6020 | 1.5405 |} 46 | 1.8691 | 1.3418 | 76 | 2.8327 | 1.0686 
17 | 1.6061 | 1.5367 || 47 | 1.8848 | 1.3329 | 77 | 2.9026 | 1.0611 
18 | 1.6105 | 1.5326 | 48 | 1.9011 | 1.3238 | 78 | 2.9786 | 1.0538 
19 | 1.6151 | 1.5283 |} 49 | 1.9180 | 1.3147 | 79 | 3.0617 | 1.0468 
20 | 1.6200 | 1.5238 | 50 [1.9356 | 1.3055 | 80 | 3.1534 | 1.0401 
21 |1.6252| 1.5191 | 51 | 1.9539 | 1.2963 | 81 | 3.2553 | 1.0338 
22 | 1.6307 | 1.5141 | 52 | 1.9729 | 1.2870 | 82 | 3.3699 | 1.0278 
23 | 1.6365 | 1.5090 | 53 | 1.9927 | 1.2776 | 83 | 3.5004 | 1.0223 
24 | 1.6426 | 1.5037 || 54 | 2.0133 | 1.2681 84 | 3.6519 | 1.0172 
25 | 1.6490 | 1.4981 || 55 | 2.0347 | 1.2587 | 85 | 3.8317 | 1.0127 
26 | 1.6557 | 1.4924 | 56 | 2.0571 | 1.2492 | 86 | 4.0528 | 1.0080 
27 | 1.6627 | 1.4864 | 57 | 2.0804 | 1.2397 | 87 | 4.3387 | 1.0053 
28 | 1.6701 | 1.4803 || 58 | 2.1047 | 1.2301 88 | 4.7427 | 1.0026 
29 | 1.6777 | 1.4740 || 59 | 2.1300 | 1.2206 | 89 | 5.4349 | 1.0008 
90 οο «0000 
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21.10 Gamma Function 


x I(x) a T(x) x T(x) z£ T(x) 








1.00 | 1.00000 | 1.25 | 0.90640 | 1.50 | 0.88623 | 1.75 | 0.91906 
01 | 0.99433 26 | 0.90440 51 | 0.88659 76 | 0.92137 
02 | 0.98884 27 | 0.90250 52 | 0.88704 77 | 0.92376 
03 | 0.98355 28 | 0.90072 53 | 0.88757 78 | 0.92623 
04 | 0.97844 29 | 0.89904 54 | 0.88818 79 | 0.92877 


1.05 | 0.97350 | 1.30 | 0.89747 | 1.55 | 0.88887 | 1.80 | 0.93138 
06 | 0.96874 31 | 0.89600 56 | 0.88964 81 | 0.93408 
07 | 0.96415 32 | 0.89464 57 | 0.89049 82 | 0.93685 
08 | 0.95973 33 | 0.89338 58 | 0.89142 83 | 0.93969 
09 | 0.95546 34 | 0.89222 59 | 0.89243 84 | 0.94261 


1.10 | 0.95135 | 1.35 | 0.89115 | 1.60 | 0.89352 | 1.85 | 0.94561 
11 | 0.94740 36 | 0.89018 61 | 0.89468 86 | 0.94869 
12 | 0.94359 37 | 0.88931 62 | 0.89592 87 | 0.95184 
13 | 0.93993 38 | 0.88854 63 | 0.89724 88 | 0.95507 
14 | 0.93642 39 | 0.88785 64 | 0.89864 89 | 0.95838 


1.15 | 0.93304 | 1.40 | 0.88726 | 1.65 | 0.90012 | 1.90 | 0.96177 
16 | 0.92980 41 | 0.88676 66 | 0.90167 91 | 0.96523 
17 | 0.92670 42 | 0.88636 67 | 0.90330 92 | 0.96877 
18 | 0.92373 43 | 0.88604 68 | 0.90500 93 | 0.97240 
19 | 0.92089 44 | 0.88581 69 | 0.90678 94 | 0.97610 


1.20 | 0.91817 | 1.45 | 0.88566 | 1.70 | 0.90864 | 1.95 | 0.97988 
21 | 0.91558 46 | 0.88560 71 | 0.91057 96 | 0.98374 
22 | 0.91311 47 | 0.88563 72 | 0.91258 97 | 0.98768 
23 | 0.91075 48 | 0.88575 73 | 0.91467 98 | 0.99171 
24 | 0.90852 49 | 0.88592 74 | 0.91683 99 | 0.99581 



































1.25 | 0.90640 || 1.50 | 0.88623 | 1.75 | 0.91906 | 2.00 | 1.00000 





The values of the gamma function for x < 1 (x z 0,—1,—2,...) and x > 2 can be calculated by the 
following formula: 
(e+ 
SII ως δες. 
5 


P'(1.7) 090864 
E A: r(0.7) = aI. = 


WB: [(3.5) 2 2.5- (2.5) = 2.5 - 1.5- (L5) — 2.5- 1.5- 0.88623 — 3.32336. 








I(x) 








= 1.2981. 
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21.11 BesselFunctions (Cylindrical Functions) 












































c | Jo(æ) | A(x) | Yo(v) | Yi(m) | lolx) | K(x) | Ko(v) | Kile) 
0.0 | +1.0000 | +0.0000} -co | —oo | +1.000| 0.0000| oo 

0.1 | 09975 | 0.0499 | —1.5342 | 6.4590 | "1.003 | +0.0501 | 2.4971 | 9.8538 
0.2 | 0.9900] 0.0995 | 1.0181] 3.3238] 1.010 | 0:1005 | 1.7527 | 4:7760 
0.3 | 0.9776 | 0.1483 | 0.8073 | 2.2031| 1.023 0.1517 | L3725 | 3.0560 
0.4 | 0.9604 | 0.1960| 0.6060| 1.7809 | 1.040 | 0:2040 | 1:1145  |2.1844 
0.5 | +0.9385 | +0.2423 | —0.4445 | -1.4715 | 1.063| 0.2579 | 0.9244 — | 1.6564 
0.6 | 0.9120] 0.2867 | 0.3085] 1.2604] 1092] 0.3137 | 0.7775 | 13028 
0.7 | 0.8812] 0.3290} 0.1907] 1.1032| 1.126] 0.3719 | 0.6605 | 1:0503 
0.8 | 0.8463] 0.3688 | —0.0868 | 0.9781 | 1.167] 0.4329 | 0.5653 | 08618 
0.9 | 0.8075 | 0.4059 | 0.0056 | 0.8731 | 1.213 | 0.4971 | 0.4867 | 0.7165 
1.0 | +0.7652 | +0.4401 | +0.0883 | —0.7812 | 1.266 | 0.5652 | 0.4210 | 0.6019 
Ll | 0:7196 | 0.4709 | 0.1622 | 0.6981 | 1:326 | 0.6375 | 0.3656 | 0:5098 
12 | 0.6711] 0.4983] 0.2281 | 0.6211) 1.394| 0.7147 | 0.3185 — 0.4346 
13 | 0.6201} 0.5220] 0.2865 | 0.5485 | 13691 0.7973 | 0.2782 | 0:3725 
14 | 0.5609 0.5419 | 0.3379 | 0.4791 | 1.553) 0.8861 | 0.2437 | 03008 
1.5 | +0.5118 | +0.5579 | +0.3824 | 0.4123 | 1.047 | 0.9817 | 0.2138 — | 0.2774 
L6 | 0.4554 | 0.5699 | 0.4204 | 0.3476 | 1.750) 1.085 | 0.1880 | 02406 
LT | 0.3980} 0.5778 | 0.4520 0.2847 | 1864! 1.196 | 0.1655 | 02094 
L8 | 0.3400 | 0.5815 | 0.4774 | 0.2237 | 1.990 | 1.317 | 0.1459 | 0.1826 
19 | 0.2818] 0.5812] 0.4968| 0.1644] 2.128 | 1.448 | 0.1288 | 0.1597 
2.0 |--0.2239 | 40.5767 | --0.5104 | —0.1070 | 2.280 | 1.591 |0.1139 | 0.1399 
21 | 0.1606 | 0.5683| 0.5183 | —0.0517 | 2.446 | 1.745 | 0.1008_ | 0.1227 
22 | 0.1104] 0.5560| 0.5208 | --0.0015 | 2.629 1:914 | 0.08927 | 0.1079 
2:3 | 0.0555 | 0.5399 | 0.5181 | 0.0522| 2.830| 2.098 | 0.07914 | 0.09498 
2:4 | 0.0025 | 0.5202 | 0.5104 | 0.1005 | 3.049 | 2:298 | 0.07022 | 0.08372 
2.5 | -0.0484 | +0.4971 | +0.4981 | +0.1459| 3.290 | 2.517 | 0.06235 | 0.07389 
2:6 | 0.0968 | 0.4708 | 0.4813 | 0.1884 | 3:553] 2755 | 0.05540 | 0.06528 
27 | 0.1224 | 0.4416 | 0.2605 | 0.2276 | 3.842 | 3.016 | 0.04926 | 0.05774 
2:8 | 0.1850 | 0.4097 | 0.4359 | 0.2635 | 4.157 | 3.301 | 0.04382 | 0.05111 
2.9 | 02243 | 0.3754 | 0.4079 | 0.2059 | 4:503 | 3.613 | 0.03901 | 0.04529 
3.0 | —0.2601 | 40.3391 | 0.3769 | 40.3247 | 4.881 | 3.953 | 0.03474 | 0.04016 
3.1 | 0.2921 | 0.3009 | 0.3431 | 0.3496 | 5.294} 4326 | 0.03095 | 0.03563 
3.2 | 0.3202] 0.2613| 0.3070 | 0.3701 | 5.141 | 4.734 | 0.02759 | 0.03164 
3.3 | 0.3443 | 0.2207 | 0.2691] 0.3879| 0.243 | 5.181 | 0.02461 | 0.02812 
3.4 | 0.3643 | 0.1792 | 0.2206 | 0.4010| 6.785 | 5.670 | 0.02196 | 0.02500 
3.5 | —0.8801 | 0.1374 | 0.1890 | -0.4102 | 7.378 | 6.206 | 0.01960 | 0.02224 
3.6 | 0.3918 | 0.0055 | 0.1477 | 0.4154| 8.028 | 6.793 | 0.01750 | 0.01979 
3.7 | 0.3992 | 0.0538 | 0.1061 | 0.4167 | 8.739 | 7.436 | 0.01563 | 0.01763 
3.8 | 0.4026 | +0.0128 | 0.0645 | 0.4141 | 9.517 | 8.140 | 0.01397 | 0.01571 
3.9 | 04018 | —0.0272 | 0.0234 | 0.4078 | 10.37 | 8.913 | 0.01248 | 0.01400 
4.0 | —0.3971 | 0.0660 | —0.0169 | +0.3979 | 11.30 | 9.759 | 0.01116. | 0.01248 
4.1 | 0.3887 | 0.1033 | 0.0561 | 0.3846 | 12.32 | 10.69 | 0.009980 | 0.01114 
4.2 | 0.3766 | 0.1386 | 0.0938 | 0.3680 | 13.44 | 11:71 | 0.008927 | 0.009938 
43 | 0.3610] 0.1719 | 0.1296] 0.3484 | 14.67 | 12.82 | 0.007988 | 0.008872 
44 | 0.3423 | 0.2028 | 0.1633 | 0.3260 | 16.01 | 14.05 | 0.007149 | 0.007923 
4.5 | —0.3205 | —0.2311 | 0.1947 | 0.3010 | 17.48 | 15.39 | 0.006400 | 0.007078 
4.6 | 0.2961] 0.2566 | 0.2235 | 0.2737 | 19.09 | 16.86 | 0.005730 | 0.006325 
4.7 | 0.2603] 0.2791 | 0.2494 | 0.2445 | 20.86 | 18.48 | 0.005132 | 0.005654 
18 | 02401 | 0.2985 | 0.2723 | 0.2190 | 22.79 | 20.25 | 0.004597 | 0.003055 





0.2097 .3147 | 0.2921 0.1812 | 24.91 22.20 0.004119 | 0.004521 
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Si(x) | Yæ) | Yi) | I(x) | Læ) | Ko) | K (æ) 














i .00 

—0.3276 | —0.3085 | +0.1479 | 27.24 | 24.34 | 3691 4045 
0.3371 | 0.3216 | 0.1137 | 29.79 | 26.68 | 3308 3619 
0.3432 | 0.3313 | 0.0792] 32.58 | 29.25 | 2966 3239 
0.3460 | 0.3374 | 0.0445 | 35.65 32.08 | 2659 2900 
0.3453 | 0.3402 | 40.0101 | 39.01 2395.18 | 2385 2597 


—0.3414 | —0.3395 | —0.0238 | 42.69 | 38.59 | 2139 2326 
0.3343 | 0.3354 | 0.0568 46.74 | 42.33 | 1918 2083 
0.3241 | 0.3282 | 0.0887 | 51.17 | 46.44 | 1721 1866 
0.3110 | 0.3177 | 0.1192 | 56.04} 50.95 | 1544 1673 
0.2951 | 0.3044} 0.1481 | 61.38 | 55.90 | 1386 1499 


—0.2767 | —0.2882 | —0.1750 | 67.23 | 61.34 | 1244 1344 
0.2559 | 0.26904 | 0.1998 | 73.66 | 67.32 | 1117 1205 
0.2329 | 0.2483 | 0.2223 | 80.72 | 73.89 | 1003 1081 
0.2081 | 0.2251 | 0.2422] 388.46 | 81.10 | 09001 | 09691 
0.1816 | 0.1999 | 0.2596 | 96.96 | 89.03 | 08083 | 08693 


—0.1538 | —0.1732 | —0.2741 | 106.3 97.74 | 07259 | 07799 
0.1250 | 0.1452 | 0.2857 | 116.5 | 107.3 | 06520 | 06998 
0.0953 | 0.1162 | 0.2945 | 127.8 | 117.8 | 05857 | 06280 
0.0652 | 0.0864 | 0.3002 | 140.1 | 129.4 | 05262 | 05636 
0.0349 | 0.0563 | 0.3029 | 153.7 | 142.1 | 04728 | 05059 


—0.0047 | —0.0259 | —0.3027 | 168.6 | 156.0 | 04248 | 04542 
+0.0252 | +0.0042 | 0.2995 | 185.0 | 171.4 | 03817 | 04078 
0.0543 | 0.0339 | 0.2934 202.9 | 188.3 | 03431 | 03662 
0.0826 | 0.0628 | 0.2846 | 222.7 | 206.8 | 03084 | 03288 
0.1096 | 0.0907} 0.2731 | 244.3 | 227.2 | 02772 | 02953 


+0.1352 | +0.1173 | —0.2591 | 268.2 | 249.6 | 02492 | 02653 
0.1592 | 0.1424 | 0.2428] 294.3 | 274.2 | 02240 | 02383 
0.1813 | 0.1658 | 0.2243 | 323.1 | 301.3 | 02014 | 02141 
0.2014 | 0.1872 0.2039 | 354.7 | 331.1 | 01811 | 01924 
0.2192 | 0.2065 | 0.1817] 389.4 | 363.9 | 01629 | 01729 


+0.2346 | +0.2235 | —0.1581 | 427.6 | 399.9 | 01465 | 01554 
0.2476 | 0.2381 | 0.1331 | 469.5 | 439.5 | 01317 | 01396 
0.2580 | 0.2501 | 0.1072] 515.6 | 483.0 | 01185 | 01255 
0.2657 | 0.2595 | 0.0806 | 566.3 | 531.0 | 01066 | 01128 
0.2708 | 0.2662} 0.0535 | 621.9 | 583.7 | 009588 | 01014 


0.2731 | --0.2702 | —0.0262 | 683.2 | 641.6 | 008626 | 009120 
0.2728 | 0.2715 | 0.0011 | 750.5 | 705.4 | 007761 | 008200 
0.2697 | 0.2700 | 0.0280 824.4 | 775.5 | 006983 | 007374 
0.2641 | 0.2659 | 0.0544 | 905.8 | 852.7 | 006283 | 006631 
0.2559 | 0.2592 | 0.0799 | 995.2 | 937.5 | 005654 | 005964 


0.2453 | --0.2499 | +0.1043 | 1094 1031 005088 | 005364 























0.2324 | 0.2383 | 0.1275 | 1202 1134 004579 | 004825 
0.2174 | 0.2245 | 0.1491 | 1321 1247 004121 | 004340 
0.2004 | 0.2086 | 0.1691 | 1451 1371 003710 | 003904 
0.1816 | 0.1907 | 0.1871 | 1595 1508 003339 | 003512 





C LLLP DLL [0909090 0000000000 -1-1-1-1-l -i-1-1-1Tl Go O09 ooom9 ouo ouo 
5 COND BRWHHS CONN BWNHHOS COND bio CONN RWHHOS CONOR olio 





ren 











0.1613 | +0.1712 | +0.2032 | 1753 1658 003036 | 003160 
0.1395 | 0.1502 | 0.2171 | 1927 1824 002706 | 002843 
0.1166 | 0.1279 | 0.2287 | 2119 2006 002436 | 002559 
0.0928 | 0.1045 | 0.2379 | 2329 2207 002193 | 002302 
0.0684 | 0.0804 | 0.2447 | 2561 2428 001975 | 002072 


-F0.0435 | +0.0557 | +0.2490 | 2816 2671 001778 | 001865 
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21.12 Legendre Polynomials ofthe First Kind 


P(x) = 1; 
1 
Βα) = 8a? — 1) 
1 
Py(x) = z857 — 30x? + 3); 


P(x) = 


Ps(x) 


x: 


1 
367 — 3z); 

1 

g (632° — 70x? + 15x); 


1 
— (429a7 — 693a° + 31523 — 352). 




















n= 1 jy: - r4 up Pe P 
P(x) = 1; 31s 315x* + 105z“ —5) P(x) 16 
z— PQ(r)| Ρι(α) P3(ax) Ῥι(α) P(x) P(x) P(x) 
-00 —0.3000 «0000 0.3750 0.0000 | —0.3125 0.0000 
.05 —0.4962 | —0.0747 0.3657 0.0927 | —0.2962 | —0.1069 
0.10 —0.4850 | —0.1475 0.3379 0.1788 | —0.2488 | —0.1995 
0.15 —0.4662 | —0.2166 0.2928 0.2523 | —0.1746 | —0.2649 
0.20 —0.4400 | —0.2800 0.2320 0.3075 | —0.0806 | —0.2935 
25 —0.4062 | —0.3359 0.1577 0.3397 | +0.0243 | —0.2799 
.30 —0.3650 | —0.3825 | +0.0729 0.3454 1292 | —0.2241 
0.35 —0.3162 | —0.4178 | —0.0187 0.3225 .2225 | —0.1318 
0.40 —0.2600 | —0.4400 | —0.1130 0.2706 .2926 | —0.0146 
0.45 —0.1962 | —0.4472 | —0.2050 0.1917 .3290 | 40.1106 
.50 —0.1250 | —0.4375 | —0.2891 | --0.0898 .3232 0.2231 
59 —0.0462 | —0.4091| —0.3590 | —0.0282 27708 0.3007 
0.60 -F0.0400 | —0.3600 | —0.4080 | —0.1526 1721 0.3226 
0.65 0.1338 | —0.2884 | —0.4284 | —0.2705 | +0.0347 0.2737 
0.70 0.2350 | —0.1925 | —0.4121 | —0.3652 | —0.1253 | +0.1502 
A45 0.3438 | —0.0703 | —0.3501 | —0.4164 ) —0.2808 | —0.0342 
.80 0.4600 | +0.0800 | —0.2330 | —0.3995 | —0.3918 | —0.2397 
0.85 0.5838 .2603 | —0.0506 | —0.2857 | —0.4030 | —0.3913 
0.90 0.7150 .4725 | +0.2079 | —0.0411 | —0.2412 | —0.3678 
0.95 0.8538 7184 0.5541 | 40.3727 | 40.1875 | +0.0112 
1.00 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 
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21.13 Laplace Transformation 
(see 15.2.1.1, p. 770) 


oo 


F(p) = /[ ο” θα, f(t) =0 fort <0. 


C is the Euler constant: 


0 


C — 0.577216 (see 8.2.5, 2., p. 513). 






































No. F(p) f(t) 
1 10 0 
2 |> 1 
p 
3 1 pel 
p" (n — 1)! 
4 1 p eat 
(p — a)^ (n — 1)! 
E 1 eßt — e% 
5 E 
(p — o)(p — 8) β--α 
6 p Be — ae% 
(p — o)(p — 8) p-a 
7 - 3. πι αι 
pP + 2ap 4 f? JB sin yf 
α 
δ sinat 
PFO sina 
a cos B 4- psin B . 
9 ο... sin(at 4- B) 
Ῥ "OS Q yi f a~at 
9 πρ μ.ο. απ 
p 
1 pro cos at 
s — asi 
? pee cos(at + β) 
a f 
3 dia sinh ot 
p 
4 pus cosh at 
5 1 (B — ye -- (^ — a)e* + (a — B)e*t 
(p — e)(p — 8)Y(p — 9) (a— 8Y(8 — 3) — a) 
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No. F(p) f(t) 
16 1 e% — [1 + (a — 8)t] e** 
(p — o)(p — 8* (a — 8? 
T p a e? — [a + B(o — B)t]e?t 
(p—a)(p— BP (a 89 
18 p a’e* — [2a — B + B(a — 8)t] Be 
(p—a)p— 5) ESO 
T 1 a sin Bt — B sinat 
(P+ 0) (p? + B) αβζα» -- ϱ2) 
20 p cos Bt — cos at 
(9 xay F A?) (a2 89) 
2 2 
21 ETT: cos? at 
p(p? + 4a?) 
20? 23 
22 nis 4-452) sin’ at 
2 2 
pe — 2a 42 
23 D — 4a) cosh* at 
202 212 
24 pe — 4a) sinh* at 
9 2 
25 xw sin at - sinh at 
ne 9 2 
26 d sin at - cosh at 
p* + 4a 
Wee 9 2 
27 red cos at - sinh at 
p* 4- 4o 
23 
28 ct cos at + cosh at 
p* 4- 4o* 
ap i. 
29 Gre? 5 sinat 
ap t. 
30 (gay 5 sinh at 
31 ap Bsinh at — asinh Bt 
(p? — a?) (p? — 8?) a? — 62 
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No F(p) f(t) 
32 p cosh at — cosh Bt 
(p? — a?) (p? — 8?) =? 
1 1 
33 | — —— 
VP Tt 
1 t 
34 | — | 
pp T 
1 n! 43 
E e q inteser 
35 P (an)! Va (n > 0, integer) 
1 1 
36 _ c 
pra Vit 
1 
37 Fa- ypt ~ (e Pt gdt 
vpta- pi naa ee") 
sin at 
38 VP +æ -p 
ty 2nt 
/2 
39 — sin at 
Ti 
+a /2 
40 MEE D — cos at 
PFa mt 
Vp —a?—p 2 
41 p TOT — sinh at 
pa mt 
Vp? — o? 2 
42 τ 4| — cosh at 
7 = a Tt 
vat 
43 Le zi / eV dr 
PVP +a Jar 4 
: Sod (Bat 
ο 
44 | — j e^ dr 
(p -- o)v/p * B n(B — a) 3 
vat 
/ y ατα 
45 = LG. I e dr 
p vit πη 
1 
46 CSI Jo(at) (Bessel function of order 0, p. 562) 
P +a 
47 : lo(ot) (modified Bessel function of order 0, p. 562) 
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No. F(p) 16) 
a8 | e “Ft. I, (“+ ) 
(p + a)(p + 8) 3 
1 
49. | ----------- gat. Jy (\/a? — B2t 
Vp? + 2ap + B? E ? ( RS ) 
el/P sinh 2V/t 
50 ———— 
p/p vm 
α sin at 
51 | arctan — 
t 
2a 2 
52 | arctan a m * sin at - cos Bt 
ae vwd. f at 
53 | arctan iets dae sin Bt 
αβ 
Inp 
54 | — —C —lnt 
p 
I t£ 1 1 
55 = —[v(n)-Inti], v(m) =14+=4+---+—--C 
peti n! 2 n 
1 2 2 
sg | np" (Int - C? - 7 
p 6 
p-a 1 Bt αἱ 
57 m =p z(e —e ) 
] ν 2 
58 | In Rm = 2artanh — sinh at 
p-a t 
so [4 p +a? 9. 608 Bt — cosat 
n _ 
pg t 
2 2 
p-a cosh Bt — cosh at 
60 | In pg PE com 
, g-o?/A4t 
α ο 
61. ον Rea>0 =e 
ea Ja bE 
1 et 
62 e^. ReaPf0 ——— 
VP Vat 
(Vp? + a — p)” 
63 τπτ C Rev > —1 | a” J,(at) (see Bessel function, p. 562) 
p? + a’ 
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No. F(p) f(t) 
64 peus Rev» -1 ([|a"L(at) (see Bessel functi 563) 
1 er — Q Ly (a see Dessel function, p. 9 
vp — a? 
dre ; 0 fort< pf 
65 ze æ (8>0, reell) { l darte B 
. eB rho? 0 fort « B 
66 Jv 3 od Jo (αντ = BP) fort > B 
- e BV Po? 0 fort « B 
9 JP b (av =P?) fort >£ 
. ο-θν Φα) 8) 0 : for t « 8 
μπαρ senis (522 8) tto o 
ο-θγρ ταἳ 1 0 fort« B 
St dfe τς VETE 
Ore ( zra] Vn (aye =F) fort > 8 
α 
eb Po? 1 0 fort « B 
Eod gea ο VE- 
70 p o ( σπα) NU (oye — D fort » B 
α 
αι 0 fort « B 
αρ νυν Γαἳ B 
ΤΕ με. τ oe (ave — 8 fort > B 
-! 
deus 0 fort < B 
τὰ | ον -ᾱἳ ϱ-β B 
72 je * ES Jm (ave — A) fort > 6 
z 
73 l-e 0 fort >a 
í p 1 for0<t<a 
e-oP — e-r 0 for0<t<a 
74 | ————— lfora<t< 








0 fort > 8 
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21.14 Fourier Transformation 
The symbols in the table are defined in the following way: 
C: Euler constant (C — 0.577215...) 
F(z) = 4 et dt, Rez>0 (Gamma function, see 8.2.5, 6., p. 514), 
0 
oo (—1)"($2)"*?" 


me) = È 


2 nlro +n) (Bessel functions, see 9.1.2.6, 2., p. 562), 
1 1 s 
Ky(z) = 57 (sin(v v))-UL,() —1L(z) with I(z)-— e-2i7"J,(z e2i7) 


(modified Bessel functions, see 9.1.2.6, 3., p. 563), 








1 z cost 
Oley = = / dt 
y 2" a vt (Fresnel integrals, see 14.4.3.2, 5., p. 757), 
S y = as sem t 
(ο) V2mJo Vt 
z sint 
Si(z) = f 22^ dt 
Jog sint i (Integral sine, see 14.4.3.2, 2., p. 756), 
siz) = - 1 = = Sift) — 5 
. ?9 cost . 
Cir) = -f : dt (Integral cosine, see 14.4.3.2, 2., p. 756). 


Tr 
The abbreviations for functions occurring in the table correspond to those introduced in the corre- 
sponding chapters. 


21.14.1 Fourier Cosine Transformation 























No. | f(t) F.(w) = T f(t) cos(tw) dt 
0 
1 1, 0<t<a | sin(aw) 
` D; t>a ω 
t, 0<t<l d 
2 2—1 1«t«2 ΠΟΤΟ w2 
0, t>2 
0, 0ct«a 
3. 1 —Ci(aw) 
πα ta 
t 
4 1 -T. 1 
| yt 2 Ju 
: 0<t< 2C (aw) 
E -F3 a " à aw 
t ag pee e 
5 | vi 2 Ju 
0, ta 
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No. | f(t) Fw) =f f(t) cos(tw) dt 
0 
6 - ΠΗ 7T 1— 2C (aw) 
i 7 ta 2 Jw 
7. (ac 0)3, a>0 | [—si (ac) sin (aw) — Ci (aw) cos (aw)] 
8. (a= t), a>0 [eos (aw) Ci (aw) + sin (aw) G t Si (ω))] 
2 | ,2Y-1 T etm 
9. | (a +t) a 
ið: 2 ga T sin (aw) 
0. | (a ) πο 
b b = 
11. Wa Warp T e™ cos (aw) 
act a—í _ . 
12. + (+0? | P+ 7 e^" sin(aw) 
13. | (a? -12)73 Ko (aw) 
2-10). 0<t< 
14. ία EL : m Jo (aw) 
0, t>a | 2 
1δ. εν, 0<Rev <1 | sin (=) P(l-v)wr-? 
n j— at a 
16. |e d à 
z 6 -b eat 1 a2 4L ω2 
Ti: i am Paw 
18. | Vte% v7 (a? + wi cos G arctan (2)) 
2 2 a 
1 
e*t T 
19. /5 








a+ (a2 +w)?\? 
a? + w? 
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No. | f(t) Fw) = | f(t) cos(tw) dt 
d 
2m 
20. | t^ e-t n! a” + (a? 4- w?) (+1) Σ (-1)" (" + ') (5) 
0<2m<n41 2m a 
2i η τις 8 T (v) (a? +w)? cos (v arctan (=)) 
a 
ix 3-34 1 m 
22. πα) =z- e ) 
1 w? 
23. | ea? VO cies 
1 a 
24. |t 2e t T des 204 (cos V2aw — sin V2aw) 
2 fw 
3 a 7T Aus 
25. |t 2e t (re 2w cos y2aw 
a 
Int, 0<t<l Si (w) 
26. 0, t1 uw 
Int T T 
oF || =—— -|Z (C+7+m4 
JE E 
t md. : : 
28. | (t — a?) In (;) a (sin (aw) Ci (aw) — cos (aw) si (αω)) 
a 2a 
29. | (t? — a2)! In (bt) 5 ; {sin (aw) [Ci (aw) — In (ab)] — cos (aw) si(aw) } 
2a 
: 1 1 Q(o0NM (NM? 
30. 1 In (1 4 t) 5 l(a (2)) + (si (2)) | 
1 
i = {5 [cos (bw) — cos (aw)| 
w 
31. | In a 4 + cos (bw) Si (bw) + cos (aw) Si (aw) 
—sin (aw) Ci(aw) — sin (bw) Ci (ho) 
1 a ω 
2. |e 9 | προ ας τς 2 2 arcta (2) 
3 e" lnt Pa ac 5 In (a +w") +w arctan a 
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No. | f(t) Fw) =f f(t) cos(tw) dt 
0 
a? +t? T pyb —aw 
33. | In (Sca) i (e ""—e t) 
+e Er 
34. | In PoP i (cos (bw) — e *7) 
1 t\? 
35. |- 6 T ) — 2m si (aw) 
t a—ít 
In (a? + £? 2w - 
36. s = = = IG tn (=)) Ko (a) 
2 _ paw 
Sd $4 T im 
{2 ω 
2 ne 
38. In 1— Lengua) 
{2 ω 
ahd > ; w<a 
: sin (a 
39. r T . "S 
f 4 
0, w>a 
T. äi 
t sin (at) 3* ^ cosh (bu) , ω«α 
4. are T 
Pob = €-"" sinh (ab) , wa 
: T p2 (1— e7% cosh (bw)) w<a 
sin (at) 2 5 ᾿ 
4L. | ΠΣΕ 2 
t(P c 0) E b-? e~™ sinh (ab) , w>a 
1 a+w α-ω 
42. | e~ © sin (at ua fs aes a ey 
d 2 li +(a+w) + (a-w) 
o—t sin t 2 
43. pm 1 arctan (5) 
t w? 
an? 2 
ja EE nie 
t w? 
, sin (at) sin(bt) 1 (a +b}? — w? 
45. BE 3 In (a - 5E uà 
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No. | f(t) Fw) = f f(t) cos(tw) dt 
o 
T 1 
46. ae αν 2 (a -3 w) w < 2a 
t Us w > 2a 
1 € 
. | sir? (at) 8 {(w + 3a) In (w + 3a) 
41. 2 +(w — 3a) 1n |w — 3a| — (w + a) ln (w + a) 
—(w — a) In |u — al} 
a Ge cu?) O<w<a 
48. sin? (at) : w? w=a 
P I (3q— wy à « uU « 3a 
16 f 
0, w> 3a 
T 2 
jg: 1-- ces (at) d aule τ 
π 
: 1 — cos (at) σία-ω), w<a 
50. RE - 
0, w>a 
a e~® cosh (bw) 
51. | 08 (αἱ) 9 b , ue d 
a b+ m e™™ cosh (ab) 
MEM w>a 
2 b 
b 1 1 
52. | e~ cos (at s [e—Ó—ÁÉi——É 
52. | e™ cos (at) 2 er | 
1 a? 4 v? 
53. | e^'. cos (at) E (re 4b cosh (=) 
t 
54. LE tan (at) m cosh (bw) (1 + e?%)-t 
t 2ab =i 
55. Pap cot (at) π cosh (bw) (e?” — 1) 
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No. | f(t) Fw) = f f(t) cos(tw) dt 
o 
2 2 
56. | sin (at?) : ix (s (=) — sin (=)) 
1 [mv w? 

5 4 =f =^] MD = τον 
57. | sin [a(1 — &?)] 5 i= cos («+ 1 + <) 

5 sin (at?) T w? T w? UT w? 
58. 72 z” 5 zm C Ta + V2a sin 4 + da 

a 2 32 29 12 
: sin (at?) tji Jo w? . w? 
n t 212 E 4a P 4a 
60. | ea? sin (bt?) ^ m; | b pu? 
sin |5 arctan |z IF) 
1 [π w? w? 
: ig 1gm| (w - γω 
61. | cos (at?) o ns feos (5) c sin (5) 
l [m T w? 
: .- ο ας LL 
62. | cos[a(1 — t?)] : E sin (« + 7 + =) 
ve (a2 + p) ei au? (a? 4- 3) 1 
,— at? ‘i 2 
63. | e cos (bt?) | bu? b. oy b 
-< cos το ΕΙ] τη arctan a 
l. a T 
64. 9m (2) 3 Jo (2 Vaw) 
1 a 1 [m- 

Be || tasty pe 2T. ja s 7) — e- 2V 
65. A sin (2) 5 y= [sin (2 Vaw) + cos (2 Vaw) — e ] 
w (5) s i 5 as [sm ο, συ 

ς Vi sin (5 9 V», Bin aw) + cos au) + € 
67. A cos (7) 5 dz [cos (2 Vaw) — sin (2 Vaw) + gm 
68 Es : cos (2) Em [cos (2 Vaw) — sin (2 Vaw) + αμα] 
i νι ON 2 Y 2a i ] ᾿ s 
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No. | f(t) Fw) =f f(t) cos(tw) dt 
0 
. TEM T a? : a? a? a? 
69. Vi sin (a vt) 2 z le (5) sin (5) S (5) cos (5) 
5 Ja (a+ pj si a?b (b? + w?) 7! 
70. ο- δέ sin (avt) τν... a2w 3 act e 
X cos irs 3 arctan [7 
sin (avt) a? a? 
Til, d T E (ὦ + σ dw 
1 T. "v a? 
72. Vi cos (a Vt) "B sin G + £) 
i apata ai 
"T Tlu iei ab? (a? + b°) 
73. | —— cos (bvt) rw 1 i 
vi - cos |— — ——. — = arctan (3) 
4(a2 +w?) 2 a 
3 a? 
74. | en ave cos (a vt) vma(2w) 3 e 26 
e-avt Vi ντ T a? 
75 20S — si —e w 
75. Vi [cos (a Vt) — sin (av t)| πο 
21.14.2 Fourier Sine Transformation 
No. | f(t) F,(w) = { f(t) sin(tw) dt 
ο 
1 il θα 1 — cos (aw) 
` 0, t>a ω 
ty 0<t<l w 
2. 2--ἲ 1«t«2 dw? sin w sin? (=) 
0, t>2 2 
1 π 
3: z 3 
: 0<t 
4. | #? ΞΈΞα. Bra) 
0, ia 
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No. | f(t) Fw) = f f(t) sin(tw) dt 
o 
0, 0<t<a 
5. 1 — si (aw) 
3 t>a 
t 
6 1 /5 1 
η νι 2 fw 
: 0<t< 2 S (aw) 
= a T aw 
jf t = 
n | v 2 fa 
0, t>a 
8 ^ vetaa e 
: Wi t>a 2 γω 
3 
9. (=) πλω 
10. | (a t)! (a > 0) [sin (aw) Ci (aw) — cos (aw) si (aw)| 
11. | (a— t)! (a > 0) sin (aw) Ci(aw) — cos (aw) (5 + ϑι(αω))] 
t T 
12. | —2—; ue ve 
αἲ εξ 2^ 
d - lg ; ; 
13. | (a° — t’) = [sin (aw) Ci(aw) — cos (aw) Si(aw) 
a 
b b E 
14. P+? Prati 7 e^" sin (aw) 
: αγί = a—t tase 
15. Pat Pat m e™ cos (aw) 
: t π 
16. Zg — «05 (aw) 
1 7 1— cos (au) 
Ha cde 2 4 
1 mg) 
fuels TE 
᾽ t (a? +t) 2 @ 
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No. | f(t) Fi(w) = f f(t) sin(tw) dt 
0 
ae 0<Rev <2 cos (=) Dü-w)w- 
ω 
20. | e-*t ----- 
: a? + w? 
5— at 
μι arctan (5) 
t a 
ο at _ ,4—bt 1 p? 2 
25.8 ü ' ls w In aa +b arctan (=) —a arctan (2) 
; 3 3 t 
23. | vte% VE (a +w?) sin E arctan (=) 
2 2 a 
1 
1 1 
"ES Ἐν (a? + w?)2 —a 
Vi a2 + o? 
[zn] n+1 w\2mt1 
25. | {6 αἱ nlatt}(a? + w?)— +) n (-1)* Bii d e 
vo. ω 
26. | t" let I(v) (a? -- 4?2)73 sin L arctan (2) 
a 
ET secu 1 
27. |e 2" (1— e$) -5 tanh (mw) 
t w? 
28. | te oS eae 
a 4a 
1 a 
29. |t 2e t doc €^ Y?*? [cos V/2au 4- sin v/2au] 
2ω 
3 a T Lm 
30. t2et (re 2w sin γ΄λαω 
ω 
31 Int, 0<ż<1 Ci(w)- C -ln w 
Ui t1 ω 
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No. | f(t) F,(w) = | f(t) sin(tw) dt 
o 
32. . -5 (C +n w) 
: Int T [r 
mo z|g-C-ma 
34. | t(? — a2)! In (bt) = [cos (aw) (In (ab) — Ci(aw)) — sin(aw) - si(aw)| 
35. | t( — a2) ἲ mn (É) -7 [cos (aw) Ci(aw) + sin (aw) si (aw)] 
a 2 
36. | e7% Int 38 |a arctan (5) — Cu — ; uw In (a? 4- o?) 
1 a . ] 
; — fin B + cos (bw) Ci(bw) — cos (aw) Ci(aw) 
44 w 
37. | In =] + sin (bw) Si(bw) — sin (aw) Si(aw) 
+5 [sin (bw) + sin (az)]) 
38. | In L— Z sin (aw) 
a—t ω 
2 2 
39. 2 In (= T τ) = [1 — cos (aw) — aw si(aw)| 
t a—t a 
a+eP+t 2m —-wifat—2 | fw 
40. | In (=) Jê 4 sin (5) 
2 
41. |In|1 — a = [C + In (aw) — cos (aw) Ci(aw) — sin (aw) Si(aw)] 
j ω 
a? +(b+t)? 2T uu 
42. | In (2454) wt sn (bw) 
1 2,2 zn 
43. | - In |1 — a*t*| πο (2) 
t a 
2 
44. T πι |1 — B 7 [C + ln (aw) — Ci(aw)| 




























































































1124 21. Tables 
No. | f(t) F,(w) = | f(t) sin(tw) dt 
o 
45. sin (at) 1 in w+a 
t 2 ω-α 
π 
sin (at) δω 0O<w<a 
46. 72 τ 
σα w>a 
sin (zt) sin w O<w<mt 
47. TM 
Dz WT 
4—ab 
k sin (at) i ; sinh (bu) , 0<w<a 
` | B+E me κ 
Ata sinh (ab) , wa 
1 1 1 
49. | e^ αἱ t Sbe s ee EE 
Bei Biat) 2 li +(a-—w)? BP+(a+ 2] 
50 c^" sin (at) 1 ii b+ (w +a)? 
` t 4 b? + (w— a)? 
1 1 atu? 
51. | e? sin (at) 5 ne 1 ^ sinh (=) 
b 
π 
4 $ 0 « w « 2a 
sin? (at 
59, | sit (a ) π | -- 
8 
0, ω 2a 
0, 0<w<a-—b 
sin (at) sin (bt 
53. m T, a—b<w<atb 
0: w>atb 
1 c 
Bi sin? (at) i (w + 2a) In (w + 2a) 
54. 8 à 
+(w — 2a) In|w — 2a| — σω Inw 
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No. | f(t) F,(w) = | f(t) sin(tw) dt 
0 
π w 
zm Ms 94, c 
pi er (a) 1 ω ( α z) ; 0 « w « 2a 
t πο : 
σα, w > 2a 
0, O<w<a 
56 cos (at) π | een 
i 4 
π 
31 w>a 
7T ab 
m t cos (at) -73° sinh (bw) , θ«ως«α 
τι, [ο τα 
e+e 
5 e™ cosh(ab), w>a 
2 o 2 2 
58. | sin (at? ορ σι ο δι 
7 E 2a feos (= 4α ως 4a 4a 
z sin (at?) T w? z w? 
m t aea A gs 
2 2 2 2 
60. | cos (at? ,/— |sin (—) c (=) — cos (* 2: 
csat) 2a De 4a Λα "em 4a 4a 
cos (at?) T w? w? 
i, Ee τ τ. = 
B t 2 le e us 4a. 
62. | e-*Y* gin (avt) /5 AE cs 
2 20 uw 











21.14.3 Fourier Transformation 


Although F(w) can be represented by the Fourier cosine transformation 7; and the Fourier sine trans- 
formation F; according to (15.75a), here we give some direct transforms F(w). 














No. | f(t) F(w) = pene) dt 
1. | 6(t) (Dirac 6 function) 1 

2. set) (iw) 

3. | 609 (t — a) (iw)"e* (n —0,1,2,...) 
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No. | f(t) F(w) — f e-"f(t) dt 
4. |1 2πδ(ω) 
5. |t 2i" 6) (u) (n-—1,2,...) 
H(t)-1 fo 120 1 š 
6. — +76(w) 
H(t)=0 for t«0 iw 
(Heaviside unit step function) 
7 n n! in s(n 
7. |t" H(t) pam δί)(ω) (n=1,2 
e* H(t) e^ for t>0 1 
8. s - (a > 0) 
e *' H(t) 20 fon t«0 at+iw 
9 1 et /(4a) e" (a0) 
ΓΕ Μπα 
1 
—. e all 
10. πα΄ Vg (a > 0) 
1 T T al 
| t+ a? a 
t 
12. Pua —ire~*+lsign w 
3 H(t+a)—H(t—a)=1 for |t| «a 2sinaw 
- | H(tra)-H(t-a) 20 for σα w 
14. | εαν 2πδ(ω -- α) 
15. | cosat m[6(w + a) + d(w — a)] 
16. | sinat in[d(w + a) — 6(w — a)| 
1 π 
^i cosht μεις 
2 
1 πω 
18. | —— —irtanh — 
1 sinht Ug 
2 
19. | sinat? - e + 3 (a » 0) 
20. | cos at? E EE (a > 0) 
` τ a\4a 4 
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21.14.4 Exponential Fourier Transformation 


Although the exponential Fourier transformation F.(w) can be represented by the Fourier transforma- 


1 
tion F(w) according to (15.77), ie., Fe(w) = gf (-»), here we give some direct transforms. 





No. | f(t) Fw) —1 J. f(t) e at 





(ei = gio) 


f@=A for a<t<b iA 
i Ze 


otherwise 2w 





f(t)=t" for 0<t<b 








1 + —(n ibw = πι : m-—n-1 m 
2. f(t) 20 . otherwise 7 ο E ei mi (σίω) tb 
(n —1,2,...) m=0 
π π v—1 ,—aw for 0 
3. στ Rev >0 T (v) w E or Ww > 
(oq for w <0 
1 0 for w > 0 
4. η. πο Rev > 0 " 
(a - it) (-ω)” 16”  forw«0 
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21.15 Z Transformation 


For definition see 15.4.1.2, p. 794, for rules of calculations see 15.4.1.3, p. 795, for inverses see p. 797 



































No. | Original Sequence fn Transform Convergence Region 
F (z) = Z (fn) 
ılı Ξ ; 41 
z= 
| fer Ξ 41 
P z+1 
9 y 
3 [n G-I z| 21 
-ᾱ 2(2 1-1) - 
4 n (n — T? 2/51 
5 | n8 (22 +4z+4+1) jied 
(z —1)4 
Ζ 
6 | e z| » le? 
ge 
7 | a” Z z| > |α| 
z—a 
8 - ež 2] » 0 
n 
za 
9 [na* ap z| > |α| 
10 | n?a” e τ z| > |α| 
z—a 
n z 
11 (;) G- z|>1 
k tyk 
12 ( ) (1 + -) 230 
n z 
z sin b 
13 | sin bn X2. os RTI 2] »1 
— cos b 
14 | cos bn ο ΠΕΙ, 3) ---1. 
2? — 2z cos b - 1 
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No. | Original Sequence f, Transform Convergence Region 
F (z) = Z (fn) 
ats ze^ sin b i 
15 | e^" sin bn ooh a c z| > |e*| 
-- z(z — e* cos b) . 
16 |e"" cos bn SUE DU EHE z| » le? 
zsinh b 
17 |sinh bn Ace aah bed z| > max(]e*|, |e-*|) 
— cosh b 
18 | cosh bn a m z| » max(|e*|, |e7*|) 
zasinh b 
19 | a" sinh bn ο. z| » max(|ae^|, |ae-*|) 
z(z — a cosh b 
20 | a” cosh bn a E z| » max(|ae^|, |ae-*|) 
n=O fur n#k, 1 
21 ] i i P = {50 
E 2 
25 
22 fm —0,  fanti = 2 3 251 
g^] 
23 fon = 0, 2z(z2 4-1) PEN 
Fanti = 2(2n + 1) (2-1) 
on = z—1 
24 fin —0, In 7I z|>1 
fon+1 = 2n 1 = 
nT z 
25 sS -=> z|>1 
TIT a 2 ΕΤ 
3 
26. | (351) οὔ" (en z| » |e?| 
eb(n+1) _ pa(n+1) 32 p 
27 E Geje) z| > max(|e?], |e^]), a zz b 
1 g? 
28 6 (n — 1) n(n 1) z 1 
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oer Transform : 
No. | Original Sequence f, Convergence Region 
F(z)—Z(fu) 

1 z 

29 | fg20, f,—-—, nz21 In |z| >1 
n g= 

ο ου" "E i 

30 Qn TI Jz sin vs |z| >0 

a, | =I)" 1 

1 08 ---- z 
3 Gn)! ΗΝ d |z| >0 
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21.16 Poisson Distribution 


For the formula of the Poisson distribution see 16.2.3.3, p. 817. 























A 

k 0.1 0.2 0.3 0.4 0.5 0.6 

0 0.904837 | 0.818731 | 0.740818 | 0.670320 | 0.606531 | 0.548812 
1 0.090484 | 0.163746 | 0.222245 | 0.268128 | 0.303265 | 0.329287 
2 0.004524 | 0.016375 | 0.033337 | 0.053626 | 0.075816 | 0.098786 
3 0.000151 | 0.001091 | 0.003334 | 0.007150 | 0.012636 | 0.019757 
4 0.000004 | 0.000055 | 0.000250 | 0.000715 | 0.001580 | 0.002964 
5 0.000002 | 0.000015 | 0.000057 | 0.000158 | 0.000356 
6 0.000001 | 0.000004 | 0.000013 | 0.000035 
7 0.000001 | 0.000003 

λ 

k 0.7 0.8 0.9 1.0 2.0 3.0 

0 0.496585 | 0.449329 | 0.406570 | 0.367879 | 0.135335 | 0.049787 
1 0.347610 | 0.359463 | 0.365913 | 0.367879 | 0.270671 | 0.149361 
2 0.121663 | 0.143785 | 0.164661 | 0.183940 | 0.270671 | 0.224042 
3 0.028388 | 0.038343 | 0.049398 | 0.061313 | 0.180447 | 0.224042 
4 0.004968 | 0.007669 | 0.011115 | 0.015328 | 0.090224 | 0.168031 
5 0.000696 | 0.001227 | 0.002001 | 0.003066 | 0.036089 | 0.100819 
6 0.000081 | 0.000164 | 0.000300 | 0.000511 | 0.012030 | 0.050409 
7 0.000008 | 0.000019 | 0.000039 | 0.000073 | 0.003437 | 0.021604 
8 0.000001 | 0.000002 | 0.000004 | 0.000009 | 0.000859 | 0.008102 
9 0.000001 | 0.000191 | 0.002701 
10 0.000038 | 0.000810 
11 0.000007 | 0.000221 
12 0.000001 | 0.000055 
13 000013 
14 000003 
15 000001 
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(continuation) 
A 

h 4.0 5.0 6.0 7.0 8.0 9.0 

0 0.018316 .0067738 0.002479 0.000912 0.000335 .000123 
1 0.073263 .033690 0.014873 0.006383 0.002684 .001111 
2 0.146525 .084224 0.044618 0.022341 0.010735 .004998 
3 0.195367 .140374 0.089235 0.052129 0.028626 .014994 
4 0.195367 175467 0.133853 0.091126 0.057252 033737 
5 0.156293 175467 0.160623 127717 0.091604 060727 
6 0.104194 146223 0.160623 .149003 0.122138 .091090 
7 0.059540 104445 0.137677 0.149003 0.139587 117116 
8 0.029770 065278 0.103258 0.130377 0.139587 131756 
9 0.013231 .036266 0.068838 0.101405 0.124077 131756 
10 0.005292 018133 0.041303 .070983 0.099262 .118580 
1 0.001925 .008242 0.022529 .045171 0.072190 .097020 
12 0.000642 .003434 0.011264 0.026350 0.048127 072765 
13 0.000197 001321 0.005199 0.014188 0.029616 .050376 
14 0.000056 .000472 0.002228 0.007094 0.016924 .032384 
15 0.000015 .000157 0.000891 .003311 0.009026 .019431 
16 0.000004 .000049 0.000334 .001448 0.004513 .010930 
17 0.000001 000014 0.000118 0.000596 0.002124 05786 
18 .000004 0.000039 0.000232 0.000944 02893 
19 .000001 0.000012 0.000085 0.000397 .001370 
20 0.000004 «900050 0.000159 000617 
21 0.000001 -000010 0.000061 «000264 
22 0.000003 0.000022 .000108 
23 0.000001 0.000008 .000042 
24 0.000003 «000016 
25 0.000001 .000006 
26 .000002 
27 «000001 
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21.17 Standard Normal Distribution 


For the formula of the standard normal distribution see 16.2.4.2, p. 819. 
21.17.1 Standard Normal Distribution for 0.00 € x < 1.99 

x P(x) cz P(x) a P(x) x P(x) x | B(x) 
0.00 | 0.5000 | 0.20 | 0.5793 | 0.40 | 0.6554 | 0.60 | 0.7257 || 0.80 | 0.7881 
0.01 | 0.5040 | 0.21 | 0.5832 .41 | 0.6591 | 0.61 .7291 || 0.81 | 0.7910 
0.02 | 0.5080 | 0.22 | 0.5871 .42 | 0.6628 || 0.62 .7324 || 0.82 | 0.7939 
0.03 | 0.5120 || 0.23 | 0.5910 .43 | 0.6664 | 0.68 .7357 || 0.83 | 0.7967 
0.04 | 0.5160 || 0.24 | 0.5948 .44 | 0.6700 | 0.64 .7389 || 0.84 | 0.7995 
0.05 | 0.5199 || 0.25 | 0.5987 45 | 0.6736 || 0.65 .7422 || 0.85 | 0.8023 
0.06 | 0.5239 || 0.26 | 0.6026 46 | 0.6772 || 0.66 .7454 || 0.86 | 0.8051 
0.07 | 0.5279 || 0.27 | 0.6064 47 | 0.6808 || 0.67 | 0.7486 || 0.87 | 0.8079 
0.08 | 0.5319 || 0.28 | 0.6103 48 | 0.6844 || 0.68 .7517 || 0.88 | 0.8106 
0.09 | 0.5359 | 0.29 | 0.6141 .49 | 0.6879 | 0.69 .7549 || 0.89 | 0.8133 
0.10 | 0.5398 | 0.30 | 0.6179 | 0.50 | 0.6915 || 0.70 | 0.7580 || 0.90 | 0.8159 
0.11 | 0.5438 | 0.31 | 0.6217 .51 | 0.6950 | 0.71 -7611 |] 0.91 | 0.8186 
0.12 | 0.5478 | 0.32 | 0.6255 .52 | 0.6985 | 0.72 -7642 || 0.92 | 0.8212 
0.13 | 0.5517 || 0.33 | 0.6293 .53 | 0.7019 | 0.73 .7673 || 0.93 | 0.8238 
0.14 | 0.5557 | 0.34 | 0.6331 54 | 0.7054 || 0.74 .7704 || 0.94 | 0.8264 
0.15 | 0.5596 || 0.35 | 0.6368 .55 | 0.7088 || 0.75 .7734 || 0.95 | 0.8289 
0.16 | 0.5636 | 0.36 | 0.6406 .56 | 0.7123 | 0.76 .7764 || 0.96 | 0.8315 
0.17 | 0.5675 || 0.37 | 0.6443 57 | 0.7157 || 0.77 | 0.7794 || 0.97 | 0.8340 
0.18 | 0.5714 || 0.38 | 0.6480 58 | 0.7190 || 0.78 .7823 || 0.98 | 0.8365 
0.19 | 0.5753 || 0.39 | 0.6517 59 | 0.7224 || 0.79 .7852 || 0.99 | 0.8389 
























































x P(x) cz P(x) x P(x) x P(x) x P(x) 
1.00 | 0.8413 | 1.20 | 0.8849 | 1.40 | 0.9192 | 1.60 | 0.9452 | 1.80 | 0.9641 
1.01 | 0.8438 | 1.21 | 0.8869 .41 | 0.9207 | 1.61 | 0.9463 | 1.81 | 0.9649 
1.02 | 0.8461 | 1.22 | 0.8888 .42 | 0.9222 || 1.62 | 0.9474 | 1.82 | 0.9656 
1.03 | 0.8485 | 1.23 | 0.8907 .43 | 0.9236 | 1.63 | 0.9484 | 1.83 | 0.9664 
1.04 | 0.8508 | 1.24 | 0.8925 .44 | 0.9251 | 1.64 | 0.9495 | 1.84 .9671 
1.05 | 0.8531 | 1.25 | 0.8944 .45 | 0.9265 | 1.65 | 0.9505 | 1.85 9678 
1.06 | 0.8554 | 1.26 | 0.8962 .46 | 0.9279 | 1.66 | 0.9515 | 1.86 9686 
1.07 | 0.8577 | 1.27 | 0.8980 .47 | 0.9292 | 1.67 | 0.9525 | 1.87 | 0.9693 
1.08 | 0.8599 | 1.28 | 0.8997 .48 | 0.9306 | 1.68 | 0.9535 | 1.88 | 0.9699 
1.09 | 0.8621 | 1.29 | 0.9015 .49 | 0.9319 | 1.69 | 0.9545 | 1.89 9706 
1.10 | 0.8643 | 1.30 | 0.9032 | 1.50 | 0.9332 || 1.70 | 0.9554 | 1.90 | 0.9713 
1.11 | 0.8665 | 1.31 | 0.9049 51 | 0.9345 | 1.71 | 0.9564 | 1.91 | 0.9719 
1.12 | 0.8686 | 1.32 | 0.9066 .52 | 0.9357 | 1.72 | 0.9573 | 1.92 9726 
1.13 | 0.8708 | 1.33 | 0.9082 .53 | 0.9370 | 1.73 | 0.9582 | 1.93 | 0.9732 
1.14 | 0.8729 | 1.34 | 0.9099 .54 | 0.9382 | 1.74 | 0.9591 | 1.94 .9738 
1.15 | 0.8749 | 1.35 | 0.9115 .55 | 0.9394 | 1.75 | 0.9599 | 1.95 9744 
1.16 | 0.8770 | 1.36 | 0.9131 .56 | 0.9406 | 1.76 | 0.9608 | 1.96 | 0.9750 
1.17 | 0.8790 | 1.37 | 0.9147 -57 | 0.9418 | 1.77 | 0.9616 | 1.97 | 0.9756 
1.18 | 0.8810 | 1.38 | 0.9162 -58 | 0.9429 | 1.78 | 0.9625 | 1.98 | 0.9761 
1.19 | 0.8830 | 1.39 | 0.9177 .59 | 0.9441 | 1.79 | 0.9633 | 1.99 9767 




































































1134 21. Tables 





21.17.2 Standard Normal Distribution for 2.00 < x < 3.90 




































































x P(x) £ B(x) x B(x) x P(x) x P(x) 
2.00 | 0.9773 | 2.20 | 0.9861 | 2.40 | 0.9918 | 2.60 | 0.9953 || 2.80 | 0.9974 
2.01 | 0.9778 | 2.21 .9864 | 2.41 | 0.9920 | 2.61 | 0.9955 | 2.81 .9975 
2.02 | 0.9783 | 2.22 .9868 | 2.42 | 0.9922 | 2.62 | 0.9956 | 2.82 .9976 
2.03 | 0.9788 | 2.23 | 0.9871 | 2.43 | 0.9925 | 2.63 | 0.9957 | 2.83 | 0.9977 
2.04 | 0.9793 | 2.24 | 0.9875 | 2.44 | 0.9927 | 2.64 | 0.9959 | 2.84 | 0.9977 
2.05 | 0.9798 | 2.25 | 0.9878 | 2.45 | 0.9929 | 2.65 | 0.9960 | 2.85 | 0.9978 
2.06 | 0.9803 | 2.26 | 0.9881 | 2.46 | 0.9931 | 2.66 | 0.9961 | 2.86 | 0.9979 
2.07 | 0.9808 | 2.27 | 0.9884 | 2.47 | 0.9932 | 2.67 | 0.9962 | 2.87 | 0.9979 
2.08 | 0.9812 | 2.28 | 0.9887 | 2.48 | 0.9934 | 2.68 | 0.9963 | 2.88 | 0.9980 
2.09 | 0.9817 | 2.29 | 0.9890 | 2.49 | 0.9936 | 2.69 | 0.9964 | 2.89 | 0.9981 
2.10 | 0.9821 | 2.30 | 0.9893 | 2.50 | 0.9938 | 2.70 | 0.9965 || 2.90 | 0.9981 
2.11 | 0.9826 | 2.31 .9896 | 2.51 | 0.9940 | 2.71 | 0.9966 | 2.91 .9982 
2.12 | 0.9830 | 2.32 .9894 | 2.52 | 0.9941 | 2.72 | 0.9967 | 2.92 .9983 
2.13 | 0.9834 | 2.33 | 0.9901 | 2.53 | 0.9943 | 2.73 | 0.9968 | 2.93 | 0.9983 
2.14 | 0.9838 | 2.34 | 0.9904 | 2.54 | 0.9945 | 2.74 | 0.9969 | 2.94 | 0.9984 
2.15 | 0.9842 | 2.35 | 0.9906 | 2.55 | 0.9946 | 2.75 | 0.9970 | 2.95 | 0.9984 
2.16 | 0.9846 | 2.36 | 0.9909 | 2.56 | 0.9948 | 2.76 | 0.9971 | 2.96 | 0.9985 
2.17 | 0.9850 | 2.37 | 0.9911 | 2.57 | 0.9949 | 2.77 | 0.9972 | 2.97 | 0.9985 
2.18 | 0.9854 | 2.38 | 0.9913 | 2.58 | 0.9951 | 2.78 | 0.9973 | 2.98 | 0.9986 
2.19 | 0.9857 | 2.39 | 0.9916 | 2.59 | 0.9952 | 2.79 | 0.9974 | 2.99 | 0.9986 

x P(x) cz P(x) x P(x) x P(x) cz P(x) 
3.00 | 0.9987 | 3.20 | 0.9993 | 3.40 | 0.9997 || 3.60 | 0.9998 | 3.80 | 0.9999 
3.10 | 0.9990 | 3.30 | 0.9995 | 3.50 | 0.9998 | 3.70 | 0.9999 | 3.90 | 0.9999 
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21.18 x? Distribution 


For the formula of the x? distribution see 16.2.4.6, p. 822. 


x? Distribution: Quantile XÀ m 














Degree 
of Probability a 
Freedom 
m 0.99 0.975 0.95 005 0025 001 
1 0.00016 0.00098 . 0.0039 3.8 5.0 6.6 
2 0.020 0.051 0.103 6.0 7.4 9.2 
3 0.115 0.216 0.352 7.8 9.4 113 
4 0.297 0.484 0.711 9.5 111 133 
5 0.554 0.831 1.15 11.1 128 151 
6 0.872 1.24 1.64 126 14.44 16.8 
7 1.24 1.69 2.17 141 16.0 18.5 
8 1.65 2.18 2.73 155 175 20.1 
9 2.09 2.70 3.33 169 419.0 217 
10 2.56 3.25 3.94 183 20.5 23.2 
1 3.05 3.82 4.57 19.7 219 24.7 
12 3.57 4.40 5.23 21.0 23.3 262 
13 4.11 5.01 5.89 224 24.7 27.7 
14 4.66 5.63 6.57 23.7 26.1 29.1 
15 5.23 6.26 7.26 25.0 27.5 30.6 
16 5.81 6.91 7.96 26.3 28.8 320 
17 6.41 7.56 8.67 27.06 30.2 334 
18 7.01 8.23 9.39 28.9 31. 348 
19 7.63 8.91 10.1 30.1 32.9 36.2 
20 8.26 9.59 10.9 314 34.2 37.6 
21 8.90 10.3 11.6 32.7 35.5 38.9 
22 9.54 11.0 12.3 33.9 36.8 40.3 
23 10.2 11.7 13.1 35.2 38.1 41.6 
24 10.9 12.4 13.8 36.4 39.4 43.0 
25 11.5 13.1 14.6 37.7 406 44.3 
26 12.2 13.8 15.4 38.9 41.9 45.6 
27 12.9 14.6 16.2 40.1 43.2 47.0 
28 13.6 15.3 16.9 41.3 445 483 
29 14.3 16.0 E 42.6 45.7 49.6 
30 15.0 16.8 18.5 43.8 47.0 50.9 
40 22.2 24.4 26.5 55.8 59.3 63.7 
50 29.7 32.4 34.8 67.5 714 76.2 
60 37.5 40.5 43.2 79.1 83.3 884 
70 45.4 48.8 51.7 90.5 95.0 100.4 
80 53.5 57.2 60.4 101.9 106.6 112.3 
90 61.8 65.6 69.1 113.1 1181 124.1 
100 70.1 74.2 77.9 124.3 129.6 135.8 
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21.19 Fisher P Distribution 
For the formula of the Fisher F distribution see 16.2.4.7, p. 823. 


Fisher F' Distribution: Quantile fom;jm, for a = 0.05 











my 
m2 
1 2 3 4 5 6 8 12 24 30 40 oo 
1 | 161.4] 199.5 | 215.7 | 224.6 | 230.2 | 234.0 | 238.9 | 243.9 | 249.0 | 250.0) 251.0 | 254.3 
2 | 18.51) 19.00] 19.16 | 19.25 | 19.30 | 19.33 | 19.37 | 19.41 | 19.45 | 19.46 | 19.47) 19.50 
3 |10.13| 9.55} 9.28 9.12| 9.01| 8.94| 8.85] 8.74] 8.64) 8.62] 8.59} 8.53 
4 7.71] 6.94] 6.59 6.39) 6.26] 6.16] 6.04} 5.91} 5.77) 5.75] 5.72} 5.63 
5 6.61) 5.79) 5.41 5.19} 5.05) 4.95] 4.82] 468) 4.53] 4.50) 4.46] 4.36 
6 5.99] 5.14] 4.76 4.53| 4.39| 4.28| 4.15} 4.00} 3.84) 3.81] 3.77] 3.67 
7 5.59) 4.74) 4.35 4.12) 3.97] 3.87] 3.73| 3.57] 3.41) 3.38] 3.34] 3.23 
8 5.382] 4.46) 4.07 3.84) 3.69) 3.58) 3.44 | 3.28} 3.12) 3.08] 3.05} 2.93 
9 5.12) 4.26] 3.86 3.63) 3.48] 3.37] 3.23} 3.07} 2.90) 2.86] 2.83] 2.71 


10 4.96 4.10 3.71 | 3.48) 3.33) 3.22 3.07 2.91 2.74 2.70| 2.66) 2.54 


11 4.84) 3.98) 3.59 | 3.36] 9.201 9.091 2.95 | 2.9 2.61| 2.57, 2.53) 2.40 
12 4.75} 3.89} 3.49 | 3.26) 3.11] 3.00] 2.85] 2.69] 2.51} 2.47) 2.43) 2.30 
19 4.67| 3.81] 3.41 | 3.18] 3.03] 2.92] 2.77] 2.60] 2.42) 2.38] 2.34] 2.21 
14 460] 9.14] 9.34 93.11] 2.96] 2.85) 2.70} 2.53] 2.35 2.31) 2.27) 2.13 
15 4.54] 3.68} 3.29 | 3.06] 2.90] 2.79] 2.64} 2.48] 2.29} 2.25) 2.20) 2.07 


16 449] 3.63] 3.24 | 3.01] 2.85] 2.74] 2.59) 2.42) 2.24) 2.19} 2.15] 2.01 
17 4.45} 3.59} 3.20 | 2.96] 2.81] 2.70) 2.55} 2.38] 2.19} 2.15) 2.10) 1.96 
18 4.41} 3.55} 3.16 | 2.93] 2.77] 2.66) 2.51] 2.34] 2.15] 2.11) 2.06) 1.92 
19 4.38 | 3.52} 3.13 | 2.90] 2.74] 2.63] 2.48] 2.31] 2.11] 2.07) 2.03) 1.88 
20 4.35] 3.49} 3.10 | 2.87] 2.71] 2.60] 2.45) 2.28} 2.08] 2.04) 1.99) 1.84 


21 4.32| 3.47| 3.07 | 2.84] 2.68] 2.57] 2.42) 2.25) 2.05) 2.01} 1.96] 1.81 
22 4.30] 3.44} 3.05 | 2.82] 2.66] 2.55) 2.40] 2.23] 2.03} 1.98) 1.94) 1.78 
23 4.28] 3.42} 3.03 | 2.80] 2.64] 2.53) 2.37, 2.20} 2.00} 1.96) 1.91) 1.76 
24 4.000 3.40 3.01 | 2.78) 2.62) 2.51) 2.36] 2.18] 1.98] 1.94) 1.89) 1.73 
25 4.24} 3.39} 2.99 | 2.76] 2.60] 2.49] 2.34) 2.16) 1.96] 1.92} 1.87] 1.71 


26 4.23 | 3.37} 2.98 | 2.74] 2.59] 2.47) 2.32} 2.15] 1.95} 1.90) 1.85) 1.69 
27 4.21] 3.35} 2.96 | 2.73] 2.57) 2.46) 2.31} 2.13] 1.93] 1.88) 1.84) 1.67 
28 4.000 3.84 2.95 | 2.71) 2.56) 2.45) 2.29) 2.12] 1.91] 1.87) 1.82) 1.65 
29 4.1δ 3.33} 2.93 | 2.70] 2.55) 2.43) 2.28) 2.10} 1.90] 1.85) 1.80) 1.64 
30 417, 3.82 2.92 | 2.69] 2.53) 2.42) 2.27) 2.09] 1.89] 1.84) 1.79) 1.62 


40 4.08) 3.28| 2.84 | 2.61) 2.45] 2.34] 2.18) 2.00) 1.79) 1.74} 1.69] 1.51 
60 4.00} 3.15} 2.76 | 2.53) 2.37] 2.25) 2.10} 1.92] 1.70} 1.65) 1.59) 1.39 
125 3.92, 3.07 2.68 | 2.44] 2.29) 2.17) 2.01} 1.83] 1.60] 1.55) 1.49) 1.25 
oo 3.84} 3.00} 2.60 | 2.37, 2.21) 2.10) 1.94] 1.75] 1.52} 1.46) 1.39) 1.00 
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Fisher F Distribution: Quantile f5,5,,; for a — 0,01 
m; 
m2 
1 2 3 4 5 6 8 12 24 30 40 oo 

1 | 4052| 4999| 5403| 5625| 5764| 5859| 5981| 6106| 6235| 6261| 6287| 6366 
2 | 98.50 | 99.00 | 99.17 | 99.25 | 99.30 | 99.33 | 99.37 | 99.42 | 99.46 | 99.47 | 99.47 | 99.50 
3 | 34.12 | 30.82 | 29.46 | 28.71 | 28.24 | 27.91 | 27.49 | 27.05 | 26.60 | 26.50 | 26.41 | 26.12 
4 | 21.20) 18.00 | 16.69 | 15.98 | 15.52 | 15.21 | 14.80 | 14.37 | 13.93 | 13.84 | 13.74 | 13.46 
5 | 16.26 | 13.27 | 12.06 | 11.39) 10.97) 10.67] 10.29] 9.89] 9.47] 9.38] 9.29} 9.02 
6 | 13.74] 10.92] 9.78] 9.15} 8.75} 847] 8.10] 7.72} 7.31) 7.23) 7.14] 6.88 
7 |12.25| 9.55} 8.45} 7.85) 7.46) 7.19) 6.84] 6.47] 6.07] 5.99] 5.91] 5.65 
8 11126. 8.65] 7.59} 7.01) 6.63) 6.37) 6.03] 5.67] 5.28] 5.20] 5.12] 4.86 
9 | 10.56] 8.02] 6.99] 6.42} 6.06] 5.80} 5.47] 5.11] 4.73) 4.65) 4.57] 4.31 
10 | 10.04} 7.56] 6.55) 5.99] 5.64] 5.39] 5.06} 4.71} 4.33} 4.25} 4.17] 3.91 
T1 9.65 | 7.21} 6.22} 5.67) 5.32) 5.07| 4.74] 4.40] 4.02] 3.94] 3.86] 3.60 
12 9.33] 6.93} 5.95} 5.41) 5.06) 4.82) 4.50] 4.16] 3.78] 3.70] 3.62] 3.36 
13 9.07| 6.70} 5.74} 5.21) 4.86) 4.62) 4.30] 3.96] 3.59] 3.51] 3.43] 3.16 
14 8.86| 6.51] 5.56} 5.04) 4.70) 4.46) 4.14] 3.80] 3.43] 3.35] 3.27] 3.00 
15 8.08] 6.36] 5.42} 4.89) 4.56) 4.32) 4.00] 3.67] 3.29] 3.21] 3.13] 2.87 
16 8.53] 0.23] 5.29} 4.77) 4.44) 4.20) 3.89] 3.55] 3.18] 3.10] 3.02] 2.75 
17 840! 6.11| 5.18} 4.67) 4.34] 4.10] 3.79} 3.46} 3.08) 3.00] 2.92] 2.65 
18 8.29] 6.01} 5.09} 4.58) 4.25) 4.01) 3.71] 3.387] 3.00] 2.92] 2.84] 2.57 
19 8.18} 5.93} 5.01} 4.50) 4.17) 3.94) 3.63] 3.30] 2.92] 2.84] 2.76] 2.49 
20 8.10) 5.85| 4.94] 4.43] 4.10) 3.87] 3.56] 3.23] 2.86) 2.78] 2.69} 2.42 
21 8.02] 5.78| 4.87] 4.37] 4.04] 3.81] 3.51] 3.17] 2.80) 2.72] 2.64] 2.36 
22 7.95) 5.72| 4.82] 4.31) 3.99] 3.76] 3.45] 3.12} 2.75) 2.67] 2.58} 2.31 
23 7.88] 5.66| 4.76] 4.26] 3.94] 3.71] 3.41] 3.07] 2.70) 2.62] 2.54] 2.26 
24 7.82) 5.61| 4.72) 4.22) 3.90] 3.67] 3.36] 3.03} 2.66) 2.58] 2.49} 2.21 
25 7.77| 5.57| 4.68] 4.18] 3.86] 3.63] 3.32} 2.99} 2.62) 2.54] 2.45] 2.17 
26 7.72) 5.53| 4.64] 4.14] 3.82] 3.59] 3.29} 2.96] 2.58) 2.50] 2.42} 2.13 
27 7.68) 5.49) 4.60} 4.11] 3.78) 3.56) 3.26] 2.93} 2.55) 2.47] 2.38] 2.10 
28 7.64) 5.45| 4.57] 4.07] 3.76) 3.53] 3.23} 2.90} 2.52) 2.44] 2.35] 2.06 
29 7.60) 5.42) 4.54] 4.04] 3.73| 3.50] 3.20} 2.87} 2.49) 2.41] 2.33} 2.03 
30 7.56| 5.39} 4.51} 4.02) 3.70) 3.47) 3.17] 2.84] 2.47] 2.38] 2.30} 2.01 
40 7.31) 5.18] 4.31] 3.83} 3.51] 3.29] 2.99} 2.66} 2.29) 2.20] 2.11] 1.80 
60 7.08) 4.98] 4.13] 3.65} 3.34] 3.12] 2.82} 2.50} 2.12) 2.03] 1.94] 1.60 
125 6.84] 4.78} 3.94] 3.48) 3.17) 2.95) 2.66] 2.33] 1.94] 1.85] 1.75] 1.37 
oo 6.63} 4.60} 3.78} 3.32) 3.02| 2.80| 2.51] 2.18] 1.79} 1.70] 1.59} 1.00 
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21.20 Student t Distribution 


For the formula o 


the Student t distribution see 16.2.4.8, p. 824. 


Student t Distribution: Quantile tam or ta/2m 

















Degree 
of Probability o for Two-Sided Problem 
Freedom 
m 0.10 0.05 0.02 0.01 0.002 0.001 
1 6.31 12.7 31.82 63.7 318.3 637.0 
2 2.02 4.30 6.97 9.92 22.33 31.6 
3 2.85 3.18 4.54 5.84 10.22 12.9 
4 2.13 2.78 3.75 460 CLT 8.61 
5 2.01 2.57 3.97 4.03 5.89 6.86 
6 1.94 2.45 3.14 3.71 5.21 5.96 
7 1.89 2.36 3.00 3.50 4.79 5.40 
8 186 2.31 2.90 3.36 4.50 5.04 
9 1.83 2.26 2.82 3.25 4.30 4.78 
10 1.81 2.23 2.76 3.17 4.14 4.59 
11 1.80 2.20 2./70 3.11 4.03 4.44 
12 1.78 2.18 2.68 | 3.05 3.93 4.32 
13 1.77; 246 2.65 3.01 3.85 4.22 
14 176 2.14 2.62 2.98 3.79 4.14 
15 1.75 243 2.60 2.95 3.73 4.07 
16 175. 22 2.58 2.02 3.69 4.01 
17 174 2411 2.57 2.90 3.65 3.96 
18 1.73 2.10 2.55 2.88 3.61 3.92 
19 1.73 2.09 2.54 2.86 3.58 3.88 
20 1.73 2.09 2.53 2.85 3.55 3.85 
21 1.72 2.08 252 2.83 3.53 3.82 
22 1.72 2.07 251 2.82 3.51 3.79 
23 1.71 2.07 2.50 2.81 3.49 3.77 
24 1.71 2.06 2.49 2.80 3.47 3.74 
25 1.71 2.06 2.49 2.79 3.45 3.72 
26 1.71 2.06 2.48 2.78 3.44 3.71 
27 1.71 2.05 2.47 2.77 3.42 3.69 
28 1.70 2.05 2.46 2.76 3.40 3.66 
29 1.70 2.05 2.46 2.76 3.40 3.66 
30 1.70 2.04 2.46 2.75 3.39 3.65 
40 1.68 2.02 2.42 2.70 3.31 3.55 
60 1.67 2.00 2.39 2.66 3.23 3.46 
120 1.66 1.98 2.6 262 3.17 3.37 
oo 1.64 1.96 2.83 2.58 3.09 3.29 
0.05 0.025 0.01 0.005 0.001 0.0005 








Probability o for One-Sided Problem 
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21.21 Random Numbers 


For the meaning of random numbers see 16.3.5.2, p. 843. 





4730 1530 [8004 |7993 |3141 [0103 |4528 |7988 |4635 |8478 [9094 [9077 |5306 |4357 |8353 
0612 |2278 |8634 |2549 |3737 |7686 |0723 |4505 |6841 |1379 16460 |1869 |5700 |5339 |6862 
0285 |1888 |9284 |3672 |7033 |4844 10149 |7412 |6370 |1884 |0717 |5740 |8477 |6583 |0717 
7768 |9078 |3428 |2217 [0293 |3978 |5933 |1032 |5192 |1732 |2137 |9357 |5941 (6564 |2171 
4450 [8085 [8931 |3162 |9968 |6369 |1256 [0416 |4326 |7840 |6525 |2608 |5255 |4811 |3763 


17332 |6563 |4013 [7406 [4439 [5683 [6877 [2920 |9588 [3002 [2869 |3746 [3690 (6931 |1230 
4044 [1643 [9005 |5969 |9442 |7696 |7510 |1620 |4973 |1911 11255 16160 |9797 |8755 |6120 
0067 |7697 |9278 |4765 [9647 |4364 |1037 |4975 |1998 |1359 |1346 |6125 [5078 |6742 |3443 
5358 |5256 |7574 |3219 |2532 |7577 |2815 |8696 |9248 |9410 |9282 |6572 |3940 |6655 |9014 
0038 |4772 [0449 [6906 |8859 |5044 |8826 |6218 |3206 |9034 |0843 |9832 |2703 |8514 |4124 


8344 |2271 |4689 [3835 |2938 |2671 |4691 [0559 |8382 |2825 |4928 |5379 |8635 |8135 |7299 
7164 |7492 |5157 |8731 |4980 |8674 |4506 |7262 |8127 |2022 |2178 |7463 |4842 |4414 |0127 
7454 |7616 |8021 |2995 |7868 [0683 |3768 |0625 |9887 |7060 [0514 |0034 [8600 |3727 |5056 
3454 |6292 [0067 |5579 |9028 15660 [5006 |8325 |9677 |2169 |3196 [0357 |7811 |5434 |0314 
0401 |7414 |3186 [3081 |5876 |8150 |1360 |1868 |9265 |3277 |8465 |7502 |6458 |7195 |9869 


6202 |0195 |1077 [7406 |4439 [5683 1/6877 2920 |9588 [3002 |2869 |3746 13690 [2705 |6251 
8284 |0338 |4286 |5969 |9442 |7696 |7510 |1620 |6973 |1911 |1288 |6160 [9797 |1547 |4972 
9056 |0151 |7260 |4765 |9647 |4364 |1037 |4975 |1998 |1359 |1346 |6125 |5078 |3424 |1354 
9747 |3840 |7921 [3219 |2532 |7577 |2815 |8696 |9248 |9410 |9282 |6572 |3940 |8969 |3659 
2992 |8836 |3342 [6906 |8859 |5044 |8826 |6218 |3206 |9034 [0843 |9832 |2703 |5225 |8898 


6170 |4595 |2539 |7592 |1339 [4802 |5751 [3785 |7125 |4922 [8877 |9530 16499 (6432 |1516 
3265 |8619 |0814 [5133 |7995 |8030 |7408 |2186 |0725 |5554 |5664 |6791 [9677 |3085 |8319 
0179 |3949 [6995 |3170 |9915 [6960 |2621 |6718 |4059 9919 |1007 |6469 |5410 [0246 |3687 
1839 (6042 9650 |3024 |0680 |1127 |8088 |0200 |5868 |0084 |6362 |6808 |3727 |8710 |6065 
2276 |8078 |9973 |4398 |3121 |7749 |8191 |2087 |8270 |5233 [3980 |6774 |8522 |5736 |3132 


4146 [9952 |7945 |5207 |1967 [7325 |7584 |3485 |5832 |8118 18433 10606 |2719 |2889 |2765 
3526 |3809 |5523 [0648 |3326 |1933 [6265 [0649 |6177 |2139 |7236 |0441 |1352 |1499 |3068 
3390 |7825 |7012 [9934 |7022 |2260 [0190 |1816 |7933 |2906 |3030 |6032 |1685 |3100 |1929 
4806 [9286 |5051 |4651 |1580 |5004 |8981 |1950 |2201 |3852 |6855 |5489 |6386 |3736 |0498 
7959 |5983 |0204 |4325 |5039 |7342 |7252 |2800 |4706 |6881 |8828 |2785 |8375 |7232 |2483 


8245 |9611 [0641 |7024 |3899 [8981 |1280 |5678 |8096 |7010 |1435 |7631 [7361 [8903 |8684 
7551 |4915 |2913 [9031 [9735 |7820 |2478 |9200 |7269 |6284 |9861 |2849 |2208 |8616 |5865 
5903 |2744 |7318 |7614 |5999 |1246 |9759 |6565 |1012 [0059 |2419 [0036 |2027 |5467 |5577 
9001 |4521 |5070 |4150 |5059 |5178 |7130 |2641 |7812 |1381 |6158 |9539 |3356 |5861 |9371 
0265 |3305 |3814 [0973 |4958 |4830 [6297 [0575 |4843 [3437 |5629 |3496 |5406 |4790 |9734 
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antikink soliton, 607 


convex angle, 130 antilog, 10 
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right angle, 130 
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Cardan, 214, 295 
complementary, 130 
corresponding, 130 
Euler, 215, 296 
exterior-interior, 130 
names, 129 
opposite, 130 
sum 
plane triangle, 133 
spherical triangle, 165 
supplementary, 130 
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angular 


coefficient, curve third degree, 68 
coefficient, plane, 195 
frequency, 84 


annuity, 23, 24 


calculation, 25 
payment, 25 
perpetual, 25 


annulator, 682 
annulus, 141 


theorem, 200 


apothem, 133 
applicate, 210 
approximate equation, 549 
approximate formula, empirical curve, 108 
approximation, 982 


arc, 


asymptotic, polynomial part, 15 
best, Hilbert space, 675 
bicubic, 1000 
Chebyshev, 988 
6 function, 777 
formulas, series expansion, 472 
in mean, 456, 984 
Liouville theorem, 4 
numbers, 4 
partial differential equations, 977 
problem, 674 

solution by extreme value, 456 
successive 

Banach space, 679 


differential equations, ordinary, 549 


integral equation, 625 
uniform, 988 
using given functions, 974 
Weierstrass theorem, 665 
131, 161 
ellipse, 201 
graph, 401 
chain, 410 
hyperbola, 204 
intersection, 150 
length 
circular segment, 141 
line integral, first type, 517 
parabola, 206 
plane curve, 141, 502 
space curve, 264, 517 
sequences, 410 


Archimedean spiral, 105 


area 


annulus, 141 

sector, 141 
circle, 140 
circular sector, 141 
circular segment, 141 
cosine, 93 
cotangent, 94 
curved surface, 535 
curvilinear bounded, 501 
curvilinear sector, 501 
double integral, 527 
ellipse, 201 
formula, Heron’s, 144 
function, 93 
hyperbola, 204 
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parabola, 206 
parallelogram, 135 
with vectors, 190 
planar figures, 501 
polyeder with vectors, 190 
polygon, 194 
rectangle, square, 136 
rhombus, 136 
similar plane figures, 134 
sine, 93 
subset, 693 
surface patch, 265 
tangent, 94 
triangle, 194 
plane, 142, 144 
spherical, 165, 169 
argument 
function of one variable, 48 
function of several variables, 118 
arithmetic, 1 
sequence, 18 
Arnold tongue, 907 
arrangement, 805, 806 
with repetition, 806 
without repetition, 806 
arrow 
diagram, 331 
function, 331 
article number, European, 384 
ASCII (American Standard Code for Information Inter- 
change), 1001 
associative law 
Boolean algebra, 396 
generalized imaginary units, 290 
matrices, 272 
missing, 356 
scalar product, 185 
vector product, 185 
propositional logic, 324 
quaternions, 290 
sets, 330 
tensors, 282 
astroid, 104, 527 
asymptote 
curve, 249, 252 
definition, 252 
attractor, 860 
chaotic, 888 
examples, 887 
fractal, 888 
Hénon, 886, 888 
hyperbolic, 888 
Lorenz, 887, 892 
quantitative description, 876 
solenoid, 888 
strange, 888 
autocorrelation function, 878 
axiomatization, probability theory, 809 
axioms 
algebra, 672 
closed set, 664 
metric spaces, 662 
normed space, 669 


open set, 664 
pseudonorm, 682 
scalar product, 673 
vector space, 654 
ordered, 659 
axis 
abscissae 
plane coordinates, 190 
space coordinates, 210 
ordinates 
plane coordinates, 190 
space coordinates, 210 
parabola, 204 
azimuth, 161 
azimuthal equation, 599 


backward substitution, 956 
Baire, second Baire category, 874 
Bairstow method, 954 
ball, metric space, 663 
Banach 
fixed-point theorem, 689 
space, 670 
example, 670 
series, 670 
theorem, continuity, inverse operator, 679 
band structure, coefficient matrix, 960 
barrel, 159 
circular, 159 
parabolic, 159 
base, 7 
power, 7 
vector, 187 
reciprocal, 186, 187 
vector space, 366 
basis, 657 
Clifford algebra, 289 
contravariant, 285 
covariant, 284 
Lie Algebra, 357 
orthonormal, 358 
vector 
contravariant, 284 
covariant, 284 
vector space, 657 
Bayes theorem, 810 
B-B representation 
curve, 1000 
surface, 1000 
bending, 237 
Berge theorem, 409 
Bernoulli 
inequality, 30 
numbers, 465 
shift mapping, 876 
shift, chaotic, 889 
Bernoulli-l’Hospital rule, 56 
Bernstein polynomial, 1000 
Bessel 
differential equation, 562 
differential equation, linear, zero order, 783 
function, 562 
imaginary variables, 563 
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modified, 563 
functions 
spherical, 564 
inequality, 676 
beta function, 1098 
biangle, spherical, 163 
bifurcation, 604 
Andronov-Hopf, 894 
Bogdanov-Takens, 896 
codimension, 892 
cusp, 895 
double semistable periodic orbits, 898 
flip, 898 
global, 892, 901 
homoclinic, 902 
Hopf, 894 
generalized, 897 
local, 892 
mappings, subcritical saddle node, 898 
pitchfork, 899 
supercritical, 896 
saddle node, 893 
transcritical, 894 
bilinear, 356 
bilinear form, 368 
antisymmetric, 368 
positive definite, 368 
symmetric, 368 
binary number, 876, 1002 
binary system, 1002 
binomial, 71 
coefficient, 13 
distribution, 815 
formula, 12 
linear, 71 
quadratic, 71 
theorem, 12 
binormal, space curve, 257, 259 
biquaternion, 290 
normalized, 306 
biquaternions, 306 
multiplication, 306 
rigid-body motion, 306 
birth process, 829 
bisection method, 319 
bisector, 143 
triangle, 133 
bit, 1001 
reversing the order, 995 
block, 153 
body of revolution, lateral surface, 502 
Bolzano theorem 
one variable, 61 
several variables, 124 
Bolzano-Weierstrass property, 686 
Boolean 
algebra, 395, 807 
finite, 397 
expression, 397 
function, 324, 397 
n-ary, 397 
variable, 397 
Borel 





set, 694 
c algebra, 694 
bound 
function, 51 
sequence, 457 
boundary 
collocation, 978 
condition, 610 
uncertain for some variables (fuzzy), 422 
boundary value 
conditions, 540, 569 
problem, 540, 569, 973 
Hilbert, 651 
Hilbert, homogeneous, 651 
Hilbert, inhomogeneous, 652 
homogeneous, 569 
inhomogeneous, 569 
linear, 569 
bounded set (order-bounded), 660 
boundedness of a function 
one variable, 61 
several variables, 124 
Box-Müller method, 934 
brackets, Lie, 356, 596 
Bravais lattice, 349 
break of symmetry, bifurcation, 897 
Breit-Wigner curve, 791 
Brodetsky-Smeal method, 45 
Brouwer fixed-point theorem, 691 
business mathematics, 21 


byte, 1001 


calculation 
complex numbers, 36 
coordinates, polar and rectangular, 146 
determinants, 278 
numerical 
accuracy, 1006 
basic operations, 1005 
polynomials, shift register, 364 
rule 
irredicible polynomials, 363 
quaternions, 292 
tensors, 282 
triangle, oblique plane, 142 
triangle, plane, 144 
triangle, right-angled plane, 142 
triangle, spherical, 171 
calculus 
differentiation, 432 
errors, 848 
integral, 480 
observations, measurement error, 848 
propositional, 323 
variations, 610 
canonical form, circle-mapping, 905, 907 
Cantor function, 907 
Cantor set, 882, 883, 887 
cap, spherical, 158 
capacity, dimension, 883 
capacity, edge, 411 
Caratheodory conditions, 689 
Cardan angles, 214, 295 
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definition, 214 
direction cosines, 215 
quaternions, 298 
rotation matrix, 215, 295 
Cardano formula, 41 
cardinal number, 328, 335 
cardinality, set, 335 
cardioid, 99 
carrier function, 633 
Carson transformation, 769 
Cartesian 
coordinates 
plane, 190 
space, 210 
folium, 96 
cartography scale, 145 
cascade, period doubling, 901, 905 
Cassinian curve, 100 
category, second Baire category, 874 
catenary curve, 89, 107, 613 
catenoid, 89 
Cauchy 
integral, 650 
integral formula, 748 
integral formulas, application, 754 
method, differential equations of higher order, 555 
principal value, improper integral, 507, 510 
principle, 666 
problem, 572 
sequence, 665 
theorem, 443 
Cauchy-Riemann differential equations, partial, 732 
Cauchy-Riemann operator, 306 
Cayley 
table, 337 
theorem, 339, 408 
center 
circle, 198 
curvature, 248 
spherical, 176 
center manifold theorem 
differential equations, 892 
mappings, 897 
center of area method, 427 
center of gravity, 216 
arbitrary planar figure, 506 
arc segment, 505 
closed curve, 506 
double integral, 527 
line integral, first type, 517 
method, 426 
generalized, 427 
parametrized, 427 
trapezoid, 506 
triangle, 133 
triple integral, 532 
center of mass, 193, 216 
central 
angle, 131 
curves, 207 
field, 702 
surface, 224 
central limit theorem, Lindeberg—Levy, 825 


chain, 334, 404 
directed, elementary, 410 
graph, 410 
elementary, 410 
Markov, 826 
stationary, 826 
time-homogeneous, 826 
rule, 702 
composite function, 435 
stochastic, 825, 826 
chaos, 857, 904 
attractor, strange, 888 
from torus to chaos, 904 
one-dimensional mapping, 889 
routes to chaos, 892 
through intermittence, 905 
transition to chaos, 901 
character 
group element, 343 
representation of groups, 344 
characteristic (logarithm), 10 
characteristic strip, 573 
Chebyshev 
approximation, 988 
continuous, 988 
discrete, 991 
inequality, 31 
polynomial formula, 88 
polynomials, 989 
theorem, 489 
Chinese postman problem, 406 
Chinese remainder theorem, 379 
x? distribution, 822 
x? test, 835, 836 
Cholesky 
decomposition, 958 
method, 314, 958 
chord, 141 
theorem, 139 
circle 
Apollonius, 742 
area, 139 
center, 198 
chord, 139 
circumference, 139 
convergence, 750 
curvature, 248 
plane curve, 246 
dangerous, 150 
definition, 139, 198 
equation 
Cartesian coordinates, 198 
polar coordinates, 199 
great circle, 160, 174 
intersection, 160 
mapping, 906, 907 
parametric representation, 199 
periphery, 139 
plane, 139, 198 
radius, 139, 198 
small circle, 160, 176 
tangent, 139, 199 
circuit 
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directed, graph, 410 
Euler circuit, 405 
Hamilton circuit, 406 
circuit integral, 521 
being zero, 523 
circular 
field, 705 
point, 267 
sector, 141 
segment, 141 
circumcircle 
quadrangle, 136 
triangle, 133, 143 
circumscribing 
quadrangle, 137 
triangle, 133 
cissoid, 96 
Clairaut 
differential equation, ordinary, 546 
differential equation, partial, 574 
class 
defined by identities, 395 
equivalence class, 334 
midpoint, 834 
statistics, 832 
Clebsch-Gordan 
coefficient, 345 
series, 345 
theorem, 345 
Clifford algebra, 289 
Clifford numbers, 290 
closure 
closed linear, 674 
linear, 655 
set, 665 
transitive, 332 
clothoid, 107 
code, 383 
ASCII (American Standard Code for Information 
Interchange), 1001 
BCH, 386 
cyclic, 386 
error correcting, 385 
linear, 385 
public key, 392 
RSA Code, 392 
codimension, 892 
coding, 388, 392 
coefficient, 11 
Clebsch-Gordan, 345 
Fourier, 474 
leading, 38 
matrix, extended, 957 
metric, 186, 187 
vector decomposition, 183 
collinearity, vectors, 185 
collocation 
boundary, 978 
domain, 978 
method, 634, 974, 978 
points, 974, 978 
column pivoting, 957 
column sum criterion, 960 


combination, 805 
with repetition, 805 
without repetition, 805 
combinatorics, 805 
commensurability, 4 
commutative law 
Boolean algebra, 396 
matrices, 272, 273 
missing, quaternion multiplication, 290 
propositional logic, 324 
sets, 330 
vectors, 273 
commutativity 
groups, 353 
scalar product, 184 
commutator, 356, 367, 596 
comparable function, 617 
complement, 329 
algebraic, 278 
fuzzy, 421 
fuzzy set, 418 
orthogonal, 674, 682 
sets, 328 
Sugeno complement, 421 
Yager complement, 421 
complementary angles formulas, 80 
completeness relation, 676 
completion, metric space, 668 
complex analysis, 731 
complex function, 48 
pole, 733 
complex numbers, 34 
generalized, 290 
mapping, plane, 745 
plane, 34 
complex-valued function, 48 
complexification, 659 
composition, 424 
max-average, 424 
max-(t norm), 424 
compound interest, 22 
calculation, 22 
computation of adjustment, 982 
computer 
basic operations, 1005 
error of the method, 1007 
error types, 1004, 1006 
internal number representation, 1003 
internal symbol representation, 1001 
numerical accuracy, 1006 
numerical problems in calculation, 1004 
use of computers, 1001 
computer algebra systems, 1023 
differential and integral calculus, 1042 
graphics, 1045 
manipulation of algebraic expressions, 1036 
purpose, 1023 
computer graphic 
Mathematica, 1045 
quaternions, 302 
concave, curve, 246 
conchoid 
Nicomedes, 97 
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general, 98 
of the circle, 98 
of the line, 98 
conclusion, 425 
concurrent expressions, 398 
condition 
boundary value, ordinary differential equation, 540 
Caratheodory conditions, 689 
Cauchy (convergence), 123 
Dirichlet (convergence), 475 
initial value, ordinary differential equation, 540 
Kuhn-Tucker 
global, 925 
local, 925 
Lipschitz, higher-order differential equations, 551 
Lipschitz, ordinary differential equation, 541 
number, 959 
regularity condition, 941 
cone, 157, 225, 671 
central surface, 228 
circular, 157 
generating, 660 
imaginary, 228 
normal, 671 
regular, 671 
solid, 671 
truncated, 157 
vector space, 657 
confidence 
interval, 841 
regression coefficient, 841 
variance, 838 
limit 
for the mean, 837 
prescription, 853 
probability, 837 
region, 841 
congruence 
algebraic, 377 
corners, 152 
linear, 378 
method, 844 
plane figures, 134 
polynomial congruence, 380 
quadratic, 379 
relation, 394 
kernel, 394 
simultaneous linear, 379 
system simultaneous linear, 379 
theorems, 134 
congruent 
directly, 133 
indirectly, 134 
mapping, 133, 134 
conic section, 158, 206, 207 
singular, 207 
conjugate complex number, 36 
conjunction, 323 
elementary, 399 
consistency 
integration of differential equation, 972 
order p, 972 
constant 


Euler, 513 
polynomials, 62 
propositional, 323 
term, 308 
constants 
important natural, table, 1053 
mathematical, frequently used, 1053 
continuation, analytic, 751 
continued fractions, 3 
continuity 
composite functions, 61 
elementary function, 60 
from below, 694 
function 
one variable, 58 
several variables, 124 
Holder, 649 
continuous, absolutely, 697 
continuum, 335 
contour integral, vector field, 721 
contracting principle, 666 
contraction, 282 
tensor, 287 
transformation (geometric), 236 
contradiction, Boolean function, 397 
control by step size, 935 
control digit, 383 
convergence 
absolute, 462, 469 
complex terms, 750 
alternating series test of Leibniz, 463 
Banach space, 670 
circle of convergence, 750 
condition of Cauchy, 53 
conditional, 462 
complex terms, 750 
dominated, 697 
in mean, 475 
infinite series, complex terms, 749 
integration of differential equation, 972 
non-uniform, 468 
order p, 972 
sequence of numbers, 458 
complex terms, 749 
series, 460, 462 
complex terms, 749 
uniform, 468, 469 
uniformly, function sequences, 664 
weak, 687 
Weierstrass criterion, 468 
convergence criterion 
alternating series test of Leibniz, 463 
comparison criterion, 460 
D’Alembert’s ratio test, 461 
integral test of Cauchy, 462 
necessary, 460 
root test of Cauchy, 461 
sufficient, 460 
uniformely convergence, Weierstrass, 468 
convergence theorem, 459 
measurable function, 697 
conversion, number systems, 1002 
convex, curve, 246 
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convolution Soldner, 162 
Fourier transformation, 789 spherical, 211 
Laplace transformation, 773 vector field, 706 
one-sided, 789 transformation, 191 
two-sided, 789 between orthogonal coordinates, 212 
Z-transformation, 796 Cartesian to polar, 192 
coordinate inversion, 287 triangle coordinates, 982 
coordinate line, 210, 284 vector, 183 
coordinate surface, 210, 284 coprime, 5 
coordinate system corner 
cartesian, 210 convex, 152 
curvelinear, 210 figure, 152 
cylindrical polar, 211 symmetric, 152 
double logarithmic, 117 trihedral, 152 
Gauss-Krueger, 144 correction form, 961 
left-hand, 209 corrector, 971 
local, 234, 238 correlation, 839 
object, 229, 234 analysis, 839 
orientation, 209 coefficient, 840 
orthogonal, 181 empirical, 840 
orthonormal, 181 cosecant 
plane, 190 hyperbolic, 89 
projection, 238 trigonometric, 78 
right-hand, 209 geometric definition, 131 
semilogarithmic, 116 coset 
Soldner, 144 left, 338 
spatial, 209 right, 338 
spherical polar, 211 cosine 
transformation, 280 hyperbolic, 89 
world-, 234 geometric definition, 132 
coordinate transformation, 229, 286, 706 trigonometric, 77 
2-dimensional, 231 cosine law, 143 
3-dimensional, 234 geometric definition, 131 
equation law for sides, 166 
central curves second order, 207 rule, spherical triangle, 166 
quadratic curve (parabolic), 208 cosine integral, 513 
coordinates cotangent 
affine, 186 hyperbolic, 89 
axis, 190 trigonometric, 77 
backward, 890 geometric definition, 131 
barycentric, 982 Coulomb field, point-like charge, 705, 727 
Cartesian, 187 counterpoint, 160 
plane, 190 course angle, 161 
space, 210 covariance, two-dimensional distribution, 840 
contravariant, 188, 287 covering transformation, 336 
covariant, 188, 287 covering, open, 860 
curvilinear, 191, 261, 284 Cramer rule, 311 
three dimensional, 210 credit, 22 
cylindrical, 211 criterion 
vector field, 706 convergence 
Descartes, 190 sequence of numbers, 458 
equation, space curve, 259 series, 459 
forward, 890 series with positive terms, 460 
Gauss, 261 uniform, Weierstrass, 468 
Gauss-Kriiger, 162 divisibility, 373 
geodetic, 144 subspace, 367 
geographical, 162 cross product, 184 
homogeneous, 231, 355 cryptanalysis, classical, methods, 389 
mixed, 286 Kasiski-Friedman test, 390 
point, 190 statistical analysis, 390 
polar, plane, 191 cryptography, conventional 
polar, spherical, 211 methods, 388 





representation with scalar product, 188 linear substitution ciphers, 389 
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substitution, 388 
monoalphabetic, 388 
monographic, 388 
polyalphabetic, 388 
polygraphic, 388 

transposition, 388 

cryptology, 386 

classical 
Hill cypher method, 389 
matrix substitution method, 389 
Vigenere cypher method, 389 

cryptosystem, 387 

DES algorithm, 393 

Diffie-Hellman key exchange, 391 

encryption 
context free, 387 
context sensitive, 387 

IDEA algorithm, 393 

mathematical foundation, 387 

one-time pad, 390 

one-way function, 391 

public key methods, 391 

RSA method, 392 

security of cryptosystems, 388 

subject, 386 

crystal class, 350 
crystal system, 350 
crystallography 

lattice, 348 

symmetry group, 348 

cube, 154 
curl, 727 

density, 728 

field 
pure, 727 
zero-divergence field, 727 

line, 714 

curvature 

center, 248 

circle, 248 

curves on a surface, 265 

Gauss surface, 267 

mean, surface, 267 

minimal total curvature, 997 

plane curve, 246 

radius, 248 
curve on a surface, 265 
principal, 265 

space curve, 258 

surface, 265, 267 
constant curvature, 267 

total, 261 

curve 

algebraic, 95, 195 
n-th order, 252 

are cosine, 86 

arc cotangent, 86 

arc sine, 86 

arc tangent, 86 

Archimedean spiral, 105 

area cosine, 93 

area cotangent, 94 

area sine, 93 


curve (continued I) 


area tangent, 94 
astroid, 104 
asymptote, 249, 252 
asymptotic point, 250 
B-B representation, 1000 
cardioid, 99 
Cartesian folium, 96 
Cassinian, 100 
catenary, 107 
cissoid, 96 
clothoid, 107 
concave, 246 
conchoid of Nicomedes, 97 
convex, 246 
corner point, 250 
cosecant, 78 
cosine, 77 
cotangent, 77 
curvature, 246 
cuspidal point, 250 
cycloid, 101, 102 
damped oscillation, 85 
directing, 156 
double point, 250 
empirical, 108 
envelope, 254 
epicycloid, 102 
epitrochoid, 104 
equation 
complex form, 760 
plane, 195, 243 
second degree, 206 
second order, 206 
space, 218 
error curve, 73 
evolute, 254 
evolvent, 254, 255 
of the circle, 106 
exponential, 72 
fourth order, 97 
Gauss error curve, 819 
general discussion, 253 
hyperbolic cosine, 89 
hyperbolic cotangent, 90 
hyperbolic sine, 89 
hyperbolic tangent, 90 
hyperbolic type, 70, 72 
hyperbolice spiral, 105 
hypocycloid, 103 
hypotrochoid, 104 
imaginary, 195 
inflection point, 249 
involute, 255 
isolated point, 250 
Koch curve, 883 
lemniscate, 101 
length, line integral, first type, 517 
logarithmic, 73 
logarithmic spiral, 106 
Lorentz curve, 95 
multiple point, 251 
normal curve, plane, 244 
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curve (continued II) 
n-th degree, 65, 195 
n-th order, 65, 195 
parabolic type, 65 
Pascal limaçon, 98 
plane, 243 
direction, 243 
vertex, 250 
quadratic, 206 
radius of curvature, 246, 247 
representation with splines, 996 
secant, 78 
second degree, 206 
second order, 206 
semicubic parabola, 95 
sine, 77 
space, 256 
spherical, 160, 174, 702 
spiral, 105 
strophoide, 97 
tacnode, 250 
tangent, 77 
plane, 244 
terminal point, 250 
third degree, 67 
third order, 95 
tractrix, 108 
transcendent, 195 
trochoid, 102 
witch of Agnesi, 95 


curves 
family of, envelope, 255 
plane 
angle, 246 


second order 
central curves, 207 
parabolic curves, 208 
polar equation, 208 
spherical, intersection point, 180 
cut, 329 
Dedekind’s, 334 
fuzzy sets, 417 
set, 329 
cutting plane method, 942 
cycle, 404 
chain, 410 
limit, 895 
cycloid, 101 
basis, 101 
common, 101 
congruent, 101 
curtate, 102 
prolate, 102 
cylinder, 156, 227 
circular, 156 
elliptic, 227 
hollow, 157 
hyperbolic, 227 
invariant signs, 229 
parabolic, 227 
cylindrical coordinates, 211 
cylindrical function, 562 
cylindrical surface, 156, 218 


d'Alembert formula, 591 
damping parameter, 962 
damping, oscillations, 85 
Darboux vector, 261 
data type, 394 
De Morgan 
law, 330 
rule, 324 
Boolean algebra, 396 
death process, 829 
debt, 23 
decay, radioactive, 829 
decimal 
number, 1002 
normalized, 1005 
representation, 1002 
system, 1002 
decoding, 392 
decomposition 
orthogonal, 674 
partial fraction, 778 
partial fractions, 15 
QR, matrix, 959 
singular value, 321 


theorem, higher order differential equations, 554 


triangle, matrix, 955 
vectors, 183 
decyphering, 392 
Dedekind cut, 334 
defect, 974, 978 
vector space, 367 
definite 
negative, 317 
positive, 317, 958 
defuzzification, 426 
degeneracy of states, 598 
degree 
curve, second degree, 206 
curve, n-th degree, 195 
homogeneity, 122 
in-degree, 401 
matrix, 408 
measure in degrees, 131 
out-degree, 401 
Delambre equations, 168 
6 distribution, 699 
6 function, 694, 699 
application, 777 
approximation, 777 
Dirac, 774 
6 functional, 681 
density function, 812 
multidimensional, 814 
dependence, linear, 308, 366 
deposit 
in the course of the year, 22 
regular, 22 
single, 22 
depreciation, 26 
arithmetically declining, 26 
digital, 27 
geometrically declining, 27 
straight-line, 26 
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derivative 


complex function, 731 
constant, 433 
directional, 708 
scalar field, 708 
vector field, 708 
distribution, 700 
exterior, 435 
fraction, 435 
Fréchet, 690 
function 
composite, 435 
elementary, 433 
implicit, 436 
inverse, 436 


conjugate 
ellipse, 200 
hyperbola, 203 

ellipse, 200 

hyperbola, 203 


diffeomorphism, 858 


Anosov, 889 
orientation-preserving, 906, 907 


difference 





bounded, 419, 420 
finite expression, 973 
sets, 330 

symmetric, 330 
significant, 839 
Z-transformation, 796 


parametric form, 437 difference equation, 794 
several variable, 445 boundary value, 800 
generalized, 699 linear, 798 
higher order, 438, 448 partial differential equations, 976 
inverse function, 440 second-order, 799, 800 
parametric form, 440 difference method, 973 
interior, 435 partial differential equations, 976 
left-hand, 433 difference quotient, 963 
logarithmic, 435 difference schema, 18 
mixed, 448 differentiability 
one variable, 432 complex function, 731 
partial, 445 function of one variable, 432 
product, 434 function of several variables, 447 
quotient, 435 with respect to the initial conditions, 857 
right-hand, 433 differentiable 
scalar multiple, 433 continuously, 690 
Sobolev sense, 699 Fréchet, 690 
space, 708 differential 
sum, 433 arc 
table, 434 plane, 243 
vector function, 701 surface, 263 
volume differentiation, 709 complete, 447 
Derive (computer algebra system), 1023 first-order, 433 
Descartes rule, 45 higher order, 447 
descendant (offspring), 933 integrability, 521 
descent method, 931 notion, 446 
determinant, 278 partial, 447 





differentiation, 279 

evaluation, 279 

functional, 123, 691 

Jacobian, 123, 691 

multiplication, 279 

reflection, 279 

rotation, transformation properties, 214 
rules of calculation, 278 

Wronskian, 553, 862 

zero value, 279 


determination of extrema 


absolute extremum, 445 
implicit function, 445 


principal properties, 447 
quotient (see also derivative), 432 
second-order, 449 
total, 447, 448 

n-th order, 449 

second-order, 449 


differential calculus, 432 


fundamental theorem, 441 


differential equation, 540 


boundary value problem, 569 
eigenfunction, 569 
eigenvalue, 569 

eigenvalues, 597 


deviation, standard deviation, 813 flow, 857 
devil’s staircase, 907 Fourier transformation, 791 
diagonal matrix, 270 Laplace transformation, 781 
diagonal method, Maxwell, 744 numerical solution, 550 
diagonal strategy, 957 operational notation, 555 
diameter order, 540 

parabola, 205 orthogonality, 570 

circle, 140 Riccati 
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normal form, 544 notion, 540 
ordinary, 544 particular solution, 540 
self-adjoint, 569 point, singular, 546 
singular solution, 542 radicals, 546 
stiff, 973 ratio of arbitrary functions, 548 
topological equivalence, 870 separation of variables, 542 
Weber, 601 series expansion, 549 
differential equation, linear singular point, 547 
autonomous, on the torus, 863 solution, 540 
constant coefficients, 555 successive approximation, 549 
first-order, 861 van der Pol, 895 
fundamental theorem, 861 differential equation, partial, 571 
homogeneous, 861 approximate integration, 976 
inhomogeneous, 861 Cauchy-Riemann, 584, 732 
matrix-differential equation, 861 characteristic system, 572 
non-autonomous, on the torus, 906 Clairaut, 574 
periodic coefficients, 863 completely integrable, 576 
second order, 560 eigenfunction, 598 
Bessel, 562 electric circuit, 585 
Hermite, 568 elliptic type, 576 
hypergeometric, 567 constant coefficients, 578 
Laguerre, 568 Euler, calculus of variations, 612 
Legendre, 565 field theory, 729 
method of unknown coefficients, 560 first-order, 571 
second-order system, 560 canonical systems, 573 
differential equation, ordinary, 540 characteristic strip, 573 
approximate integration, 969 characteristic system, 571 
autonomous, 860 linear, 571 
Bernoulli, 544 non-linear, 573 
boundary value conditions, 540 non-linear, complete integral, 573 
boundary value problem, 540 total differentials, 576 
center, 548 two variables, 575 
central point, 548 Fourier transformation, 792 
Clairaut, 546 Hamilton, 861, 903 
direction field, 540, 541 heat conduction equation 
element, singular, 546 one-dimensional, 583, 585 
exact, 542 three-dimensional, 591 
existence theorem, 540 Helmholtz, 594 
explicit, 540 hyperbolic type, 576, 578 
first-order, 540 integral surface, 572 
approximation methods, 549 Laplace, 729 
important solution methods, 542 Laplace transformation, 783 
flow, 857 linear, 571 
fraction of linear functions, 547 longitudinal vibrational bar, 580 
general solution, 540 normal system, 574 
general, n-th order, 540 notion, 540 
graphical solution, 550 parabolic type, 576 
homogeneous, 542 constant coefficients, 578 
implicit, 540, 545 Poisson, 592, 726, 729 
initial value conditions, 540 quasilinear, 571 
initial value problem, 540 reduced, 893 
integral, 540 reduced, normal form, 893 
integral curve, 541 second-order, 576 
integral, singular, 546 constant coefficients, 578 
integrating factor, 543 separation of variables, 597 
Lagrange, 545 ultra-hyperbolic type, constant coefficients, 578 
linear, 792 vibrating membrane, 581 
constant coefficients, 781 vibrating string, 579 
first-order, 543 differential equations, higher order, 550 
variable coefficient, 782 constant coefficients, 553 
linear, planar, 857, 871 decomposition theorem, 554 
multiplier, 543 Euler, n-th order, 557 


non-autonomous, 860 fundamental system, 553 
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linear, n-th order, 553 
lowering the order, 552, 554 
quadrature, 554 
reduction to a system of differential equations, 550 
superposition principle, 554 
system of solutions, 551 
variation of constants, 554 
differential equations, linear 
second order 
defining equation, 561 
differential equations, partial, non-linear, 603 
first-order, 573 
non-linear waves, 604 
pattern, periodic, 604 
Schroedinger, 606 
differential operation 
review, 717 
vector components, 718 
differential operator 
divergence, 712, 719 
gradient, 710, 719 
Laplace, 716, 719 
nabla, 715, 719 
non-linear, 690 
relations, 719 
rotation, 713, 719 
rules of calculations, 717 
space, 708, 715 
vector gradient, 711 
differential transformation, affine, 734 
differentiation, 432 
complex function, 731 
composite function, 450 
function 
elementary, 433 
implicit, 436 
inverse, 436 
one variable, 432 
parametric form, 437 
several variables, 445 
graphical, 437 
higher order 
inverse function, 440 
parametric form, 440 
implicit function, 451 
logarithmic, 435 
matrix, 276 
several variables, 450 
under the integration sign, 512 
volume, 709 
differentiation rules 
basic rules, 433 
derivative of higher order, 438 
function 
one variable, 433 
several variables, 433, 448 
table, 439 
vector function, 701 
vectors, 701 
diffusion coefficient, 592 
diffusion equation, 609 
diffusion equation, three-dimensional, 591 
digon, spherical, 163 


dihedral angle, 152 
dihedral group, 336 
dimension, 882 
capacity, 883 
correlation, 885 
defined by invariant measures, 884 
Douady-Oesterlé, 886 
embedding, 890 
formula, 367 
generalized, 885 
Hausdorff, 882, 886 
information, 884 
Lie algebra, 353 
lower point-wise, 884 
Lyapunov, 885 
matrix-Lie group, 353 
measure, 884 
metric, 882 
Renyi, 885 
upper point-wise, 884 
vector space, 366, 657 
Dirac 
distribution, 699 
matrices, 290 
measure, 694, 876 
operator, 306 
theorem, 406 
directing curve, 156 
direction 
cosines, space, 212 
plane curve, 243 
projection, 237 
space, 212 
space curve, 256 
direction cosines 
Cardan angles, 215 
Euler angles, 216 
directional 
angle, 146 
derivative, 710 
directrix 
ellipse, 200 
hyperbola, 202 
parabola, 204 
Dirichlet 
condition, 475 
problem, 582, 729 
discontinuity, 58 
function, 58 
removable, 59 
discount, 21 
discretization error 
global, 972 
local, 972 
discretization step interval, 963 
discriminant, 576 
disjoint, 329 
disjunction, 323 
elementary, 399 
dispersion, 813 
dissolving torus, 904 
distance, 504 
Hamming distance, 662 
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line-point, 196 complex numbers, 37 
metric space, 662 computer calculation, 1005 
planes, 221 external, 193 
parallel, 221 golden section, 194 
point-line, space, 222 harmonic, 193 
point-plane, space, 220 in extreme and mean ratio, 194 
spherical, 160 internal, 193 
two lines, space, 223 line segment, plane, 193 
two points, 192 polynomial, 14 
space, 217 ring, 363 
distribution, 698, 699, 777 quaternions, 293 
binomial, 815 rational numbers, 1 
x, 822 segment, space, 217 
x?, table, 1135 divisor, 370 
continuous, 818 greatest common (gcd) 
derivative, 699 integer numbers, 373 
Dirac, 699 linear combination, 374 
discrete, 814 polynomials, 14 
exponential, 820 positive, 372 
Fisher, 823 zero, 361 
frequency, 833 dodecahedron, 156 
function, 811 domain, 119 
function, continuous, 812 closed, 119 
hypergeometric, 814, 816 convergence, function series, 467 
logarithmic normal, 819 doubly-connected, 119 
lognormal distribution, 819 function, 48 
marginal, 814 image, set, 49 
measurement error density, 848 integrity, 361 
normal, 818 multiply-connected, 119 
Poisson, 817 non-connected, 119 
regular, 699 of attraction, 860 
standard normal, 819 of individuals, predicate calculus, 326 
Student, 824 open, 119 
t distribution, 824 operator, 658 
telephon-call, 829 set, 49 
theory, 774 simply-connected, 119, 747 
Weibull, 821 three or multidimensional, 120 
distributive law two-dimensional, 119 
Boolean algebra, 396 values, set, 49 
matrices, 272 dot product, 184 
propositional logic, 324 double integral, 524 
ring, field, 361 application, 527 
sets, 330 notion, 524 
tensors, 282 double line, 207 
distributivity duality, 396 
vector product, 185 linear programming, 919 
left sided, 368 non-linear optimization, 926 
right sided, 368 principle, Boolean algebra, 396 
divergence, 726 theorem, strong, 926 
central field, 713 dualization, 396 
definition, 712 Duhamel formula, 782 
different coordinates, 712 dynamical system, 857 
improper, 458 continuous, 857 
proper, 458 discrete, 858, 871 
remark, 709 reconstruction space, 889 
sequence of numbers, 458 time discrete, 857 
series, 462 volume contracting, 858 
theorem, 724 volume pressing, 858 
vector components, 718 
vector field, 712 eccentricity, numerical 
divisibility, 370 curve second order, 207 
criteria, 372, 373 ellipse, 199 
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parabola, 204 
edge 
angle, 152 
figure, 152 
graph, 401 
length, 403 
valuation, 403 
multiple, 401 
sequence, 404 
cycle, 404 
directed circuit, 404 
isolated edge, 404 
open, 404 
path, 404 
effective rate, 22 
eigenfunction, 598 
differential equation, 569 
integral equation, 623, 628 
normalized, 570 
eigenvalue, 314, 597 
differential equation, 569 
integral equation, 623, 628 
numerical calculation, 319 
operator, 680 
eigenvalue problem 
general, 314 
matrices, 314 
special, 314 
eigenvector, 283, 314, 680 
Einstein's summation convention, 280 
element, 327 
finite, 979 
generic, 874 
linearly independent, 656 
neutral, 335 
positive, 659 
ement of area, plane, table, 527 
ement of surface, curved, table, 534 
ement of volume, table, 532 
ementary cell 
crystal lattice, 348 
non-primitive, 349 
primitive, 349 
ementary formula, predicate logic, 326 
ementary surface, parametric form, 534 
ementary volume 
arbitrary coordinates, 531 
Cartesian coordinates, 529 
cylindrical coordinates, 530 
spherical coordinates, 530 
ements of curved surfaces, 534 
ements of plane surfaces, 527 
limination method, Gauss, 312, 955 
limination step, 956 
llipse, 199 
arc, 201 
area, 201 
diameter, 200 
equation, 199 
focal properties, 200 
focus, 199 
perimeter, 201 
radius of curvature, 200 
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semifocal chord, 199 
tangent, 200 
transformation, 207 
vertex, 199 
ellipsoid, 224 
central surface, 228 
cigar form, 224 
imaginary, 228 
lens form, 224 
of revolution, 224 
surface second order, 228 
embedding, 890 
canonical, 684 
dimension, 890 
encoding schemes, 381 
encoding, RSA code, 392 
encyphering, 386 
endomorphism, 659 
endomorphism, linear operators, 659 
endpoint, 401 
energy 
particle, 594 
spectrum, 594 
system, 593 
zero-point translational energy, 598 
zero-point vibration energy, 603 
entropy, 879, 880 
generalized, 885 
metric, 879 
topological, 879, 889 
envelope, 254, 255 
epicycloid, 102 
curtate, 104 
prolate, 104 
epitrochoid, 104 
equality 
asymptotic, 472 
complex numbers, 35 
matrices, 272 
equality relation, 10 
equation, 10 
algebraic, 38 
biquadratic, 42 
Boussinesq, 609 
Burgers, 609 
characteristic, 315, 547 
cubic, 64 
curve 
plane, 195, 243 
second degree, 206 
second order, 206 
defining, 561 
degree, 38 
degree one, 39 
diffusion, three-dimensional, 592 
Diophantine, 375 
linear, 376 
ellipse, 199 
evolution, 605 
heat conduction 
three-dimensional, 591 
Hirota, 609 
homogeneous, 687 
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equation (continued) Neper, 169 
hyperbola, 202 plane, 218 
inhomogeneous, 687 real coefficients, 44 
irrational, 39 reducing transcendental to algebraic, 45 
Kadomzev-Pedviashwili, 609 transcendental, 38 
Korteweg de Vries (KdV), 604, 605 trigonometric, solution, 46 
modfied, 608 equilibrium point, 857 
Kuramoto-Sivashinsky (KS), 604 hyperbolic, 864 
line in a plane, 195 equivalence, 323 
line in space, 221 class, 334 
logistic, 858, 899 logic, 324 
non-linear proof, 5 
fixed point, 949 relation, 333 
numerical solution, 949 Eratosthenes sieve, 370 
non-linear evolution, 605 ergodic system, Birkhoff theorem, 877 
non-linear Schroedinger (NLS), 606 ergodic theory, dynamical systems, 877 
normal form, 38 error 
operator equation, 687 absolute, 852, 1004 
parabola, 205 absolute maximum error, 852 
Parseval, 475, 570, 676 apparent, 851 
pendulum, 904 average, 849-852 
plane computer calculation, 1006 
curve, 195 defined, 853 
general, 218 density function, 848 
Hessian normal form, 219 discretization, 1007 
intercept form, 219 equation, 958 
polynomial estimation, iteration method, 962 
numerical solution, 952 input error, 1006 
quadratic, 64 least mean squares, 475 
root, definition, 38 mean, 849, 851 
Schroedinger square, 851, 852 
linear, 592 normally distributed, 849 
non-linear (NLS), 604 percentage, 852 
sine-Gordon (SG), 607 probable, 849-852 
Sinh-Gordon, 609 propagation, 854 
solution, general, 38 relations between error types, 850 
space curve, 218, 256 relative, 852, 1004 
vector form, 256 relative maximum error, 853 
sphere, 261 round-off, 1007 
surface single measurement, 850 
normal form, 224 standard, 851 
second order, 228 true, 850 
space, 217, 261 truncation error, 1007 
surface normal, 264 type 1 error rate, 813 
tangent plane, 264 type, measurement errors, 848 
term algebra, 395 error analysis, 856 
vector, 188 error calculus 
wave, three-dimensional, 590 direct problem, 1006 
equations inverse problem, 1006 
algebraic, 38 error curve, 73 
general properties, 43 error estimation, mean value theorem, 442 
degree four, 42 error function, 514 
degree one (linear), 39 Gauss, 819 
degree three (cubic), 40 error integral, Gauss, 514 
degree two (quadratic), 40 error orthogonality, 974 
degree n, 43 error propagation law, Gauss, 855 
solution, 45 error types, computer calculation, 1006 
Delambre, 168 ess. sup (essential supremum), 697 
exponential, solution, 46 estimate, 831 
hyperbolic functions, solution, 47 Euclidean 
L'Huilier, 169 algorithm, 3, 14 
logarithmic, solution, 46 polynomial rings, 363 





Mollweide, 143 group, 354 
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scaled, 354 
norm, 367 


R? (3 dimensional vector space), 289 


R* (4 dimensional vector space), 289 
vector norm, 276 
vector space, 367 
Euler 
angles, 215 
broken line method, 969 
circuit, 405 
constant, 513 
differential equation 
n-th order, 557 
variational calculus, 612 
formula (curvature of a surface), 265 
formulas (Fourier representation), 474 
function (theory of numbers), 381 
graph, 405 
integral, first kind, 1098 
integral, second kind, 512, 1098 
numbers, 466 
polygonal method, 969 
relation, 36 
complex numbers, 758 
theorem (polyeder), 155 
trail, 405 
open, 405 
Euler angles, 296 
definition, 215 
direction cosines, 216 
rotation matrix, 216, 296 
Euler-Hierholzer theorem, 405 
event, 807 
certain, 807 
complete system of, 808, 810 
elementary, 807 
impossible, 807 
independent, 810 
random, 807 
set of events, 807 
simple, 807 
evolute, 254 
evolution 
equation, 605 
function, 605 
evolution algorithms, 933 
evolution principles, 933 
evolution strategies, 933 
algorithms, 933 
application, 934 
classification, 934 
from some populations, 936 
(uw +A) strategy, 935 
(uw, A) strategy, 935 
mutation, 933 
mutation-selection 
mutation step, 934 
selection step, 935 
step size determination, 935 
strategy, 934 
populations, 935 
recombination, 933 


selection, 933 
pressure, 936 
with recombination, 936 
evolvent, 254, 255 
of circle, 106 
excess, spherical triangle, 165 
exchange theorem, 448 
exchange, cyclic, sides and angles, 142 
excluded middle, 324 
existential quantifier, 326 
expansion 
Fourier expansion, forms, 477 
Laplace expansion, 278 
Laurent expansion, 752 
Maclaurin, 472 
Taylor, 442, 471 
expectation, 813 
expectation value (quantum mechanical), 595 
expected value, 813 
bivariate distribution, 840 
two-dimensional distribution, 840 
exponent, 7 
exponential distribution, 820 
exponential function, 72 
complex, 739 
general, 759 
natural, 758 
quaternions, 293 
exponential integral, 514 
exponential sum, 74 
expression 
algebraic, 10 
analytic, 49 
domain, 49 
explicit form, 49 
implicit form, 49 
parametric form, 50 
Boolean, 397 
concurrent, 398 
explicit form, 49 
finite, 973 
partial differential equations, 976 
implicit form, 49 
integral rational, 11 
irrational, 11, 17 
parametric form, 50 
propositional logic, 323 
rational, 11, 14 
semantically equivalent, 398 
tautology, 325 
transcendent, 11 
vector analysis, 718 
extension field, 362 
extension principle, 421 
extension theorem of Hahnium, 683 
extension, algebraic, 364 
extension, linear functional, 682 
extraction of the root 
complex numbers, 38 
real numbers, 8 
extrapolation principle, 967 
extremal, radius of curvature, 616 
extreme value, 51, 443 
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absolute, 443 

determination, 444 
higher derivatives, 444 
side conditions, 456 
sign change, 444 

function, 454 

relative, 443 


face, corner, 152 
factor algebra, 394 
factor group, 339 
factor ring, 363 
factorial, 13 
generalization of the notion, 515 
factoring out, 11 
Falk scheme, 273 
Feigenbaum constant, 900, 901 
FEM (finite element method), 978 
FFT (fast Fourier transformation), 993 
Fibonacci 
explicite formula, 375 
numbers, 375, 908 
recursion formula, 375 
sequence, 375 
field, 361 
algebraically closed, 364 
axial field, 703 
central symmetric, 702 
circular, 705 
complex numbers, 364 
conservative, 721 
Coulomb field, point-like charge, 705, 727 
cyclotomic, 364 
cylinder symmetric, 703 
extension field, 363 
extensions, 362 
finite, 363 
flow, 723 
function, 741 
Galois, 363 
gravitational field, point mass, 728 
Newton field, point-like mass, 705 
over-field, 362 
pole, 703 
potential, 721 
scalar field, 702 
skew field, 361 
rings, 361 
source field, 726 
spherical field, 702 
splitting 
polynomial, 363 
vector field, 701 
field theory 
basic notions, 701 
differential equations, partial, 729 
fields, superposition, 728 
finite difference method, 588 
finite element method, 588, 978 
fitting problem 
different versions, 456 
linear, 958 
non-linear, 109 


fixed point 
conformal mapping, 735, 736 
flip bifurcation, 899 
fixed-point number, 1003 
floating-point number, 1003 
IEEE standard, 1004 
Mathematica, 1025 
semilogarithmic form, 1003 
Floquet 
representation, 865 
theorem, 863 
flow 
differential equation, 857 
edge, 411 
scalar field, 723 
vector field 
scalar flow, 723 
vector flow, 723 
fluctuation 
random, 825 
focal line, tractrix, 108 
focus, 869 
compound, 894 
ellipse, 199 
hyperbola, 201 
parabola, 204 
saddle, 864 
saddle focus, 869 
stable, 864 
form 
bilinear, 368 
negative definite, 317 
normal 
generating, 318 
Jordan, 319 
positive definite, 317 
quadratic, 958 
real positive definite, 318 
transformation, 317 
transformation, principal axis, 317 
real quadratic, 317 
saddle, 267 
semidefinite, 317 
sesquilinear, 369 
formula 
binomial, 12 
Cardano, 41 
closed, predicate logic, 326 
d'Alembert, 591 
Duhamel, 782 
Euler (curvature of a surface), 265 
Heron's, 144 
interpretation, predicate logic, 326 
Kasterinian, 565 
Kirchhoff, 590 
Leibniz, 438 
Liouville, 554, 862 
manipulation, 1023 
de Moivre 
complex number, 38 
generalized, 293 
hyperbolic functions, 92 
quaternions, 293 
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formula (continued) 
trigonometric functions, 81 
Parseval, 789 
Pesin, 881, 888 
Plemelj and Sochozki, 651 
Poisson, 591 
predicate logic, 326 
Rayleigh, 565 
rectangular, 964 
Riemann, 584 
Simpson, 965 
Stirling, 515 
tangent, 143 
tautology, 327 
Taylor 
one variable, 442 
several variables, 450 
trapezoidal, 964 
formulas 
basic, plane trigonometry, 142 
Darboux, 261 
Euler (Fourier representation), 474 
Frenet, 261 
half-angle (plane trigonometry), 143 
half-angle (spherical trigonometry), 166 
predicate logic, 326 
four group, Klein’s, 339 
four vector 
homogeneous coordinates, 355 
quaternions, 290 
Fourier analysis, 474 
Fourier coefficient, 474, 992 
asymptotic behavior, 475 
determination, 456 
numerical methods, 477 
Fourier expansion, 474 
complex functions, 479 
forms, 477 
symmetries, 476 
Fourier integral, 478, 784 
complex representation, 784 
equivalent representations, 784 
Fourier series, 474 
best approximation, 675 
complex representation, 475 
Hilbert space, 675 
Bessel inequality, 676 
Fourier sum, 475, 992 
complex representation, 993 
Fourier transformation, 784 
addition law, 788 
comparing to Laplace transformation, 790 
convolution, 789 
two-sided, 773 
definition, 785 
differentiation 
image space, 788 
original space, 788 
discrete complex, 993 
fast, 993 
Fourier cosine transformation, 786 
Fourier exponential transformation, 786 
Fourier sine transformation, 786 


frequency-shift theorem, 788 
integration 
image space, 788 
original space, 789 
inverse, 785 
linearity law, 788 
localization property, no, 801 
shifting theorem, 788 
similarity law, 788 
spectral interpretation, 786 
survey, 769 
tables, 786 
transform, 790 
fractal, 882 
fractile, 812 
fraction 
continued, 3 
decimal, 1 
improper, 15 
proper, 15 
fractional part of x, 50 
frames (wavelet transformations), 803 
Fréchet 
derivative, 690 
differential, 690 
Fredholm 
alternative, 687 
alternative theorem, 629 
integral equation, 621, 667 
first kind, 635 
solution method, 627 
theorems, 627 
Frenet formulas, 261 
frequency, 77 
angular/radial, 84 
distribution, 833 
locking, 907 
relative, 808 
sine, 84 
spectrum, 787 
continuous, 478 
discrete, 478 
statistics, 808, 833 
cumulative, 833 
Fresnel integral, 757 
frustum 
cone, 157 
pyramid, 154 
function, 48 
algebraic, 62 
complex, 758 
amplitude function, elliptic, 763 
analytic, 732 
analytic representation, 49 
explicite form, 49 
implicit form, 49 
parametric form, 50 
area, 93 
arrow, 331 
autocorrelation, 878 
Bessel, 562 
modified, 563 
beta function, 1098 
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function (continued I) 


Boolean, 324, 397 
bounded, 51 
circular, 131 
comparable, 617 
complement, 421 
complex, 48, 731 
algebraic, 758 
bounded, 733 
linear, 735 
linear fractional, 736 
quadratic, 737 
square root, 737 
complex-valued, 48 
complex-variable, 731 
composite, 63 
derivative, 435 
intermediate variable, 435 
continuity 
in interval, 58 
one-sided, 58 
piecewise, 58 
continuous, complex, 731 
cosecant, 78 
cosine, 77 
cotangent, 77 
cyclometric, 85 
cylindrical, 562 
density, 812 
dependent, 122 
discontinuity, 58 
removable, 59 
discrete, 794 
distribution, 811 
double periodic, 763 
elementary, 62 
elementary, transcendental, 758 
elliptic, 490, 753, 763 
entire rational, 62 
error function, 514 
Euler (theory of numbers), 381 
even, 51 
exponential, 62, 72 
complex, 739 
natural, 758 
quaternions, 293 
extreme value, 51 
function series, 467 
gamma, 514 
generalized, 698, 699, 777 
Green, 586 
Hamilton, 574, 861 
harmonic, 729, 732 
Heaviside, 700 
Hermite, 674 
holomorphic, 732 
homogeneous, 122 
homographic, 62, 66 
hyperbolic, 89, 759 
geometric definition, 131, 132 
quaternions, 294 
impulse, 774 
increment, 972 


function (continued II) 


independent, 122 
integrable, 494 
absolutely, 508, 511 
integral rational, 62 
first degree, 63 
n-th degree, 65 
second degree, 64 
third degree, 64 
inverse, 52 
complex, hyperbolic, 759 
complex, trigonometric, 759 
derivative, 436 
derivative of higher order, 440 
existence, 61 
hyperbolic, 93 
trigonometric, 63, 85 
irrational, 62, 71 
Jacobian, 763 
Lagrangian, 925 
Laguerre, 674 
Laplace, 729 
limit, 53 
at infinity, 54 
infinity, 53 
iterated, 124 
left-hand, 54 
right-hand, 54 
Taylor expansion, 57 
theorems, 55 
linear, 62, 63 
linear fractional, 62, 66 
local summable, 698 
logarithm, complex, 738 
logarithmic, 63, 73 
quaternions, 294 
MacDonald, 563 
matrix-exponential, 352, 862 
mean value, 497 
measurable, 695 
measuring, 889 
meromorphic, 753, 763, 779 
monotone 
decreasing, 50 
increasing, 50 
strictly, 50 
non-elementary, 62 
notion, 48 
objective 
linear programming, 909 
selection, 933 
odd, 51 
of angle, 76 
one variable, 48 
order of magnitude, 57 
parametric form 
derivative, 437 
derivative of higher order, 440 
periodic, 52, 776 
piecewise continuous, 58 
point of discontinuity, 58 
finite jump, 59 
tending to infinity, 59 
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function (continued III) 
positive homogeneous, 616 
power, 71 
primitive, 480 
pure, 1031 
quadratic, 62 
random variable, 811 
rational, 62, 66 
real, 48 
regular, 732 
Riemann, 584 
sample function, 831 
secant, 78 
several variables, 48, 118 
sign of, 50 
sine, 76 
special fractional linear fractional, 66 
state, 594 
statistic, 831 
step, 794 
stream, 741 
sum of linear and fractional linear functions, 738 
summable, 696 
switch, 399 
system 
orthogonal, 985 
orthonormal, 985 
tangent, 77 
theta, 764 
transcendental, 62 
trigonometric, 63, 76, 759 
geometric definition, 131 
quaternions, 293 
truth, 323, 324 
wave, statistical interpretation, 594 
Weber, 562 
Weierstrass, 765 
function theory, 731 
functional, 611, 677 
definition, 48 
linear, 366, 658, 659 
linear continuous, 681 
L? space, 682 
functional determinant, 123, 285, 526 
functions 
analytic, 732 
Bessel 
1. kind, 564 
2. kind, 564 
table, 1106 
Bessel, complex argument, 564 
Bessel, spherical, 564 
complex argument, 564 
Neumann, spherical, 564, 565 
spherical, 564 
fundamental form 
first quadratic, of a surface, 263 
second quadratic, of a surface, 266 
fundamental formulas 
spherical trigonometry, 165 
fundamental laws, set algebra, 329 
fundamental matrix, 862-864 
fundamental problem 


first, triangulation, 148 
second, triangulation, 149 
fundamental space, 699 
fundamental system 
differential equation, higher order, 553 
fundamental theorem 
Abelian groups, 339 
algebra, 43 
elementary number theory, 371 
integral calculus, 495 
future value, 25 
fuzzy 
control, 427 
inference, 425 
linguistics, 414 
logic, 413 
logical inferences, 425 
product relation, 424 
relation, 422 
relation matrix, 423 
system, 430 
systems, applications, 427 
valuation, 422 
fuzzy set 
aggregation, 418 
aggregation operator, 423 
complement, 418 
composition, 424 
cut (representation theorem), 417 
degree, 417 
empty, 416 
intersection, 418 
intersection set, 419 
level set, 417 
normal, 417 
peak, 416 
similarity, 417 
subnormal, 417 
subset, 416 
support, 413 
tolerance interval, 416 
union, 418 
universal, 416 
fuzzy sets 
cut, 417 
intersection, 419 
union, 419 


Gabor transformation, 803 
Galerkin method, 974 
Galois field, 363 
gamma function, 512, 514 
Gauss 
algorithm, 312, 956 
coordinates, 261 
curvature, surface, 267 
elimination method, 312, 955 
error curve, 819 
error function, 819 
error integral, 514 
error propagation law, 855 
integral formula, 725 
integral theorem, 724 
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least squares method, 456, 959, 984 
plane, 34 
step, 312 
transformation, 313, 843, 959 
Gauss-Krüger coordinates, 162 
Gauss-Newton method, 962 
derivative free, 963 
Gauss-Seidel method, 960 
gcd (greatest common divisor), 374 
gcd and lem, relation between, 375 
generating line, 156 
generator, 156 
ruled surface, 226 
generizity 
metric, 891 
geodesic line, 268 
geodesy 
angle, 146 
coordinates, 144 
polar coordinates, 144 
geometric sequence, 19 
geometry, 129 
analytical, 181 
plane, 190 
projections, 237 
space, 209 
transformatians, 229 
vector algebra, 181 
differential, 243 
plane curves, 243 
space curves, 256 
surfaces, 261 
plane, 129 
plane trigonometry, 142 
spherical trigonometry, 160 
stereometry, 151 
Gimbal Lock case, 295 
Girard theorem, 165 
golden section, 2, 4, 194, 908 
gon, 146 
grades, 131 
gradient 
definition, 710 
different coordinates, 710 
remark, 709 
scalar field, 710 
vector components, 718 
vector gradient, 711, 716 
Graeffe method, 954 
Gram-Schmidt, orthogonalization process, 316 
graph 
alternating way, 409 
arc, 401 
bipartite, 402 
complete, 402 
complete bipartite, 402 
components, 404 
connected, 404, 410 
cycle, 410 
directed, 401 
edge, 401 
circuit, 410 
edge, 401 


Euler, 405 
flow, 411 
increasing way, 409 
infinite, 402 
isomorphism, 402 
loop, 401 
mixed, 401 
non-planar, 410 
partial, 402 
planar, 410 
plane, 402 
regular, 402 
simple, 401 
special classes, 402 
strongly connected, 410 
subdivision, 410 
subgraph, 402 
transport network, 402 
tree, 402 
undirected, 401 
vertex, 401 
weighted, 403 
graph paper 
double logarithmic, 117 
log-log paper, 117 
notion, 116 
reciprocal scale, 117 
semilogarithmic, 116 
graph theory, algorithm, 401 
gravitational field, point mass, 728 
great circle, 160, 174 
greatest common divisor (gcd), 373 
linear combination, 374 
polynomials, 14 
Green 
function, 586 
integral theorem, 725 
method, 586, 587 
group, 336 
Abelian, 336 
cyclic, 340 
D3 (dihedral), 341 
point group, 340 
affine transformations, 355 
complete, linear 
GL(n, R), 353 
continuous 
dimension, 354 
dihedral, 336, 341 
element, 336 
character, 343 
inverse, 336 
Euclidean, 354 
scaled, 354 
factor group, 339 
four group (Klein's), 339 
GA(2), 355 
general linear, 352 
GL(n), 353 
Lie group, 351 
definition, 354 
L(n, R), 353 


matrix-Lie, 351 
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group (continued) 
dimension, 353 
multiplicative, cyclic, 364 
O(n), 351 
permutation, 338 
point group, 346 
real, special, linear S'L(n , R), 354 
representation, 340 
irreducible, 344 
reducible, 344 
SE(2), 355 
SE(3), 355 
S E(n), 354 
SL(2), 355 
SO(2), 354 
SO(n), 351 
special, affine, 354 
subgroup, 337 
symmetry group, 346 
table, 337 
Cayley table, 337 
grouping, 11 
groups 
applications, 345 
continuous, 353 
definition, 353 
product, 353 
convergence, 353 
direct product, 338 
general linear, 353 
homomorphism, 339 
theorem, 339 
isomorphism, 339 
Lie groups, 351 
matrix-Lie, 351-353 
definition, 353 
special, 354 
growth factor, 22 
Guldin's first rule, 506 
Guldin's second rule, 506 


half-angle formulas 
plane trigonometry, 143 
spherical trigonometry, 166 
half-line, notion, 129 
half-side formulas, 167 
Hamel basis, 657 
Hamilton 
circuit, 406 
differential equation, 875 
differential equation, partial, 861, 903 
function, 574, 861 
system, 875 
Hamiltonian, 593 
Hamming distance, 662 
Hankel transformation, 768 
harmonic analysis, 982 
harmonics, spherical (Legendre) 
first kind, 566 
second kind, 567 
Hasse diagram, 334 
heat conduction equation 
one-dimensional, 583, 783 


three-dimensional, 591 
Heaviside 
expansion theorem, 779 
function, 700, 757 
unit step function, 774, 1126 
helix, 260 
Helmholtz, differential equation, 594 
Hénon mapping, 858, 872 
Hermite 
differential equation, 568 
polynomial, 568 
polynomials, 602 
trapezoidal formula, 965 
Hessian 
matrix, 931 
normal form 
line equation, plane, 196 
plane equation, space, 219 
hexadecimal 
number, 1002 
system, 1002 
Hilbert 
boundary value problem, 651 
homogeneous, 651 
inhomogeneous, 652 
matrix, 1041 
space, 673 
isomorphic, 676 
pre-Hilbert space, 673 
scalar product, 673 
histogram, 833 
hodograph, 701 
Holder 
condition, 649 
continuity, 649 
inequality, 32 
integrals, 32 
series, 32 
Holladay, theorem, 997 
holohedry, 350 
homeomorphism 
and topological equivalence, 870 
conjugation, 873 
orientation preserving, 906 
homomorphism, 339, 394 
algebra, 672 
groups, 339 
theorem, 339 
linear operators, 658 
natural, 339, 363 
ring, 362 
theorem, 394 
ring, 362 
vector lattice, 661 
Hopf bifurcation, 894 
Hopf-Landau model, turbulence, 904 
Horner 
rule, 1003 
scheme, 952, 1003 
two rows, 953 
horseshoe mapping, 887 
Householder 
method, 314, 985 
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tridiagonalization, 319 
I'Huilier equations, 169 
hull 

convex, 657 

linear, 655 
hyper surface, 119 
hyperbola, 201 

arc, 204 

area, 204 

asymptote, 202 

binomial, 71 

conjugate, 203 

diameter, 203 

equation, 202 

equilateral, 66, 204 

focal properties, 202 

focus, 201 

radius of curvature, 204 

segment, 204 

semifocal chord, 201 

tangent, 202 

transformation, 207 

vertex, 201 
hyperbolic function, 759 

cosecant, 89 

cosine, 89 

cotangent, 89 

geometric definition, 132 

inverse, complex, 759 

secant, 89 

sine, 89 

tangent, 89 
hyperboloid, 225 

one sheet, 225, 226 

central surface, 228 

two sheets, 225 

central surface, 228 
hyperplane, 683 

of support, 684 
hypersubspace, 683 
hypocycloid, 103 

curtate, 104 

prolate, 104 
hypotenuse, 131 
hypothesis testing, 839 
hypotrochoid, 104 


icosahedron, 156 
ideal, 362 
principal ideal, 362 
idempotence law 
Boolean algebra, 396 
propositional logic, 324 
sets, 330 
identically valid, 10 
identity, 10 
Boolean function, 397 
Jacobian, 356 
representation of groups, 342 
identity matrix, 271 
IEEE standard, 1004 
IF-THEN rule, 425 
iff (if and only if), 370 


image 
function, 49 
set, 49 
space, 49, 767 
subspace, 367 
imaginary 
number, 34 
part, 34 
unit, 34 
units, generalized i, 7, k, 290 
immersion, 890 
implication, 323 
proof, 5 
impulse function, 774 
incidence function, 401 
incidence matrix, 403 
incircle 
quadrangle, 137 
triangle, 133, 144 
incommensurability, 4, 866 
increment, 21, 447 
independence 
linear, 308, 656 
path of integration, 521, 747 
potential field, 721 
induction step, 5 
inequality, 28 
arithmetic and geometric mean, 30 
arithmetic and quadratic mean, 30 
Bernoulli, 30 
Bessel, 676 
binomial, 31 
Cauchy-Schwarz, 31 
Chebyshev, 31, 814 
generalized, 32 
different means, 30 
first degree, 33 
Holder, 32 
linear, 33 
first degree, 33 
solution, 33 
Minkowski, 32 
product of scalars, 31 
pure, 28 
quadratic, 33 
solution, 33 
Schwarz-Buniakowski, 673 
second degree, 33 
solution, 28 
special, 30 
triangle, 182 
norm, 669 
triangle inequality 
complex numbers, 30 
real numbers, 30 
infimum, 660 
infinite 
denumerable, 335 
non-denumerable, 335 
infinitesimal quantity, 494 
infinity, 1 
infix form, 335 
inflection point, 249, 443, 444 
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initial conditions, 540 
initial value problem, 540, 969 
inner product, 673 
inscribed circle 
quadrangle, 137 
triangle, 133 
inscribed pentagram, 139 
instability, round-off error 
numerical calculation, 1007 
insurance mathematics, 21 
integer 
non-negative, 1 
part of x, 50 
integrability 
complete, 576 
condition, 521 
conditions, 521, 721 
differential, 521 
quadratic, 637 
integral 
absolutely convergent, 511 
antiderivative, 480 
basic 
notion, 481 
calculus, 480 
Cauchy type, 650 
circuit, 515, 521 
complex 
definite, 745 
indefinite, 746 
complex function 
measurable function, 696 
cosine integral, 513 
elementary functions, 481 
error integral, 514 
Euler integral, second kind, 512, 514 
exponential integral, 514 
Fourier integral, 478, 784 
Fresnel integral, 757 
integral logarithm, 484, 513 
interval of integration, 494 
Lebesgue integral, 507, 696 
comparison with Riemann integral, 506 
limits of integration, 494 
line integral, 515 
first type, applications, 517 
general type, 519 
second type, 517 
logarithm, 484, 513 
lower limit, 494 
non-elementary, 484, 513 
parametric, 512 
primitive function, 480 
probability integral, 819 
Riemann integral, 494 
comparison with Stieltjes integral, 506 
sine integral, 513, 756 
Stieltjes integral, 686 
comparison with Riemann integral, 506 
notion, 506 
surface integral, 532, 723 
triple integral, 527 
upper limit, 494 


volume integral, 527 
integral calculus, 480 
fundamental theorem, 495, 511 
mean value theorem, 497 
integral curves, 858 
integral equation, 621 
Abel, 648 
adjoint, 623, 650 
approximation 
successive, 625 
characteristic, 650 
collocation method, 634 
eigenfunction, 623 
eigenvalue, 623 
first kind, 621 
Fredholm, 621, 667 
degenerate kernel, 635 
first kind, 635 
second kind, 622 
general form, 621 
homogeneous, 621 
inhomogeneous, 621 
iteration method, 625, 642 
kernel, 621 
degenerate, 622 
iterated, 645 
product, 622 
kernel approximation, 632 
tensor product, 632 
linear, 621 
Nystróm method, 631 
orthogonal system, 637, 640 
perturbation function, 621 
quadratically integrable function, 637 
quadrature formula, 630 
second kind, 621 
singular, 648 
Cauchy kernel, 649 
transposed, 623 
Volterra, 621, 667 
convolution type, 645 
first kind, 643 
second kind, 643 
integral equation method, closed curve, 588 
integral exponential function, table, 1093 
integral formula 
Cauchy, 748 
application, 754 
Gauss, 725 
integral logarithm, 484, 513 
table, 1095 
integral norm, 698 
integral surface, 572 
integral test of Cauchy, 462 
integral theorem, 724 
Cauchy, 747 
Gauss, 724 
Green, 725 
Stokes, 725 
integral transformation, 767 
application, 769 
Carson, 768 
definition, 767 
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Fourier, 768, 784 
Fourier (table), 1114, 1125 
Fourier cosine (table), 1114 
Fourier exponential (table), 1127 
Fourier sine (table), 1120 
Gabor, 803 
Hankel, 768 
image space, 767 
inverse, 767 
kernel, 767 
Laplace, 768, 770 
Laplace (table), 1109 
linearity, 767 
Mellin, 768 
multiple, 769 
one variable, 767 
original space, 767 
several variables, 769 
special, 767 
Stieltjes, 768 
Walsh, 804 
wavelet, 800, 801 
fast, 803 
Z-, 794 
Z- (table), 1128 
integral, definite, 493 
differentiation, 496 
notion, 480 
particular integral, 495 
table, 1098 
algebraic functions, 1101 
exponential function, 1099 
logarithmic function, 1100 
trigonometric functions, 1098 
integral, elliptic, 490 
first kind, 484, 490, 763 
second kind, 490 
series expansion, 515 
table, 1103 
third kind, 490 
integral, improper, 493 
Cauchy’s principal value, 507, 510 
convergent, 507, 509 
divergent, 507, 509 
infinite integration limits, 506 
notion, 506 
principal value, 507 
unbounded integrand, 506 
integral, indefinite, 480 
basic integral, 481 
cosine function, table, 1085 
cotangent function, table, 1091 
elementary functions, 481 
elementary functions, table, 1065 
exponential function, table, 1093 
hyperbolic functions, table, 1092 
inverse hyperbolic functions, table, 1097 
inverse trigonometric function, table, 1096 
irrational functions, table, 1072 
logarithmic functions, table, 1095 
notion, 481 
other transcendental functions, table, 1092 
sine and cosine function, table, 1087 


sine function, table, 1083 
table, 1065 
tangent function, table, 1091 
trigonometric functions, table, 1083 
integrand, 481, 494 
integrating factor, 543 
integration 
approximate 
ordinary differential equation, 969 
partial differential equation, 976 
complex plane, 745 
constant, 481 
function, non-elementary, 513 
graphic, 485 
graphical, 499 
in complex, 754 
interval, 494 
limit 
depending on parameter, 512 
lower, 494 
upper, 494 
logarithmic, 483 
numerical, 963 
multiple integrals, 968 
partial, 484 
power, 483 
rational functions, 485 
rules 
by series expansion, 484 
by substitution, 484 
constant multiple rule, 482 
definite integrals, 496 
general rule, 482 
indefinite integrals, 483 
interchange rule, 497 
interval rule, 496 
series expansion, 499, 513 
sum rule, 482 
under the integration sign, 512 
variable, 494 
variable, notion, 481 
vector field, 719 
volume, 503 
integrator, 500 
integrity, domain, 361 
intensity, source, 727 
interaction, soliton, 604 
intercept theorem, 135 
interest, 22 
calculation, 21 
compound, 22 
intermediate value theorem 
one variable, 61 
several variables, 124 
intermediate variable, 435 
intermittence, 902, 905 
Internal Number Representation INR, 1003 


Internationale Standard Book Number ISBN, 383 


interpolation 
Aitken-Neville, 983 
condition, 982 
formula 
Lagrange, 983 
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Newton, 982 
fuzzy system, 430, 431 
knowledge-based, 430 
Lerp, 302 
node, 963 
equidistant, 969 
points, 982 
quadrature, 964 
quaternions, 302 
rotation matrices, 303 
Slerp, 302 
spline, 982, 996 
bicubic, 998 
cubic, 996 
Squad, 303 
trigonometric, 982, 992 
interpretation 
formula, predicate logic, 326 
variable, 324 
intersecting circles, 160 
intersecting plane, 160 
intersection, 329 
angle, 161 
by two oriented lines, 148 
fuzzy set, 418 
on the sphere, 172 
point 
lines, 197 
set, fuzzy set, 419 
sets, 328, 329 
without visibility, 148 
intersection line, 152 
intersection point 
four planes, 221 
plane and line, 223 
spherical curves, 175 
three planes, 220 
two lines, space, 223 
interval 
convergence, 469 
numbers, 2 
order (o) interval, 660 
rule, 496 
statistics, 832 
invariance 
rotation invariance, cartesian tensor, 283 
transformation invariance, cartesian tensor, 283 
translation invariance, cartesian tensor, 283 
invariant, 765 
quadratic curve, 206 
scalar, 214 
scalar invariant, 185 
surface second order, 228 
inverse, 659 
inverse function, 52 
hyperbolic, 93 
trigonometric, 85 
inverse transformation, 767 
inversion 
Cartesian coordinate system, 287 
conformal mapping, 736 
space, 287 
involute, 255 


isometry, space, 669 
isomorphism, 339, 394 
Boolean algebra, 397 
graph, 402 
groups, 339 
surjective norm, 684 
vector spaces, 659 
iteration, 949 
inverse, 319 
method, 319, 949, 960 
ordinary, 949, 961 
sequential steps, 961 
simultaneous steps, 961 
vector, 321 


Jacobian 
determinant, 123, 285, 691 
function, 763 
identity, 356 
matrix, 691, 962 
method, 319, 960 
joint 
kinematics, robotics, 360 
mechanical, S E(3) application, 359 
Jordan 
matrix, 319 
normal form, 319 
jump, finite, 59 


KAM (Kolmogorov-Arnold-Moser) theorem, 875 
Kasterinian formula, 565 
ker, 394, 659 
kernel, 339 
approximation 
integral equation, 632 
spline approach, 633 
tensor product, 632 
congruence relation, 394 
homomorphism, 394 
integral equation, 621 
degenerate, 632 
iterated, 626, 645 
resolvent, 628 
solving, 626 
integral transformation, 767 
operator, 659 
ring, 362 
subspace, 367 
kink soliton, 607 
Kirchhoff formula, 590 
Klein's four group, 339 
Koch curve, 883 
Korteweg de Vries equation, 604 
Kronecker 
generalized delta, 283 
product, 276 
symbol, 271, 283 
Kuhn-Tucker conditions 
global, 925 
local, 925 
reference, 684 
Kuratowski theorem, 410 


Lagrange 


Index 1179 





function, 456 
identity, vectors, 186 
interpolation formula, 983 
method of multiplier, 456 
theorem, 338 
Lagrangian, 456, 925 
Laguerre 
differential equation, linear, second order, 568 
polynomials, 568 
Lanczos method, 319 
Landau, order symbol, 57 
Laplace 
differential equation, partial, 592, 729 
expansion, 278 
function, 729 
wave equation, 590 
Laplace operator 
different coordinates, 716, 717 
polar coordinates, 454 
quaternions, 306 
vector components, 718 
Laplace transformation, 770 
addition law, 771 
comparing to Fourier transformation, 790 
convergence, 770 
convolution, 773 
complex, 774 
one-sided, 773 
definition, 770 
differential equation, ordinary 
linear, constant coefficients, 781 
linear, variable coefficients, 782 
differential equation, partial, 783 
differential equations, 781 
differentiation, image space, 772 
differentiation, original space, 772 
discrete, 796 
division law, 773 
frequency-shift theorem, 771 
image space, 770 
integration, image space, 772 
integration, original space, 772 
inverse, 770, 778 
inverse integral, 780 
linearity law, 771 
original function, 770 
original space, 770 
partial fraction decomposition, 778 
piecewise differentiable function, 775 
series expansion, 779 
similarity law, 771 
step function, 774 
survey, 769 
table, 1109 
transform, 770 
translation law, 771 
largest area method, 427 
lateral area 
cone, 157 
cylinder, 156 
polyhedron, 153 
lateral face, 152 
latitude, geographical, 162, 262 


lattice, 395 
Banach, 672 
Bravais, 349 
crystallography, 348 
distributive, 395 
vector, 672 
Laurent 
expansion, 752 
series, 752, 796 
law 
cosine, 166 
Gauss error propagation, 855 
large numbers, 825 
Bernoulli, 825 
limit theorem, Lindeberg-Levy, 825 
sides (cosine), 166 
sine, 166 
sine-cosine, 166 
polar, 166 
tangent, 169 
laws 
propositional calculus, 324 
layer, spherical, 158 
lcm (least common multiple), 374 
least common multiple (lcm), integer numbers, 374 
least squares method, 109, 313, 974, 978, 984 
calculus of observations, 848 
Gauss, 456 
regression analysis, 841 
least squares problem, linear, 313, 959 
Lebesgue 
integral, 507, 696 
comparison with Riemann integral, 506 
measure, 694, 876 
left-hand coordinate system, 209 
left-singular vector, 321 
leg or side of an angle, 129 
Legendre 
differential equation, 565 
polynomial 
associated, 567 
second kind, 567 
polynomials 
first kind, 566 
symbol, 379 
Leibniz 
alternating series test, 463 
formula, 438 
lemma 
Jordan, 755 
Schur, 345 
lemniscate, 101, 252 
length 
arclenth (radian), 131 
interval, 693 
line integral, first type, 517 
measure of arc 
space curve, 264 
spherical geometry, 161 
reduced, 175 
vector, 190 
level 
curve, 119 
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line, 119, 703 
surface, 703 
level set (fuzzy set), 417 
library (numerical methods), 1009 
Aachen library, 1011 
IMSL library, 1010 
NAG library, 1009 
Lie algebra, 351, 356 
basis, 357 
Lie group - Lie algebra, connection, 356 
matrix-exponential function, 357 
real, 356 
rigid-body motion, 358 
robotics applications, 358 
special, 356 
vector product, 360 
Lie brackets, 356, 596 
Lie groups, 351 
continuous 
definition, 353 
matrix-Lie groups, 353 
applications, 355, 356 
special, 354 
rigid-body movement 
group SE(3), 355 
robotics, control, 355 
limit 
cycle, 866, 895 
stable, 866 
unstable, 866 
definite integral, 493 
function 
complex variable, 731 
one variable, 53 
several variables, 123 
theorems, 55 
function series, 467 
integration, depending on parameter, 512 
partial sum, 467 
sequence of numbers, 458 
sequence, in metric space, 664 
series, 459 
superior, 469 
Lindeberg-Levy, central limit theorem, 825 
line, 63 
curvature, surface, 266 
geodesic, 160 
imaginary, 207 
notion, 129 
vector equation, 189 
line element 
surface, 263 
vector components, 719 
line equation 
plane, 195 
Hessian normal form, 196 
intercept form, 196 
polar coordinates, 196 
slope, 195 
through a point, 195 
through two points, 196 
space, 221 
line integral, 515 


Cartesian coordinates, 721 
first type, 516 
applications, 517 

general type, 519 

second type, 517 

vector field, 719 
linear combination, vectors, 85, 183, 185 
linear form, 658, 659 

continuous, 681 
linear programming 

basic notions, 911 

constraints, 909 

extreme point, 911 

normal form, 911 

objective function, 909 
linearly 

dependent, 366 

independent, 366 
lines (plane) 

angle between two lines, 197 

intersection point, 197 

orthogonal, 129, 198 

parallel, 129, 198 

pencil, 197 

perpendicular, 129, 198 
lines (space), 151 

equations, 218, 221 

intersecting points with planes, 223 

parallelism, 151 

skew, 151 
Liouville 

approximation theorem, 4 

formula, 554, 862 

theorem, 861 

theorem about the constant, 733 
Lipschitz condition 

differential equations of higher-order, 551 

ordinary differential equation, 541 
locus, geometric, 195 
logarithm 

binary, 10 

Briggsian, 9 

decimal, 9 

definition, 9 

natural, 9, 758 

Neperian, 9 

n x n matrices, 356 

principal value, 758 

quaternions, 294 

table, 10 

taking of a number, 9 
logarithm function, 73 

complex, 738 

conformal mapping, 738 

many valued, 739 

quaternions, 294 

representation of transcendental functions, 759 
logarithmic decrement, 85 
logarithmic normal distribution, 819 
logic, 323 

fuzzy, 413 

predicate, 326 

propositional, 323 
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longitude, geographical, 162, 262 
loop, graph, 401 
Lorentz curve, 791 
Lorenz system, 858, 861, 901 
loxodrome, 178 
arclength, 179 
course angle, 179 
intersection point, 179 
intersection point of two loxodromes, 180 
L? space, 697 
LU factorization (lower and upper triangular matrix), 
956 
Lyapunov 
dimension, 885 
exponent, 880 
function, 863 
stability, 863, 864 
theorem, 863 


MacDonald function, 563 
Maclaurin series expansion, 472 
Macsyma (computer algebra system), 1023 
majorant series, 466 
manifold 
center manifold theorem 
local, 897 
mappings, 897 
of solutions, 955 
stable, 866, 872 
unstable, 866, 872 
mantissa, 10, 1005 
Maple 
input, output, 1019 
numerical mathematics, 1019 
differential equations, 1022 
equations, 1020 
expressions and functions, 1019 
integration, 1021 
short characteristic, 1023 
Maple (computer algebra system), 1023 
mapping, 331, 333 
affine, 230 
anti symmetric, 356 
between groups, 339 
bijective, 658 
bilinear, 356, 368 
complex number plane, 745 
conformal, 734 
circular transformation, 736 
exponential function, 739 
fixed-point, 735, 736 
inversion, 736 
isometric net, 735 
linear fractional, 736 
linear function, 735 
logarithm, 738 
quadratic function, 737 
square root, 737 
sum of linear and fractional linear functions, 738 
contracting, 666 
equivalent, 905 
function, 49, 331 
Hénon mapping, 872 


horseshoe, 887 
injective, 333, 658 
inverse mapping, 333 
kernel, 339 
lifted, 905 
linear, 658 
linear operator, 366 
linear transformation, 658 
modulo, 876 
one to one, 333 
Poincaré, 868, 887 
reduced, 897 
regular, 367 
rotation mapping, 877 
shift, 880 
surjective, 333, 658 
tent, 876 
topological conjugation, 873 
unit circle, 905 
mass 
double integral, 527 
line integral, first type, 517 
triple integral, 532 
matching, 409 
maximal, 409 
perfect, 409 
saturated, 409 
Mathcad (computer algebra system), 1023 
Mathematica (computer algebra system), 1023 
Mathematica 
3D graphics, 1051 
algebraic expressions 
manipulation, 1036 
multiplication, 1036 
apply, 1033 
attribute, 1035 
basic structure elements, 1024 
context, 1035 
curves 
parametric representation, 1050 
two-dimensional, 1049 
differential and integral calculus, 1042 
differential equations, 1044 
differential operator, 1042 
differential quotient, 1042 
differentiation, 1033 
functions, 1042 
eigenvalue problems, 1040 
eigenvalues, 1041 
eigenvectors, 1041 
elements, 1024 
equations 
logical expressions, 1038 
solution, 1038 
transcendental, 1039 
expressions, 1024 
FixedPoint, 1033 
FixedPointList, 1033 
floating-point number, conversion, 1026 
function, inverse, 1033 
functional operations, 1032 
functions, 1031 
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Mathematica (continued) 
graphics 
functions, 1047 
options, 1047 
primitives, 1045, 1046 
heads, 1024 
important applications, 1036 
input, output, 1016, 1023, 1025 
integrals 
definite, 1044 
indefinite, 1043 
multiple, 1044 
rational functions, 1043 
trigonometric functions, 1043 
lists, 1027 
nested, 1028 
manipulation 
formulas, 1023 
non-polynomial expressions, 1038 
manipulation with matrices, 1030 
manipulation with vectors, 1030 
Map, 1033 
matrices as lists, 1029 
messages, 1035 
Nest, 1033 
NestList, 1033 
numerical calculations 
introduction, 1024 
numerical mathematics, 1016 
curve fitting, 1016 
differential equations, 1018 
integration, 1018 
interpolation, 1017 
polynomial equations, 1017 
objects, three-dimensional, 1051 
operators, important, 1026 
partial fraction 
decomposition, 1037 
pattern, 1032 
polynomial equations, 1039 
polynomials 
factorization, 1037 
operations, 1037 
programming, 1034 
series, inverse, 1033 
short characteristic, 1023 
surfaces, 1051 
surfaces and space curves, 1051 
syntax, additional information, 1035 
system description, 1024 
system of equations 
general case, 1040 
special case, 1040 
systems of equations, 1040 
solution, 1038 
types of numbers, 1025 
vectors as lists, 1029 
mathematics, discrete, 323 
Matlab, 1011 
functions, survey, 1011 
numerical integration, 1014 
numerical interpolation, 1013 
curve fitting, 1013 


numerical linear algebra, 1012 

numerical solution 
differential equations, 1015 
equations, 1013 

toolbox, 1012 


matrices 


arithmetical operations, 272 
associative law, 272 
commutative law, 272, 273 
Dirac, 290 

distributive law, 272 
division, 272 

eigenvalue, 314 

eigenvalue problem, 314 
eigenvectors, 314 

equality, 272 
multiplication, 272 

Pauli, 290 

powers, 276 

products, 275 
quaternions, 291 

rules of calculation, 275 


matrix, 269 


adjacency, 403 

adjoint, 269, 278 

admittance, 408 

anti-Hermitian, 271 

antisymmetric, 270 

block tridiagonal, 977 

calculation of inverse matrix, 308 

complex, 269 

conjugate, 269 

deflation, 321 

degree, 408 

diagonal, 270 

differentiation, 276 

distance, 404 

extended coefficient, 957 

full rank, 313 

functional, 691 

fundamental, 862 

Hermitian, 271 

Hessian, 931 

identity, 271 

incidence, 403 

inverse, 274, 278 

Jacobian, 691, 962 

Jordan, 319 

lower triangular, 957 

main diagonal element, 270 

matrix-Lie group, 352 

monodromy, 863, 872 

n xn, 353 

normal, 270 

of coefficients, 308 
augmented, 309 

orthogonal, 275 

projection, 238 

quaternions representation, 291 
complex matrix, 292 
real matrix, 291 

rank, 274 

real, 269 


Index 


1183 





matrix (continued) 
rectangular, 269 
regular, 274 
rotation, 231, 275 


space, 212, 215, 216 


rotation, object, 295 
rotations, 294 
scalar, 270 
scaling, 231 
self-adjoint, 271 
shearing, 231 
singular, 274 
size, 269 
skew-symmetric, 270 
spanning tree theorem, 408 
sparse, 977 
spur, 270 
square, 269, 270 
stochastic, 826 
symmetric, 270 
trace, 270 
transformation, 231 
translation, 231 
transposed, 269 
triangle decomposition, 955 
triangular, 271 
unitary, 275 
upper triangular, 957 
zero, 269 

matrix product 


d 


isappearing, 275 


inequality, 273 


matrix-exponential function, 352, 862 


S 


Lie algebra, 357 


ecial, 352 


matrix-Lie algebra, 357 


g 


roup SE(3), 357 


matrix-Lie group 


a 
a 


efinition, 353 
imension, 353 


quaternions, 355 


matrix-Lie groups 


aj 
S 


pplications, 355 
ecial, 354 


max-min composition, 424 
maximum 


al 


bsolute, 51, 61, 125, 443 





g 


obal, 51, 443 


local, 51 

relative, 51, 443 
maximum-criterion method, 426 
Maxwell, diagonal method, 744 


mean 
a 
g 
g 


rithmetic, 19, 813, 850 
eometric, 20 
olden, 908 


harmonic, 20 
quadratic, 20 
sample function, 831 
statistics, 833 


Vi 


alue, 19 


weighted, 813, 850, 854 


mean 


squares problem 


different versions, 456 

linear, 313 

non-linear, 987 

rank deficiency case, 314 
mean value formula, 964 


mean value function, integral calculus, 497 


mean value method, 109 
mean value theorem 
differential calculus, 442 
generalized, 443 
integral calculus, 497 
generalized, 498 
measure, 694 
concentrated on a set, 876 
counting, 694 
degrees, 131 
dimension, 884 
Dirac, 694, 876 
ergodic, 877 


function, convergence theorems, 697 


Hausdorff, 882 
invariant, 876 
Lebesgue, 694, 876 
natural, 876 
physical, 877 
probability, 697 
invariant, 876 
SBR (Sinai-Bowen-Ruelle), 877 
c algebra, 694 
c finite, 697 
support, 876 
measured value, 848 
measurement 
error, 848 
error density, 848 
error, characterization, 848 
error, distribution, 848 
protocol, 848 
measuring function, time series, 889 
median, 143 
sample function, 832 
statistics, 834 
triangle, 133 
Mellin transformation, 768 
Melnikov method, 903 
membership 
degree, 413 
function, 413, 414 
bell-shaped, 415 
trapezoidal, 414 
relation, 327 
meridian, 162, 262 
convergence, 171 
tangential, 178 
method 
Bairstow, 45, 954 
barrier, 941 
Bernoulli, 954 
bisection, 319 
Box-Miiller, 934 
Brodetsky-Smeal, 45 
broken line, Euler, 969 
center of area, 427 
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method (continued I) 


center of gravity, 426 
generalized, 427 
parametrized, 427 

Cholesky, 314, 958 

collocation 
boundary value problem, 974 
integral equation, 634 
partial integral equation, 978 

comparing coefficients, 17 

cutting plane, 942 

descent, 931 

difference 
ordinary differential equations, 973 
partial differential equations, 976 

extrapolation, 970 

Fibonacci, 930 

finite difference, 588 

finite element, 588, 619, 978 

Galerkin, 974 

Gauss elimination, 955 

Gauss-Newton, 962 
derivative free, 963 

Gauss-Seidel, 960 

Gaussian least squares, 456 

gradient, 619 

Graeffe, 954 

Green, 586, 587 

Hildreth-d'Esopo, 929 

Householder, 314, 985 

inner digits of squares, 843 

integrating factor, 543 

interpolation 
polynomials, 982 
quaternions, 302 

iteration, 319, 949, 955, 960 
ordinary, 949, 961 

Jacobian, 319, 960 

Kelley's, optimization, 942 

Lagrange multiplier, 925 

Lanczos, 319 

largest area, 427 

least squares, 109, 955, 974, 978, 984, 986 

Mamdani, 427 

maximum-criterion, 426 

mean-of-maximum, 426 

Melnikov, 903 

Mises, 319 

Monte Carlo, 843 

multi-step, 970 

multiple-target, 975 

Newton, 950, 962 
modified, 950 
non-linear operators, 690 

operator, 769 

orthogonalization, 314, 958, 959 

parametrized center of area, 427 

pivoting, 307 

polygonal, Euler, 969 

predictor-corrector, 971 

regula falsi, 951 

regularization, 321 

relaxation, 961 





method (continued II) 
Riemann, 584 
Ritz, 618, 974 
Romberg, 966 
RSA, 392 
Runge-Kutta, 969 
separation of variables, 542, 579 
shooting, 975 
single-target, 975 
SOR (successive over-relaxation), 961 
statistical experiment, 847 
steepest descent, 931 
Steffensen, 951 
successive approximation 
Banach space, 679 
Picard, 549 
Sugeno, 428 
transformation, 319 
undetermined coefficients, 17 
variation of constants, 554 
Wolfe, 928 
method unknown coefficients 
n-th order, 560 
metric, 662 
Euclidean, 662 
surface, 265 
metric prefixes, 1054 
Meusnier, theorem, 265 
midpoint line segment 
plane, 193 
space, 217 
midpoint rule, 971 
minimal surface, 267 
minimum 
absolute, 51, 61, 125, 443 
global, 51, 443 
local, 51 
point, 924 
global, 924 
local, 924 
relative, 51, 443 
Minkowski 
inequality, 32 
integrals, 33 
series, 32 
Mises, power method, 319 
mixed product, 188 
modal value, statistics, 834 
mode, statistics, 834 
model 
Hopf-Landau, turbulence, 904 
urn model, 814 
module of an element, 661 
modulo 
congruence, 334, 377 
mapping, 876 
modulus, analytic function, 732 
de Moivre formula 
complex number, 38 
generalized, 293 
hyperbolic functions, 92 
trigonometric functions, 81 
Mollweide equations, 143 
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moment of inertia, 282, 505 
double integral, 527 
line integral, first type, 517 
triple integral, 532 
moment, order n, 813 
monodromy matrix, 863, 864, 872 
monotony, 441, 694 
function, 50 
sequence, numbers, 457 
Monte Carlo method, 843 
application in numerical mathematics, 845 
usual, 845 
Monte Carlo simulation, 843 
example, 845 
Morse-Smale system, 875 
multi-index, 671 
multi-scale analysis, 803 
multi-step method, 970 
multiple integral, 523 
transformation, 769 
multiple-target method, 975 
multiplication 
complex numbers, 37 
computer calculation, 1005 
matrices, 272, 275 
polynomials, 11 
quaternions, 292, 293 
rational numbers, 1 
rule 
quaternions, 291 
units, generalized imaginary i, j,k, 290 
multiplicity, of divisors, 370 
multiplier, 543, 863, 864, 872 
Lagrange method, 456 
mutation, 933 


nabla operator 
definition, 715 
divergence, 715 
gradient, 715 
quaternions, 305 
rotation, 715 
twice applied, 716 
vector gradient, 715, 716 
NAND function, 325 
nautical, radio bearing, 172 
navigation, 174 
satellite, 304 
negation, 323 
Boolean function, 397 
double, 325 
neighborhood, 663 
point, 663 
Neper 
analogies, 169 
equations, 169 
logarithm, 9 
rules, 170 
Neumann 
functions, spherical, 565 
problem, 729 
series, 626, 645 
Newton 


field, point-like mass, 705 
interpolation formula, 982 
Newton method, 950, 962 
adaptive, 1021 
modified, 950 
non-linear operators, 690 
approximation sequence, 690 
modified, 690 
non-linear optimization, 931 
damping method, 932, 962 
nodal plane, 598 
node, 869 
approximation interval, 996 
differential equation, ordinary, 547 
saddle node, 864, 869 
stable, 864 
triple, 896 
nominal rate, 22 
nomography, 125 
alignment charts, 125, 126 
examples with three variables, 126 
alignment nomogram, 126 
key equation, 125 
net charts, 125 
more than three variables, 128 
three variables, 125 
nomogram, 125 
non-negative integer, 1 
NOR function, 325 
norm 
axioms, linear algebra, 276 
Euclidean, 367 
integral, 698 
isomorphism, 684 
matrix, 276 
column sum norm, 277 
row sum norm, 277 
spectral norm, 277 
subordinate, 277 
uniform norm, 277 
operator norm, 677, 862 
pseudonorm, 682 
residual vector, 313 
semi-norm, 682 
s norm, 418 
space, 669 
t norm, 418 
vector, 276 
Euclidean norm, 276, 277 
matrix norm, 277 
sum norm, 277 
normal 
plane curve, 244 
principle (surface), 265 
section (surface), 265 
normal distribution, 818 
logarithmic normal distribution, 819 
observational error, 73 
standard normal distribution, 819 
standard, table, 1133 
two-dimensional, 840 
normal equation, 959, 984, 985 
system, 842, 984 
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normal form, 399 
differential equation, Riccati, 544 
equation of a surface, 224 
Jordan, 319 
principal conjunctive, 399 
principal disjunctive, 399 
normal plane, space curve, 257, 259 
normal vector 
plane, 218 
plane sheet, 722 
surface, 262 
normalization condition, 598 
normalizing factor, 196 
northern direction 
geodesical, 170 
geographical, 170 
notation 
Polish notation, 408 
postfix notation, 408 
prefix notation, 408 
reversed polish, 408 
n tuple, 118 
number, 1 
approximation, 4 
cardinal, 328, 335 
Carmichael, 382 
complex, 34 
absolute value, 35 
addition, 36 
algebraic form, 34 
argument, 35 
division, 37 
exponential form, 36 
field, 364 
main value, 35 
multiplication, 37 
power, 38 
subtraction, 36 
taking the root, 38 
trigonometric form, 35 
composite, 370 
conjugate complex, 36 
coprime, 373 
hyper-complex, 289 
imaginary, 34 
integer, 1 
irrational, 2-4, 334 
algebraic, 2 
quadratic, 2 
natural, 1 
non-negative, 1 
pseudoprime, 382 
random, 830 
rational, 1, 3 
real, 2 
taking of root, 8 
sequence, 457 
convergent, 664 
metric space, 664 
transcendental, 2 
number line, extended, 694 
number plane, complex, Gauss, 34 
number representation, computer internally, 1001, 1003 


number system, 1001 
binary, 1002 
decimal, 1002 
hexadecimal, 1002 
octal, 1002 
number theory, elementary, 370 
numbers 
Bernoulli, 465 
Clifford, 290 
complex 
generalized, 290 
Euler, 466 
Fermat, prime, 371 
Fibonacci, 375, 908 
interval, 2 
Mersenne, prime, 371 
prime, 370 
pseudo-random, 844 
pseudorandom, 844 
random, 843 
normal distributed, 934 
numerical 
analysis, 949 
axis, 1 
calculations, computer, 1004 
library (numerical methods), 949 
nutation, 215 
nutation angle, 215 
Nystróm method, 631 


obelisk, 155 
objective function, 909 
selection, 933 
observable, 594 
observational value, 848 
occupancy number, statistics, 833 
octahedron, 156 
octal number, 1002 
octal system, 1002 
octant, 210 
Octave, 1011 
offspring (descendant), 933 
populations, 936 
w-limit set, 859, 865, 872 
operation, 335 
algebraic, 282 
arithmetical, 1 
associative, 335 
binary, 335 
commutative, 335 
exterior, 336 
n ary operation, 335 
operational method, 588 
operational notation, differential equation, 555 
operator 
adjoint, 684 
AND, 421 
bounded, 677 
closed, 678 
coercivity, 693 
compact, 686 
compensatory, 421 
completely continuous, 686 
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operator (continued) 
continuous, 668 
inverse, 679 
contracting, 666 
demi-continuous, 693 
differentiable, 690 
finite dimensional, 686 
gamma, 421 
Hammerstein, 689 
idempotent, 686 
inverse, 659 
isotone, 692 
lambda, 421 
linear, 366, 658 
bounded, 677 
continuous, 677 
linear transformation, 366 
linear, permutability, 367 
linear, product, 366 
linear, sum, 366 
monotone, 693 
Nemytskij, 689 
non-linear, 689 
norm, 862 
notion, 49 
OR, 421 
positive, 660 
positive definite, 686 
positive non-linear, 692 
self-adjoint, 685 
singular, 650 
strongly monotone, 693 
Urysohn, 690 
operator method, 769 
opposite side, 131 
optimality condition, 924, 926 
sufficient, 925 
optimality principle, Bellman, 945 
optimization, 909 
optimization method 
Bellman functional equation, 946 
conjugate gradient, 932 
cutting plane method, 942 
damped, 932 
DFP (Davidon-Fletcher-Powell), 932 
evolution strategies, 933 
feasible direction, 937 
Fibonacci method, 930 
golden section, 930 
Hildreth-d’Esopo, 929 
Kelley, 942 
penalty method, 940 
projected gradient, 938 
unconstrained problem, 931 
Wolfe, 928 
optimization problem 
convex, 683, 926 
dynamic, 943 
non-linear, 991 
optimization, non-linear, 924 
barrier method, 941 
convex, 926 
convexity, 927 


descent (offspring) method, 931 
direction search program, 937 
duality, 926 
evolution strategies, 933 
gradient method 
inequality constraints, 936 
mutation, 933 
Newton method, 931 
numerical search procedure, 930 
offspring (descent) method, 931 
quadratic, 926 
recombination, 933 
saddle point, 925 
selection, 933 
steepest descent (offspring) method, 931 
orbit, 857 
double, periodic, 898 
heteroclinic, 868 
homoclinic, 868, 903 
periodic, 857 
hyperbolic, 865 
saddle type, 865 
order 
Clifford algebra, 289 
curve, second order, 206 
curve, n-th order, 195 
differential equation, 540 
interval, 660 
of magnitude, function, 57 
(ο) interval, 660 
second order surface, 224 
wavelet, 801 
order relation, 333 
linear, 334 
order symbol, Landau, 57 
ordering, 334 
lexicographical, 334 
linear, 334 
partial, 334, 659 
ordinate 
plane coordinates, 190 
space coordinates, 210 
Ore theorem, 406 
orientation, 288 
coordinate system, 209 
numerical axis, 1 
origin 
numerical axis, 1 
plane coordinates, 190 
space coordinates, 210 
original function, 767 
original space, 767 
set, 49 
orthocenter, triangle, 133 
orthodrome, 174 
arclength, 176 
course angle, 174 
intersection point 
meridian, 176 
parallel circle, 176 
two orthodromes, 180 
point, closest to the north pole, 174 
orthogonal 
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function system, 985 
polynomials, 985 
space, 674 
spherical, 171 
orthogonality 
eigenvalues, differential equation, 570 
Hilbert space, 674 
lines, 129 
real vector space, 367 
trigonometric functions, 368 
vectors, 185 
weight, 570 
orthogonality conditions 
line-plane, 224 
lines in space, 223 
planes, 221 
orthogonalization method, 958, 959 
Givens, 960 
Gram-Schmidt, 675 
Householder, 314, 960 
Schmidt, 637 
orthogonalization process 
Gram-Schmidt, 316 
orthonormal function system, 985 
orthonormalization, vectors, 281 
oscillation 
duration, 84 
harmonic, 84 
oscillator, linear harmonic, 601 
osculating plane, space curve, 257, 259 
osculation point, curve, 250 
over-field, 362 
oversliding, 285 


pair, ordered, 331 
parabola, 204 
arclength, 206 
area, 206 
axis, 204 
binomial, 71 
cubic, 64 
diameter, 205 
directrix, 204 
equation, 205 
focus, 204 
intersection figure, 226 
n-th degree, 66 
parameter, 205 
quadratic polynomial, 64 
radius of curvature, 206 
semicubic, 95 
semifocal chord, 204 
tangent, 205 
transformation, 207 
vertex, 204 
paraboloid, 226 
elliptic, 226 
hyperbolic, 226, 227 
central surface, 228 
invariant signs, 229 
of revolution, 226 
parallel circle, 262 
parallelepiped, 153 


rectangular, 153 
parallelism conditions 
line-plane, 224 
lines, 129 
lines in space, 223 
planes, 221 
parallelogram, 135 
parallelogram identity, unitary space, 673 
parameter, 11, 50 
parabola, 205 
statistical, 833 
parameter space, stochastic, 826 
parametric integral, 512 
parametric representation, circle, 199 
parametrized center of area method, 427 
parity, 601 
Parseval 
equation, 475, 570, 676 
formula, 789 
partial fraction decomposition, 15, 778 
special cases, 1071 
partial ordering, 334 
partial sum, 459 
partition, 334 
Pascal limaçon, 98 
Pascal triangle, 13 
path, 404 
closed, 404 
integration, 516 
patterns, periodic, 604 
Pauli matrices, 290 
PCNF (principal conjunctive normal form), 399 
PDNF (principal disjunctive normal form), 399 
pencil of lines, 197 
pendulum 
equation, 904 
Foucault, 1019 
mathematical, 762 
period, 763 
pentagon, regular, 139 
pentagram, 4 
regular, 139 
percentage, 21 
calculation, 21 
performance score, 425 
perimeter 
circle, 139 
ellipse, 201 
polygon, 138 
period 
function, 52 
secant, 78 
sine, 77, 84 
tangent, 77 
period doubling, 898 
cascade, 899, 905 
period parallelogram, 763 
permutability, linear operators, 367 
permutation, 805 
cyclic, vectors, 209 
group, 338 
matrix, 957 
with repetition, 805 
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without repetition, 805 
perturbation, 589, 1006 
Pesin formula, 881, 888 
phase 
initial, 84 
portrait, dynamical systems, 860 
shift, 77, 84 
sine, 77, 84 
space, dynamical systems, 857 
spectrum, 787 
Picard 
iteration method 
integral equation, 625 
successive approximation method, 549 
Picard-Lindelóf theorem, 668 
pivot, 956 
column, 307, 915 
element, 307, 915 
row, 307, 915 
scheme, 307 
pivoting, 307, 310 
column pivoting, 957 
step, 307 
plain trigonometry, 142 
plane (space) 
intersecting line, 152 
normal plane, 257, 262 
osculating plane, 257 
parallel line, 152 
projecting plane, 221 
rectifying plane, 257, 259 
tangent plane, 257, 262, 264 
vector equations, 189 
plane geometry, 129 
planes (space), 151 
equations, 218 
intersecting points with lines, 223 
orthogonality, conditions, 221 
parallelism, 151 
distance, 221 
parallelism, conditions, 221 
planimeter, 500 
Poincaré mapping, 868, 870 
Poincaré section, 903 
point 
accumulation, 664 
asymptotic, curve, 250 
boundary, 664 
circular, 267 
coordinates, 190 
corner, 250 
cuspidal, curve, 250 
discontinuity, 58 
double, curve, 250 
fixed, conformal mapping, 736 
fixed, stable, 899 
focal, ordinary differential equation, 548 
image, 237 
improper, 193 
infinite, 193 
interior, 663 
isolated, 664 
isolated, curve, 250 


limit, 255 
limiting, 664 
multiple, curve, 251 
n-dimensional space, 118 
nearest approach, 255 
neighborhood, 663 
non-wandering, 875 
notion, 129 
plane curve, 249 
principal vanishing, 241 
rational, 1 
regular, surface, 262 
saddle, ordinary differential equation, 548 
singular, 243, 250 
isolated, 547 
ordinary differential equation, 547 
surface, 262, 263 
spectrum, 680 
spiral, ordinary differential equation, 548 
surface point 
circular, 267 
elliptic, 266 
hyperbolic, 267 
parabolic, 267 
spherical, 266 
umbilical, 267 
terminal, curve, 250 
transversal homoclinic, 872 
umbilical, 267 
vanishing, 237, 241 
Poisson 
differential equation, partial, 592, 726, 729 
distribution, 817 
formula, 591 
integral, 587 
polar, 164 
angle, 191 
axis, 191 
coordinates, plane, 191 
coordinates, spherical, 211 
distance, 262 
equation, 599 
curve second order, 208 
subnormal, 245 
subtangent, 245 
pole 
complex function, 733 
field, 703 
function, 60 
left, 164 
multiplicity m, complex function, 753 
on the sphere, 164 
order m, complex function, 753 
origin, 191 
right, 164 
Polish notation, 408 
polyeder, 153 
polygon 
area 
2n gon, 139 
n gon, 138 
base angle, 138 
central angle, 138 
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circumcircle radius, 138 
circumscribing, 138 
exterior angle, 138 
inscribed circle radius, 138 
interior angle, 138 
perimeter, 138 
plane, 138 
regular, 340 
regular, convex, 138, 139 
side length, 138 
similar, 134 
polyhedral angle, 152 
polyhedron, 152, 153 
convex, 155 
regular, 155 
polynomial, 63 
characteristic, 315 
control, 386 
equation, numerical solution, 952 
first degree, 63 
generating, 386 
indecomposable, 363 
integral rational function, 62 
interpolation, 982 
irreducible, 363 
n-th degree, 65 
primitive, 364 
product representation, 43 
quadratic, 64 
ring, 363 
second degree, 64 
third degree, 64 
trigonometric, 992 
polynomials, 11 
Bernstein, 1000 
Chebyshev, 89, 985, 989 
Hermite, 568, 602, 675, 985 
Laguerre, 568, 675, 985 
Legendre, 566, 600, 674, 985 
orthogonality, 985 
population, 830 
two-stage, 814 
population strategies, 935 
Posa theorem, 406 
position coordinate, reflection, 287 
positive definite, 958 
postfix notation, 408 
potential 
complex, 741 
equation, 592 
field, 721 
conservative, 721 
rotation, 715 
retarded, 591 
power 
complex number, 38 
notion, 7 
real number, 7 
reciprocal, 70 
power function, 71 
quaternions, 294 
power series, 469 
asymptotic, 472 


complex terms, 750 
expansion, analytic function, 749 
inverse, 471 
power set, 335 
power spectrum, 878 
precession, 215 
predicate, 326 
logic, 326 
n ary, 326 
predictor, 971 
predictor-corrector method, 971 
pre-Hilbert space, 673 
present value, 25 
pressure, 504 
gravitational, 504 
lateral, 504 
prevalence, 891 
prime 
coprime, 5, 14 
decomposition, canonical, 372 
element, 370 
factorization, 371 
canonical, 371 
Fermat, 371 
Mersenne, 371 
notation (measurement protocol), 848 
pair, 371 
quadruplet, 371 
relatively, 14 
triplet, 371 
prime formula, predicate logic, 326 
prime number, 370 
pseudo, 382 
test 
AKS, 383 
Fermat, 382 
Lucas-Lehmer, 371 
Rabin-Miller, 382 
sieve of Eratosthenes, 370 
principal (a sum of money), 21 
principal axis 
direction, 283 
transformation, 283 
principal ideal, 362 
principal normal 
section, surface, 266 
space curve, 257, 259 
principal quantities, 11 
principal value 
integral, improper, 507 
inverse hyperbolic function, 760 
inverse trigonometric function, 86, 760 
logarithm, 758, 760 
principle 
Cauchy, 666 
Cavalieri, 503 
contracting mapping, 666 
extensionality, 328 
extrapolation principle, 967 
Neumann, 345 
two-values, 323 
prism, 153 
lines, 153 
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regular, 153 quaternions, 290, 293 
probability rules of calculation, 7 
acceptance, 853 scalar, 184, 273 
area interpretation, 812 Hilbert space, 673 
conditional, 810 tensors, dyadic, 282 
definition, 808 vectors 
position of a particle, 596 cross, 184 
total, 810 dot, 184 
probability amplitude, 596 dyadic, 273 
probability integral, 819 properties, 184 
probability measure, 876 quaternions, 291 
ergodic, 880 scalar, 273 
invariant, 876 tensor, 273 
probability paper, 834 vector, 184 
probability theory, 805, 807 programming, 909 
probability vector, 827 continuous dynamic, 943 
problem discrete dynamic, 943 
Cauchy, 572 Bellman functional equation, 944 
Dirichlet, 582, 729 Bellman functional equation method, 946 
discrete, 985 Bellman optimality principle, 945 
eigenvalue, 315 constraint dynamic, 943 
inhomogeneous, 591 constraint static, 943 
linear fitting, system of equations, 958 knapsack problem, 944 
multidimensional problem, 943 
computation of adjustments, 986 purchasing problem, 944 
n-dimensional, 590 state vector, 943 
Neumann, 729 linear, 909 
regularized, 314 Mathematica, 1034 
shortest way, 404 programming, discrete 
Sturm-Liouville, 569 continuous dynamical, 943 
two-body, 574 cost function, 944 
problems, basic decision, 943 
plane trigonometry, 144 space, 943 
spherical trigonometry, 169 dynamic, 943 
process functional equation, 945 
birth process, 829 Bellman, 944 
death process, 829 method, 946 
orthogonalization process, 316 knapsack problem, 944, 947 
Poisson process, 828 minimum interchangeability, 945 
stochastic, 825, 826 minimum separability, 944 
product, 7 n-stage decision process, 943 
algebraic, 419, 420 optimal policy, 946 
Cartesian optimal purchasing policy, 946 
fuzzy sets, 422 purchasing problem, 944 
n fold, 423 state costs, 943 
sets, 331 state vector, 943 
derivative, 434 programming, linear 
direct application, 920 
group, 338 basic solution, 913 
Q algebra, 395 basis of the extreme point, 912 
drastic, 419, 420 basis, inverse, 913 
dyadic, 273 constraints, 909 
inner duality, 919 
continous groups, 353 duality theorems, 919 
Hilbert space, 673 extreme point, 912 
Lie algebra, 358 degenerate, 912 
vectors, 184 feasible set, 910 
Kronecker, 276 forms, 909 
matrices, 272 general form, 909 
complex, 273 Hungarian method, 923 
vanishing, 275 integer programming, 910 
n times direct, 397 maximum point, 910 





product sign, 7 normal form, 913 
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programming (continued) 
northwest corner rule, 921 
objective function, linear, 909 
problem 
assignment, 923 
distribution, 923 
dual, 919 
minimum, 910 
primal, 919 
round-tour, 923 
scheduling, 924 
transportation, 920 
transportation, simplex method, 921 
properties, 911 
simplex method, 913, 914 
revised, 917 
solution point, 910 
variable 
artificial, 916 
basic, 913 
non-basic, 913 
non-negative, 910 
slack, 910 
surplus, 913 
projection 
axonometric, 238 
cabinet, 238, 240 
Cavalier, 238, 240 
central, 237 
isometric, 239 
matrix, 238, 240 
oblique, 238 
orthogonal parallel, 237 
orthographic, 237 
parallel, 237 
oblique, 238, 240 
perspective, 237, 241 
mapping prescription, 241 
vanishing point, 241 
planar, 237 
principal, 238 
stereographic, 303 
projection sides, 143 
projection theorem, orthogonal space, 674 
projector, 686 
proof 
by contradiction, 5 
constructive, 6 
direct, 5 
indirect, 325 
implication, 5 
mathematical induction, 5 
step from n ton 4-1, 5 
proportionality 
direct, 63 
inverse, 66 
proportions, 17 
proposition 
dual, 396 
true or false, 323 
propositional 
calculus, 323 
logic, 323 


expression, 323 
operation, 323 
extensional, 323 

variable, 323 
propositional logic, theorems, 324 
protocol, 832 
pseudo-random numbers, 844 
pseudonorm, 682 
pseudoscalar, 288, 289 
pseudotensor, 287, 288 
pseudovector, 287, 288 
Ptolemy’s theorem, 137 
pulse, rectangular, 774 

bipolar, 791 

Jordan lemma, 757 

unipolar, 786 
pyramid, 154 

frustum, 154 

n faced, 154 

regular, 154 

right, 154 

truncated, 154 
Pythagoras 

general triangle, 143 

right-angled triangle, 142 


QR algorithm, 319, 959 
QR decomposition, 959 
quadrangle, 135, 136 
circumscribing, 136, 137 
concave, 136 
convex, 136 
inscribed, 137 
quadrant relations, trigonometric functions, 79 
quadrant, Cartesian coordinates, 190 
quadratic 
curve, 206 
surface, 228 
quadratic form 
first fundamental, of a surface, 263 
index of inertia, 318 
second fundamental, surface, 266 
quadrature formula, 963 
Gauss type, 965 
Hermite, 964 
integral equation, 630 
interpolation quadrature, 964 
Lobatto type, 966 
quadruple (ordered 4 tuple), 331 
quantification, restricted, 327 
quantifier, 326 
quantile, 812 
quantity, infinitesimal, 494, 500 
quantum number, 597 
energy, 597 
magnetic, 601 
orbital angular momentum, 599 
vibration quantum number, 602 
quartic, 97 
quasiperiodic, 866 
quaternion 
biquaternion, 290 
conjugation, 292 
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definition, 290 
inverse element, 292 
pure quaternion, 291 
unit quaternion, 291 
quaternions, 289 
addition, 292 
algorithm efficiency, 301 
applications, 302 
Cardan angles, 298 
complex matrix, 292 
computer graphic, 302 
conjugation, 298 
division, 293 
Euler angles, 296 
hyperbolic functions, 294 
interpolation, 302 
Lerp, 302 
rotation matrices, 303 
Slerp, 302 
Squad, 303 
Laplace operator, 306 
logarithmic function, 294 
multiplication, 292, 293 
power function, 294 
representation 
complex numbers, 290 
four vector, 290 
matrix, 291 
rigid-body motion, 306 
biquaternions, 306 
rotation matrix, 294 
rotations, 294, 297 
skew field, 290 
subtraction, 292 
trigonometric form, 291 
trigonometric functions, 293 
vector analysis, 305 
queuing, 829 
theory, 829 
quintuple (ordered 5 tuple), 331 
quotient, 1, 12 
derivative, 435 
differential, 432 
set, 334 


R? (3 dimensional Euclidean vector space), 289 


R^ (4 dimensional Euclidean vector space), 289 
radial equation, 599 
radian definition, 131 
radian measure, 131 
radian unit, 1055 
radicals, ordinary differential equation, 546 
radicand, 8 
radius 
circle, 140, 198 
circumcircle, 143 
convergence, 469 
curvature, 248 
curve, 247 
space curve, 258 
curvature, extremal, 616 
polar coordinates, 191 


principal curvature 
surface, 265 
short, 133 
torsion, space curve, 260 
vector, 182 
raising to a power 
complex numbers, 38 
real numbers, 7, 9 
random numbers, 830, 843, 934 
application, 844 
congruence method, 844 
construction, 844 
different distributions, 844 
normal distributed, generation, 934 
pseudo-random, 844 
table, 1139 
uniformly distributed, 843 
random variable, 811 
continuous, 811, 812 
discrete, 811 
independent, 814, 839 
mixed, 811 
multidimensional, 814 
two-dimensional, 839 
random vector 
mathematical statistics, 830 
multidimensional random variable, 814 
random walk process, 846 
range, 48 
operator, 658 
sample function, 832 
statistics, 834 
rank 
matrix, 274 
tensor, 281 
vector space, 367 
rate 
effective, 22 
nominal, 22 
of interest, 22 
ray point, 548 
ray, notion, 129 
Rayleigh-Ritz algorithm, 319 
reaction, chemical, concentration, 117 
real part (complex number), 34 
rebate, 21 
recombination, 933 
intermediary, 933 
reconstruction 
dynamic from time series, 889 
embedding, 890 
immersion, 890 
mapping, 890 
pair sets, 890 
prevalence, 891 
theorem 
Kupka, Smale, 890 
Sauer, Yorke, Casdagli, 891 
Takens, 890 
reconstruction space 
dynamic, 890 
time series, 889 
rectangle, 136 
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rectangular formula, simple, 964 
rectangular sum, 964 
rectification, 108 
Reduce (computer algebra system), 1023 
reduction, 21 
angle, 175 
reduction formula, trigonometric functions, 79 
reflection principle, Schwarz, 741 
reflection, position coordinate, 287 
region, 119 
multiply-connected, 119 
non-connected, 119 
simply-connected, 119 
two-dimensional, 119 
regression 
analysis, 839 
coefficient, 841 
line, 841 
linear, 841 
multidimensional, 842 
regula falsi, 951 
regularity condition, 925, 941 
regularization method, 321 
regularization parameter, 314 
relation, 331 
binary, 331 
congruence relation, 394 
equivalence relation, 333 
fuzzy-valued, 422 
inverse, 332 
less or equal than (X relation), 326 
matrix, 331 
n ary, 331 
n place, 331 
order relation, 333 
product, 332 
uncertainty, 596 
relaxation method, 961 
relaxation parameter, 961 
reliability testing, 829 
relief, analytic function, 732 
remainder 
estimation, 466 
series, 459 
term, 459 
Remes algorithm, 990 
rent, 25 
ordinary, constant, 25 
representation function 
explicit form, 49 
implicit form, 49 
parametric form, 50 
representation of groups, 340 
adjoint, 342 
direct product, 344 
direct sum, 344 
equivalent, 342 
faithful, 342 
identity, 342 
irreducible, 344 
non-equivalent, 342 
particular, 342 
properties, 342 





reducible, 344 
reducible, complete, 344 
representation matrix, 340 
representation space, 340 
subspace, 343 
true, 342 
unitary, 342 
representation theorem (fuzzy logic), 417 
representative, 362 
resection 
Cassini, 150 
Snellius, 149 
residual spectrum, 681 
residual sum of squares, 313, 959 
residual vector, 313 
residue, 313, 958 
quadratic modulo m, 379 
complex function, 753 
theorem, 754 
residue class, 377 
addition, 377 
multiplication, 377 
primitive, 378 
relatively prime, 378 
ring, 361, 377 
modulo m, 378 
residue theorem, 752 
application, 755 
resolvent, 626, 628, 645, 680 
set, 680 
resonance torus, 904 
reversing the order of the bits, 995 
rhombus, 136 
Riemann 
formula, 584 
function, 584 
integral, 494 
comparison with Lebesgue integral, 506 
comparison with Stieltjes integral, 506 
method, 584 
sum, 494 
surface, many-sheeted, 745 
theorem, 463 
right-hand coordinate system, 209 
right-hand rule, 184, 722 
right-screw rule, 722 
right-singular vector, 321 
rigid-body motion, 358 
biquaternions, 306 
Chasles theorem, 359 
quaternions, 306 
screwing motion, 359 
ring, 361 
division ring of quaternions, 290 
factor ring, 363 
homomorphism, 362 
theorem, 362 
isomorphism, 362 
subring, 362 
risk theory, 21 
Ritz method, 618, 974 
robotics, 358 
Romberg method, 966 
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root 
complex function, 733 
complex number, 38 
equation, 43 
non-linear, 949 
notion, 8 
N-th of unity, 994 
real number, 8 
square root, complex, 737 
theorem of Vieta, 44 
root locus theory, 954 
root test of Cauchy, 461 
rotation 
angle, 215 
arbitrary zero point axis, 296 
around an arbitrary space axis, 232 
Cardan angles, 295, 298 
coordinate axis, 191 
coordinate system, 212 
definition, 713 
different coordinates, 714 
direction cosines, 213 
Euler angles, 296 
mapping, 877 
object, 213, 229 
around the coordinate axis, 295 
potential field, 715 
remark, 709 
vector components, 718 
vector field, 713 
rotation determinant, 213 
transformation properties, 214 
rotation field 
pure, 727 
zero-divergence field, 727 
rotation invariance, 283, 284 
rotation invariance, cartesian tensor, 283 
rotation matrix, 191, 212, 275 
arbitrary zero point axis, 296 
Cardan angles, 295, 298 
coordinate transformation, 231 
Euler angles, 296 
orthogonal, 281 
quaternions, 294, 297 
space, 212, 215, 216 
rotation number, 906 
rotations 
coordinate system, 212 
geometric 3D transformation, 234 
geometric, 2D transformation, 229 
object, 213 
quaternions, 297 
rotator, rigid, 598 
round-off, 1005 
error, 1007 
measurement, 848 
error method, 1007 
row echelon form, system of linear equations, 312 
row sum criterion, 960 
Ruelle-Takens-Newhouse scenario, 904 
rule, 971 
Adams and Bashforth, 971 
Bernoulli-l’ Hospital, 56 


Cartesian rule of signs, 953 
Cramer, 311 
De Morgan, 324 
Descartes, 45 
Guldin, first rule, 506 
Guldin, second rule, 506 
linguistic, 428 
Milne, 971 
Sarrus, 279 

ruled surface, 268 

rules 
calculation determinants, 278 
calculation matrices, 275 
calculation quaternions, 292 
composition, 424 
differentiation 

one variable, 433 
several variables, 450 

divisibility, elementary, 370 
integration (general), 482 
Neper, 170 
rank, 274 

Runge-Kutta method, 969 


saddle, 864, 871 
saddle form, 267 
saddle point 
differential equation, 548 
Lagrange function, 925 
Sage (computer algebra system), 1023 
sample, 814, 830 
unction, 831 
random, 830 
size, 814 
summarizing, statistical, 832 
variable, 830, 831 
Sarrus rule, 279 
scalar 
invariant, 281 
notion, 181 
scalar field, 702 
axial field, 703 
central field, 702 
coordinate definition, 703 
directional derivative, 708 
gradient, 710 
plane, 702 
scalar matrix, 270 
scalar product, 184, 367 
bilinear form, 368 
Hilbert space, 673 
quaternions, 291, 293 
representation in coordinates, 187, 189 
rotation invariance property, 288 
two functions, 985 
vectors, 273 
scale 
cartography, 145 
equation, 115 
factor, 115, 147 
logarithmic, 115 
notion, 115 
semilogarithmic, 116 
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scenario, Ruelle- Takens-Newhouse, 904 
Schauder fixed-point theorem, 691 
scheme 
Falk, 273 
Young, 345 
Schmidt, orthogonalization method, 637 
Schoenflies symbolism, 346 
Schroedinger equation 
central field, 598 
angular dependence, 601 
azimutal equation, 600 
polar equation, 600 
radial equation, 599 
solution, 599 
linear, 592 
linear harmonic oscillator, 601 
non-linear, partial, 604, 606 
spherical von Neumann functions, 565 
time-dependent, 593 
time-independent, 594 
Schur's lemma, 345 
Schwarz 
exchange theorem, 448 
reflection principle, 741 
Schwarz-Buniakowski inequality, 673 
Schwarz-Christoffel formula, 739 
Scilab, 1011 
screw 
left, 260 
Lie algebra, 360 
right, 260 
screw motion 
Chasles theorem, 359 
rigid-body motion, 359 
search procedure, numerical, 930 
secant, 139 
hyperbolic, 89 
theorem, 139 
trigonometric, 78 
geometric definition, 131 
secant-tangent theorem, 139 
section 
normal (surface), 265 
priciple normal (surface), 265 
section, golden, 2, 4, 194, 908 
sector formula, 725 
sector, spherical, 158 
segment 
normal, 245 
notion, 129 
polar normal, 245 
polar tangent, 245 
tangent, 245 
selection, 933 
self-similar, 883 
semantically equivalent, 324 
expressions, 398 
semi-linear, 369 
semi-monotone, 671 
semi-norm, 682 
semifocal chord 
ellipse, 199 
hyperbola, 201 


parabola, 71, 204 
semigroup, 336 
free, 336 
semilogarithmic paper, 116 
semiorbit, 857 
sense class, 134 
of a figure, 133 
sensitive, with respect to the initial values, 889 
sentence, 323 
separable sets, 683 
separation 
constant, 597 
theorems (convex sets), 683 
variables, 542, 579, 597 
separatrix 
loop, 868, 903 
surface, 866, 872 
sequence, 457 
bounded, 656 
Cauchy sequence, 665 
convergente, 664 
finite, 655 
in metric space, 664 
infinite, 457 
numbers, 457 
bounded, 457, 656 
bounded above, 458 
bounded below, 458 
convergence, 458 
converging to zero, 656 
divergence, 458 
finite, 655 
law of formation, 457 
limit, 458 
monotone, 457 
term, 457 
series, 18, 457 
alternating, 463 
arithmetic, 18 
Banach space, 670 
Clebsch-Gordan, 345 
comparison criterion, 460 
constant term, 459 
convergence, 459, 462 
absolute, 469 
non-uniform, 468 
theorems, 459 
uniform, 468, 469 
definite, 18 
divergence, 459, 462 
expansion, 471 
area functions, 1061 
binomial, 1057 
exponential functions, 1057 
Fourier, 474 
hyperbolic functions, 1061 
inverse trigonometric functions, 1060 
Laplace transformation, 779 
Laurent, 752 
logarithmic functions, 1059 
Maclaurin, 472 
power series, 469 
Taylor, 442, 471 
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series (continued) fractal, 888 
trigonometric functions, 1058 stable, 859 
finite, 18 irrational numbers, 2 
Fourier, 474 linear, 655 
complex representation, 475 manyness, 335 
function, 467 measurable, 694 
domain of convergence, 467 natural numbers, 1 
general term, 459 non-denumerable infinite, 335 
geometric, 19 notion of set, 327 
infinite, 19, 459 open, 663 
harmonic, 459 operation 
hypergeometric, 567 Cartesian product, 331 
infinite, 457, 459 complement, 328 
Laurent, 752 difference, 330 
Maclaurin, 472 intersection, 328 
Neumann, 626, 678 union, 328 
partial sum, 459 operations, 328 
power, 469 order-bounded, 660 
expansion, 471 original space, 49 
inverse, 471 power, 335 
remainder, 459, 467 power set, 328 
sum, 459 quotient set, 334 
Taylor, 442, 471 rational numbers, 1 
test real numbers, 2 
Cauchy, integral test, 462 relative compact, 686 
Cauchy, root test, 461 subset, 328 
D'Alembert's ratio test, 461 theory, 327 
Leibniz, alternating series test, 463 universal, 329 
uniformly convergent void, 328 
continuity, 468 set algebra, 693 
differentiation, 469 sets, 327 
integration, 469 coordinates z, y, 331 
properties, 468 difference, symmetric, 330 
Weierstrass criterion, 468 operation 
sesquilinear form, 369 symmetric difference, 330 
hermitian, 369 sexagesimal degree, 131 
set, 327 shearing, transformation, 230 
absorbing, 859 shift mapping, 880 
algebra, fundamental laws, 329 shift register, linear, 364 
axioms of closed sets, 664 shooting method, 975 
axioms of open sets, 664 simple, 975 
Borel, 694 shore-to-ship bearing, 172 
bounded in metric space, 664 side condition, 614 
cardinality, 335 side or leg of an angle, 129 
closed, 664 Sierpinski 
closure, 665 carpet, 884 
compact, 686, 860 gasket, 884 
complex numbers, 34 sieve of Eratosthenes, 370 
convex, 657 c additivity, 694 
dense, 665 c algebra, 694 
denumerable infinite, 335 Borelian, 694 
disjoint, 329 sign of a function, 50 
element, 327 signal, 800 
empty, 328 analysis, 800 
equality, 328 synthesis, 800 
equinumerous, 335 signature, universal algebra, 394 
fundamental, 329, 674 significance, 839 
fuzzy, 413 level, 836 
image, 49 level of type 1, 813 
infinite, 335 similarity transformation, 316, 317 
integers, 1 simplex 
invariant, 859 method, 913, 914 





chaotic, 888 revised, 917 
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pivot element, 915 
secondary program, 917 
step 
revised, 918 
tableau, 914 
degenerate case, 916 
non-generate case, 915 
revised, 917 
variable 
artificial, 916 
Simpson's formula, 965 
simulation 
digital, 843 
Monte Carlo, 843 
sine 
hyperbolic, 89 
geometric definition, 132 
trigonometric, 76 
geometric definition, 131 
sine law, 143 
sine integral, 513, 756 
sine law, 166 
sine-cosine law, 166 
polar, 166 
sine-Gordon equation, 607 
sine-Gordon equation (SG), 604 
single-target method, 975 
singleton, 416 
singular value, 321, 880 
decomposition, 321 
singularity 
analytic function, 733 
essential, 733, 753 
isolated, 752 
pole, 753 
removable, 733 
sink, 864, 871 
vector field, 712 
vertex, 411 
sinusoidal amount, 84 
skew field 
quaternions, 289 
rings, 361 
Slater condition, 926 
slide rule, 10 
logarithmic scale, 115 
slope 
plane, 195 
tangent, 245 
small circle, 160, 176 
arclength, 177 
course angle, 177 
intersection point, 177 
radius, plane, 176 
radius, spherical, 176 
smoothing 
continuous problem, 984 
error propagation, 856 
parameter, 998 
spline, 997 
cubic, 997 
Sobolev space, 671 
solid angle, 152 


soliton 
antikink, 608 
antisoliton, 606 
Boussinesq, 609 
Burgers, 609 
differential equations, partial, non-linear, 603 
dissipative, 604 
Hirota, 609 
interaction, 604 
Kadomzev-Pedviashwili, 609 
kink, 607 
lattice, 608 
kink-antikink, 608 
collision, 608 
doublet, 608 
kink-kink 
collision, 608 
Korteweg de Vries, 605 
non-linear, Schroedinger, 606 
SOR method (successive over-relaxation), 961 
source, 864, 871 
distribution 
continuous, 728 
discrete, 728 
field 
irrotational, 726 
pure, 726 
vector field, 712 
vertex, 411 
space 
abstract, 49 
Banach, 670 
complete, metric, 665, 666 
directions, 212 
finite-dimensional, 686 
fundamental, 699 
higher-dimensional, 118 
Hilbert, 673 
infinite-dimensional, 657 
isometric, 669 
Kantorovich, 661, 686 
linear, 654 
LP space, 697 
metric, 662 
completion, 668 
convergence of sequence, 664 
normable, 670 
separable, 665 
non-reflexive, 684 
normed 
axiom, 669 
properties, 670 
ordered normed, 671 
orthogonal, 674 
reflexive, 684 
Riesz, 660 
second adjoint, 684 
separable, 665 
Sobolev, 671 
unitary, 673 
vector, 654 
space curve, 256 
binormal, 257, 259 
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coordinate equation, 259 
curvature, 258 
direction, 256 
equation, 218, 256 
moving trihedral, 257 
normal plane, 257, 259 
osculating plane, 257, 259 
principal normal, 257, 259 
radius of curvature, 258 
radius of torsion, 260 
tangent, 257, 259 
torsion, 260 
vector equation, 256, 259 
space inversion, 287 
mixed product, 289 
scalar product, 288 
spectral radius, 678 
spectral theory, 680 
spectrum, 787 
amplitude, 787 
continuous, 681 
frequency, 787 
linear operator, 680 
phase, 787 
sphere, 158 
ellipsoid, 224 
equation, three forms, 261 
spherical 
biangle, 163 
coordinates, 211 
diagon, 163 
distance, 160 
field, 702 
helix, 178 
lune, 163 
spherical functions, 564 
complex, 564 
spinor, 290 
spiral, 105 
Archimedean, 105 
hyperbolic, 105 
logarithmic, 106, 251 
spline, 996 
basis spline, 998 
bicubic, 998 
approximation, 1000 
interpolation, 998 
net points, 998 
cubic, 996 
interpolation, 996 
smoothing, 997 
interpolation, 982, 996 
natural, 996 
normalized B-spline, 998 
periodic, 996 
smoothing, 997 
splitting field, polynomials, 363 
spur 
matrix, 270 
tensor, 284 
square, 136 
stability 
absolutely stable, 972 


first approximation, 864 


integration of differential equation, 972 


Lyapunov stability, 863 
orbital, 863 
periodic orbit, 864 


perturbation of initial values, 972 
round-off error, numerical calculation, 1007 


structural, 873, 874 
stability theory, 863 

classification, 864 
standard 

deviation, 813, 852, 854 

error, 851 

normal distribution, 819 
state 

degenerate, 598 

particle, 592 

space, stochastic, 825, 826 

stationary, 594 


steady, dynamical system, 857 


statistical analysis, 832 
statistical summarization, 832 
statistics, 805 
descriptive, 832 
estimate, 831 
mathematical, 805, 830 
sample function, 831 
steady state, 857 
Steffensen method, 951 
step 
from n ton+1, 5 
function, 757, 774, 794 
functions, 804 
interval parameter, 962 
size, 969 
change, 970 
steradian, 152, 1055 
stereometry, 151 
Stieltjes integral, 506, 686 


comparison with Riemann integral, 506 


notion, 506 
Stieltjes transformation, 768 
Stirling formula, 515 
stochastic 

basic notions, 826 

chains, 826 

process, 825 

processes, 826 
stochastics, 805 
Stokes, integral theorem, 725 
strangling, 282 
stream function, 741 
strip, characteristic, 573 
strophoide, 97 
structure 

algebraic, 323 

classical algebraic, 335 
Sturm 

chain, 44 

function, 44 

sequence, 954 

theorem, 45 
Sturm-Liouville problem, 569 
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subdeterminant, 278 
subdomain, 998 
subgraph, 402 
induced, 402 
subgroup, 337 
criterion, 337 
cyclic, 337 
invariant, 338 
normal, 338 
trivial, 337 
subinterval, 494 
subnormal, 245 
subring, 362 
trivial, 362 
subset, 328, 655 
affine, 655 
linear, 655 
open, 663 
subspace, 367 
affine, 655 
criterion, 367 
invariant, representation of groups, 343 
subtangent, 245 
subtraction 
complex numbers, 36 
computer calculation, 1005 
polynomials, 11 
quaternions, 292 
rational numbers, 1 
subtractive cancellation, 1005 
sum, 6 
algebraic, 419 
bounded, 420 
drastic, 419, 420 
of the digits 
alternating, first order, 372 
alternating, second order, 372 
alternating, third order, 372 
first order, 372 
second order, 372 
third order, 372 
residual squares, 986 
Riemann, 494 
rules of calculation, 6 
summation sign, 6 
transverse, 372 
vectors, 182 
summarization, statistical, 832 
summation convention, Einstein's, 280 
superposition 
fields, law, 728 
linear, 595 
non-linear, 606 
oscillations, 84 
principle, 595, 744 
differential equation, linear, 560 
differential equations, higher order, 554 
supplementary angle formulas, 80 
support 
compact, 802 
measure, 876 
membership function, 413 
supporting functional, 683 


supremum, 660 
surface, 261 
area, double integral, 527 
B-B representation, 1000 
barrel, 159 
block, 153 
cone, 157 
conical, 157, 218 
constant curvature, 267 
cube, 154 
curvature of a curve, 265, 267 
cylinder, 156 
cylindrical, 156, 218 
developable, 268 
element, 265 
element, vector components, 719 
equation, 228 
in normal form, 224 
space, 217 
equation, space, 261 
first quadratic fundamental form, 263 
Gauss curvature, 267 
geodesic line, 268 
harmonic, 601 
integral, 532, 722, 723 
first type, 532 
general form, 537 
second type, 535, 536 
line element, 263 
line of curvature, 266 
metric, 265 
minimal, 267 
normal, 262-264 
normal vector, 262 
oriented, 535 
patch, area, 265 
polyhedron, 153 
principal normal section, 266 
pyramid, 154 
quadratic, 228 
radius of principal curvature, 265 
rectangular parallelepiped, 153 
rectilinear generator, 226 
representation with splines, 996 
rotation symmetric, 218 
ruled, 268 
second order, 224, 228 
central surfaces, 228 
invariant signs, 229 
types, 228 
second quadratic fundamental form, 266 
sphere, 158 
tangent plane, 262 
torus, 159 
transversal, 866 
switch 
algebra, 395, 399 
function, 399 


value, 399 
Sylvester, theorem of inertia, 318 
symbol 


internal representation (computer), 1001 
Kronecker, 283 
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Landau, 57 
Legendre, 379 
symmetry 
axial, 134 
central, 133 
element, 346 
Fourier expansion, 476 
group, 346 
applications in physics, 351 
crystallography, 348 
molecules, 347 
quantum mechanics, 350 
operation, 346 
crystal lattice structure, 349 
improper orthogonal mapping, 346 
reflection, 346 
rotation, 346 
without fixed point, 346 
with respect to a line, 134 
system 
Canonical, 573 
chaotic according to Devaney, 889 
cognitive, 428 
complete, 676 
dynamical, 857 
chaotic, 889 
conservative, 858 
continuous, 857 
C7-smooth, 857 
discrete, 858 
dissipative, 858 
ergodic, 877 
invertible, 857 
mixing, 877 
motion, 857 
reconstruction space, 890 
reconstruction theorem, 890 
time continuous, 857 
time discrete, 857 
time dynamical, 858 
volume decreasing, 858 
volume preserving, 858 
volume shrinking, 858 
four points, 217 
generators, 337 
knowledge based interpolation, 430 
mixing, 889 
Morse-Smale, 875 
normal equations, 313 
orthogonal, 674 
orthonormal, 674 
term-substitutions, 394 
trigonometric, 674 
system of algebraic equations, 39 
extraneous roots, 39 
irrational equations, 39 
vanishing denominator, 39 
system of linear equations 
compatible, 309 
consistent, 309 
existence of the solution, 312 
fundamental system, 309 
homogeneous, 308 





inconsistency, 310 
inconsistent, 309 
inhomogeneous, 308 
linear, 307 
numerical solution, 955 
Cholesky method, 958 
direct method, 955 
Gauss algorithm, 956 
iteration, 955 
iteration methods, 960 
least squares method, 955 
orthogonalization method, 958 
over-determined, 955, 958 
row echelon form, 955 
triangular decomposition, 957 
under-determined, 955 
over-determined, 313 
pivoting, 307, 310 
row echelon form, 312 
solution, 308 
solvability, 309, 310 
trivial solution, 309 
system of non-linear equations 
numerical solution, 955, 961 
Gauss-Newton method, 962 
iteration method, 961 
Newton’s method, 962 
system of polynomial equations, 39 
systems of differential equations, first-order linear 
homogeneous systems, 559 
inhomogeneous systems, 559 
systems of differential equations, linear 
constant coefficients, 558 
systems of partial differential equations 
canonical systems, 573 
first-order, 573 
normal system, 574 


table with double entry, 120 
tacnode, curve, 250 
tangent 
circle, 199 
formula, 143 
hyperbolic, 89 
geometric definition, 132 
law, 143 
plane, 160, 264, 448 
surface, 262 
plane curve, 244 
polygon, 138 
space curve, 257, 259 
trigonometric, 77 
geometric definition, 131 
tangential 
element, 357 
vector, 357 
tautology 
Boolean function, 397 
predicate logic, 327 
propositional logic, 325 
Taylor 
expansion, 57, 442, 471 
one variable, 471 
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Taylor (continued) 
several variables, 450 
vector function, 702 
formula, 442 
several variables, 450 
two variables, 449 
series, 442, 471 
analytic function, 751 
one variable, 471 
several variables, 450 
theorem, 442 
several variables, 449 
telegraphic equation, 585 
telephone-call distribution, 829 
tensor, 280 
addition, subtraction, 287 
alternating, 284 
antisymmetric, 283 
components, 281 
contraction, 282, 287 
definition, 281 
dyadic product, 282 
eigenvalue, 283 
generalized Kronecker delta, 283 
inertia, 282 
invariant, 284 
linear transformation, coordinates, 280 
multiplication, 287 
oversliding, 287 
product, 282 
vectors, 273 
rank 0, 1,2, n, 281 
rules of calculation, 282, 283 
skew-symmetric, 283, 287 
spur, 284 
symmetric, 283, 287 
tension, 282 
trace, 284 
transformation invariance, 283 
tensor product approach, 999 
tent mapping, 876 
term algebra, 395 
test 
x? test, 835 
goodness of fit, 834 
hypothesis, 839 
independence, two variables, 840 
prime number 
AKS, primality, 383 
Fermat, 382 
Lucas-Lehmer, 371 
Rabin-Miller, 382 
sieve of Eratosthenes, 370 
statistical, 834 
test problem, linear, 972 
tetragon, 136 
tetrahedron, 155, 217 
Thales theorem, 140, 142 
theorem 
Abel (power series), 469 
Afraimovich-Shilnikov, 904 
alternating point, 988 
Andronov-Pontryagin, 874 


theorem (continued I) 


Andronov-Witt, 864, 865 
Apollonius, 200 
Arzela-Ascoli, 686 
Baire category, 666 
Banach 
continuity, inverse operator, 679 
fixed-point, 689 
Banach-Steinhaus, 678 
Bayes, 810, 811 
Berge, 409 
binomial, 12 
Birkhoff, 395, 877 
Block-Guckenheimer-Misiuriewicz, 889 
Bolzano 
one variable, 61 
several variables, 124 
boundedness of a function 
one variable, 61 
several variables, 124 
Cauchy integral theorem, 747 
Cayley, 339, 408 
center manifold 
differential equations, 892 
mappings, 897 
Chasles, 359 
Chebyshev, 489 
Chinese remainder, 379 
Clebsch-Gordan, 345 
closed graph, 678 
constant analytic function, 733 
convergence, measurable function, 697 
decomposition, 334 
Denjoy, 906 
differentiability, respect to initial conditions, 857 
Dirac, 406 
Douady-Oesterlé, 886 
Euclidean algorithm, 374 
Euler (polyeder), 155 
Euler (theory of numbers), 381 
Euler-Hierholzer, 405 
Fatou, 697 
Fermat, 381, 441 
Fermat-Euler, 381 
fixed-point 
Banach, 666, 689 
Brouwer, 691 
Schauder, 691 
Floquet, 863 
fundamental integral calculus, 511 
fundamental of elementary number theory, 371 
fundamental theorem of algebra, 364 
Girard, 165 
Grobman-Hartman, 871, 873 
Hadamard-Perron, 867, 872 
Hahn (extension theorem), 683 
Hellinger-Toeplitz, 678 
Hilbert-Schmidt, 688 
Holladay, 997 
Hurwitz, 556 
intermediate value 
one variable, 61 
several variables, 124 
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theorem (continued IT) 
KAM (Kolmogorov-Arnold-Moser), 875 
Krein-Losanovskij, 679 
Kupka, Smale, 890 
Kuratowski, 410 
Lagrange, 338 
Lebesgue, 697 
Ledrappier, 886 
Leibniz, 463 
Lerey-Schauder, 692 
Levi, B., 697 
limits 
functions, 458 
sequences of numbers, 458 
Lindeberg-Levy, 825 
Liouville, 733, 861 
Lyapunov, 863 
maximum value, analytic function, 733 
Meusnier, 265 
monotone convergence, 697 
nested balls, 666 
Ore, 406 
Oseledec, 880 
Palis-Smale, 875 
Picard-Lindelóf, 668, 857 
Poincaré-Bendixson, 865 
Posa, 406 
Ptolemy's, 137 
Pythagoras 
general triangle, 143 
orthogonal space, 674 
right-angled triangle, 142 
Radon-Nikodym, 697 
Riemann, 463 
Riesz, 682 
Riesz-Fischer, 676 
Rolle, 441 
Sauer, Yorke, Casdagli, 891 
Schauder, 687 
Schwarz, exchange, 448 
Sharkovsky, 889 
Shilnikov, 903 
Shinai, 889 
Shoshitaishvili, 892 
Smale, 902 
stability in the first approximation, 864 
Sturm, 45 
superposition law, 728 
Sylvester, of inertia, 318 
Takens, 890 
Taylor, 442 
one variable, 471 
several variables, 449 
Thales, 140, 142 
total probability, 810 
Tutte, 409 
variation of constants, 862 
Vieta, 44 
Wedderburn, 361 
Weierstrass, 468, 665 
one variable, 61 
several variables, 125 
Whitney, 890 


theorem (continued II) 
Wilson, 381 
Wintner-Conti, 860 
Young, 884, 885 
theorems 
Euclidean, 142 
propositional logic, 324 
theory 
distribution, 774 
elementary number, 370 
ergodic dynamical systems, 877 
field, 729 
function, 731 
graph, algorithms, 401 
probability, 805 
risk, 21 
set, 327 
spectral, 680 
vector fields, 701 
theta function, 764 
time frequency analysis, 803 
tolerance, 417 
topological 
conjugation, 870 
equivalence, 870 
torsion, 236 
torsion, space curve, 260 
torus, 159, 888, 889 
differential equation, linear, autonomous, 863 
dissolving, 904 
formation, 900, 905 
invariant set, 866 
losing smoothness, 904 
resonance torus, 904 
trace 
matrix, 270 
tensor, 284 
tractrix, 108 
trail, 404 
Euler trail, 405 
open, 405 
trajectory, 857 
transform, 767 
transformation 
algebraic expression, identical, 11 
cyclic permutations, 142 
invariance, cartesian tensor, 283 
iterative methods, 319 
linear, mappings, operators, homomorphisms, 658 
orthogonal coordinates, 212 
orthogonal-similarity, eigenvalue problem, 317 
principal axes 
eigenvalue problem, 317 
quadratic form, 317 
tensor, 283 
similarity, eigenvalue problem, 316, 317 
transformation (coordinates, plane) 
parallel translation, 191 
rotation, coordinate system, 191 
transformation (coordinates, space) 
basic transformations, 231 
parallel translation, 212 
rotation, coordinate system, 212 
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translation, rotation, etc., 231 
transformation (geometric), 234, 295 

affine, 230 

basic, 231 

bending, 237 

composition, 232 

contraction, 236 

coordinates, 229, 295 

deformations, 236 

mirror reflection, 230, 233 

object, 229 

properties, 230 

rotation (turning), 229, 234 

scaling, 230, 234 

shearing, 230, 234 

torsion, 236 

translation, 229, 234 
ransformation (geometric, 2D), 229 
ransformation (geometric, 3D), 233 

rotation of the object, 213 
ransformation (groups) 

affine, 2-dimensional space, 355 

covering, 336 
ransformation (plane) 

Cartesian «» polar coordinates, 192, 453 
ransformation (space) 

Cartesian + cylindrical coordinates, 212 

Cartesian + spherical coordinates, 212 
ransition 

matrix, probability, 827 

matrix, stochastic process, 826 

probability, stochastic, 826 
ransitivity, 29 
ranslation 

invariance, 283, 284 

invariance, cartesian tensor, 283 

primitive, 348 
ransport, network, 411 
ransposition law, 325 
rapezoid, 136 

Hermite’s trapezoidal formula, 965 
rapezoidal 

formula, 964 

sum, 964 
raversing, 148 
ree, 407 

hight, 407 

ordered binary, 407 

regular binary, 407 

rooted, 407 

spanning, 407, 408 

minimum, 408 

triangle 

altitude, 133 

area, 194 

bisector, 133 

center of gravity, 133 

circumcircle, 133 

congruent, 134 

coordinates, 982 

equilateral, 133 

Euler, 164 

incircle, 133 





inequality, 182 
axioms of norm, 276 
complex numbers, 30 
metric space, 662 
norm, 669 
real numbers, 30 
unitary space, 673 
isosceles, 133 
median, 133 
orthocenter, 133 
Pascal, 13 
plane, 132 
area, 142, 144 
basic problems, 144 
Euclidean theorems, 142 
general, 142 
incircle, 144 
radius of circumcircle, 143 
right-angled, 133, 142 
tangent formula, 143 
polar, 164 
similar, 134 
spherical, 163 
basic problems, 169 
calculation, 169 
Euler, 164 
oblique, 171 
right-angled, 169 
riangular 
decomposition, 956 
matrix, 271 
lower, 27 
upper, 271 
riangularization, FEM (finite element method), 979 
riangulation, geodesy 
first fundamental problem, 148 
second fundamental problem, 149 
rigonometry 
plain, 142 
spherical, 160 
rihedral 
angle, 152, 164 
moving, 256, 257 
riple (ordered 3 tuple), 331 
riple integral, 527 
application, 532 
rochoid, 102 
runcation, measurement error, 848 
ruth 
compositions of propositions, 323 
function, 323, 324 
conjunction, 323 
disjunction, 323 
equivalence, 323 
implication, 323 
NAND function, 325 
negation, 323 
NOR function, 325 
table, 323 
true or false, 323 
value, 323 
turbulence, 858, 904 
Tutte theorem, 409 
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two lines, transformation, 207 
two-body problem, 574 
type, universal algebra, 394 


umbilical point, 267 
uncertainty 
absolute, 852 
fuzzy, 413 
quantum mechanical, 596 
relation, 596 
relative, 852 
uncertainty relation, 596 
ungula, cylinder, 156 


union 
fuzzy sets, 418, 419 
sets, 328 


units, imaginary 
generalized i, j, κ, 290 
units, physical 
currently accepted for use with SI, 1056 
international system SI, 1055 
additional units, 1055 
basic units, 1055 
universal quantifier, 326 
universal substitution, 491 
urn model, 814 


vagueness, 413 
valence 
in-valence, 401 
out-valence, 401 
value 
expected, measurement, 813 
system (function of several variables), 118 
true, measurement, 850 
van der Pol differential equation, 895 
variable 
Boolean, 397 
bound variable, predicate logic, 326 
dependent, 48, 307 
free, predicate logic, 326 
independent, 48, 118, 307 
linguistic, 414 
propositional, 323 
random, 811, 825 
slack, 910 
variance, 813 
distribution, 813 
sample function, 831 
statistics, 834 
two-dimensional distribution, 840 
variation 
function, 61 
of constants, method, 554 
variation of constants 
differential equation, linear, 560 
theorem, 862 
variational calculus, 610 
auxiliary curve, 612 
comparable curve, 612 
Euler differential equation, 612 
side-condition, 610 
variational equation, 864, 881, 979 





equilibrium point points, 871 
variational problem, 610, 974 
brachistochrone problem, 611 
Dirichlet, 617 
extremal curves, 611 
first order, 610 
first variation, 619 
functional, 611 
higher order, 610 
higher order derivatives, 614 
isoperimetric, general, 611 
more general, 618 
numerical solution, 618 
direct method, 618 
finite element method, 619 
gradient method, 619 
Ritz method, 618 
parametric representation, 610, 615 
second variation, 619 
several unknown functions, 615 
side conditions, 614 
simple 
one variable, 611 
several variable, 617 
variety, 395 
vector 
absolute value, 181 
affine coordinates, 183 
axial, 181 
reflection behavior, 287 
base, 187 
base vector 
reciprocal, 186 
bound, 181 
Cartesian coordinates, 183 
column, 271 
components, 707 
conjugate, 932 
coordinates, 183 
Darboux vector, 261 
decomposition, 183 
diagram, oscillations, 85 
differentiation rules, 701 
direction in space, 181 
direction, vector triple, 181 
directional coefficient, 184 
expansion coefficient, 184 
field, 701 
free, 181 
left-singular, 321 
length, 190 
line, 708 
magnitude, 181 
matrix, 271 
metric coefficients, 186 
notion, 181 
null vector, 182 
polar, 181 
reflection behavior, 287 
pole, origin, 182 
position vector, 182 
complex-number plane, 34 
probability, 827 
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vector (continued) 
radius vector, 182 
complex-number plane, 34 
random, 830 
reciprocal, 187 
reciprocal basis vectors, 187 
residual, 313 
right-singular, 321 
row, 271 
scalar invariant, 716 
sliding, 181 
space, 655 
stochastic, 826 
unit, 181 
zero vector, 182 
vector algebra, 181 
geometric application, 190 
notions and principles, 181 
vector analysis, 701 
quaternions, 305 
vector equation 
line, 189 
plane, 189 
space curve, 256, 259 
vector equations, 188 
vector field, 704, 857 
Cartesian coordinates, 706 
central, 705 
circular field, 705 
components, 707 
contour integral, 721 
coordinate definition, 706 
cylindrical, 705 
cylindrical coordinates, 706 
directional derivative, 708 
divergence, 712 
point-like source, 727 
rotation, 713 
sink, 712 
source, 712 
spherical, 705 
spherical coordinates, 706 
vector function, 701, 704 
derivative, 701 
differentiation, 701 
hodograph, 701 
linear, 286 
scalar variable, 701 
Taylor expansion, 702 
vector gradient, 711 
vector iteration, 319, 321 
vector lattice, 660 
homomorphism, 661 
vector potential, 727 
vector product, 184 
hints, 273 
Lie algebra, 360 
quaternions, 291, 293 
representation in coordinates, 188 
vector space, 365, 654 
all null sequences, 656 
bounded sequences, 656 


B(T), 656 


C((a, b]), 656 
CC? ([a, 0]), 656 
complex, 655 
convergent sequences, 656 
Euclidean, 367 
R? (3 dimensional), 289 
R? (4 dimensional), 289 
R" (n dimensional), 272 
finite sequence of numbers, 655 
F”, 655 
F(T), 656 
functions, 656 
infinite-dimensional, 366, 657 
LP. 697 
12. 656 
n dimensional, 272, 366 
ordered by a cone, 659 
partial ordering, 659 
real, 365, 655 
s of all sequences, 655 
sequences, 655 


vector subspace 


stable, 867, 872 
unstable, 867, 872 


vectors, 181, 271 


angle between, 190 
collinear, 182 
collinearity, 185 
commutative law, 273 
coplanar, 182 
cyclic permutation, 209 
double vector product, 185 
dyadic product, 273 
equality, 181 
Lagrange identity, 186 
linear combination, 183, 185 
mixed product, 185, 188 
Cartesian coordinates, 186 
orthogonality, 185 
products 
affine coordinates, 186 
Cartesian coordinates, 186 
products, properties, 184 
scalar product, 273 
Cartesian coordinates, 186 
representation in coordinates, 187 
sum, 182 
tensor product, 273 
triple product, 185 
vector product 
representation in coordinates, 188 





Venn diagram, 328 
verification, proof, 5 
vertex 





angle, 129 
degree, 401 
ellipse, 199 
graph, 401 
hyperbola, 201 
initial, 401 
isolated, 401 
level, 407 


Index 1207 





parabola, 204 
plane curve, 250 
sink, 411 
source, 411 
terminal, 401 
vertices, distance, 404 
vibration, differential equation 
bar, 580 
round membrane, 581 
string, 579 
Vieta, root theorem, 44 
Volterra integral equation, 621, 667 
first kind, 643 
second kind, 645 
volume 
barrel, 159 
block, 153 
cone, 157 
cube, 154 
cylinder, 156 
double integral, 527 
element, vector components, 719 
hollow cylinder, 157 
obelisk, 155 
parallelepipedon with vectors, 190 
polyhedron, 153 
prism, 153 
pyramid, 154 
rectangular parallelepiped, 153 
sphere, 158 
subset, 693 
tetrahedron, 217 
torus, 159 
triple integral, 532 
wedge, 155 
volume derivative, 710 
volume integral, 527 
volume scale, 116 


Walsh 
functions, 804 
systems, 804 
wave 
non-linear, 604 
parameter, 84 
plane, 801 
expansion in terms of spherical functions, 565 
wave equation 
n-dimensional, 590 
one-dimensional, 792 
Schroedinger equation, 593 
wave function 
classical problem, 590 
expasion of eigenfunctions, 595 
heat-conduction equation, 592 
Schroedinger equation, 592 
statistical interpretation, 594 
wave-particle duality, 595 
wavelet, 801 
Daubechies, 802 
Haar, 801 
Mexican hat, 801 
orthogonal, 802 


transformation, 800, 801 
discrete, 803 
discrete, Haar, 803 
dyadic, 802 
fast, 803 
Weber 
differential equation, 601 
function, 562 
wedge, 155 
Weibull distribution, 821 
Weierstrass 
approximation theorem, 665 
criterion, uniform convergence, 468 
form of Euler Differential equation, 616 
function, 765 
theorem, 468 
one variable, 61 
several variables, 125 
weight 
measurement, 854 
of orthogonality, 570 
statistical, 813 
weighting factor, statistical, 850 
Whitney, theorem, 890 
witch of Agnesi, 95 
word, 662 
work (mechanics) 
general, 522 
special, 504 
Wronskian determinant, 553, 862 


Young scheme, 345 


zenith, 144 
distance, 144 

zero divisor, 361 

zero matrix, 269 

zero point, 1 

zero-point translational energy, 598 

zero-point vibration energy, 603 

Z-transformation, 794 
applications, 798 
convolution, 796 
damping, 796 
definition, 794 
difference, 796 
differentiation, 796 
integration, 796 
inverse, 797 
original sequence, 794 
rules of calculation, 795 
summation, 795 
transform, 794 
translation, 795 

Z-transformable, 794 





MATHEMATICAL SYMBOLS 


Relational Symbols 


ll 


equal to 7 approximately equal to 
identically equal to « less than 
— equal to by definition > greater than 


less than or equal to 


greater than or equal to 
unequal to, different from 


l>% IV IA 


« much less than 
< partial order relation 


>> much greater than corresponding to 


> partial order relation 


Greek Alphabet 


Αα Alpha ΒΒ Beta Cy Gamma Aó Delta Ec Epsilon ZG Zeta 
Hm Eta 000 Theta nn Iota Kk Kappa AA Lambda My Μα 
Nv Nu EE Xi Oo Omicron In Pi Pp Rho Xo Sigma 
Tr Tau Yu Upsilon p Phi Xx Chi Vy Psi Nw Omega 
Constants 
const constant amount (constant) C = 0.57722... Euler constant 
T — 3.14159... ratio of the perimeter of the circle to e = 2.71828... base of the natural logarithms 
the diameter 
Algebra 

A,B propositions aA, A negation of the proposition A 
AABN conjunction, logical AND AV B,U  disjunction, logical OR 
AB implication, IF A THEN B AeB equivalence, A IF AND ONLY IF B 
A, B, C... sets N set of natural numbers 
A closure of the set A or complement of Z set of the integers 

A with respect to a universal set Q set of the rational numbers 
ACB A is a proper subset of B. R set of the real numbers 
ACB A is a subset of B. R4 set of the positive real numbers 
A\B difference of two sets R” n-dimensional Euclidean vector space 
AAB symmetric difference ο set of the complex numbers 
AxB Cartesian product RoS relation product 
TEA x is an element of A zéA x is not an element of A 
card A cardinal number of the set A 0 empty set, zero set 
ANB intersection of two sets Mi Ai intersection of n sets A; 
AUB union of two sets Ui union of n sets A; 
Vr for all elements x zu there exists an element x 
[rv € X :p(x)) subset of all z from X {x:p(x)}, set of all x with the 

of the property p(x) {a|p(x)} property p(x) 
T:X—Y mapping T' from the space X = isomorphy of groups 

into the space Y ~R equivalence relation 
Θ residue class addition © residue class multiplication 
H = Hı § Hə orthogonal decomposition of space H A & B Kronecker product 
supp support iff if and only if 
supM supremum: least upper bound of the non-empty set M (M C R) bounded above 
inf M infimum: greatest lower bound of the non-empty set M (M C R) bounded below 
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[a,b] closed interval, i.e., {rER:a<x< b} 
(a,b),]a,b| open interval, i.e., [re R:a «zr « b) 
(a,b],]a,b] interval open from left, i.e, {te R:a<a<b} 
[a, 6), [a,b[ interval open from right, ie, {2 €R:a< x <b} 
signa sign of the number a, e.g., sign (+3) = +1, sign0 = 0 
la| absolute value of the number a 

la] integer, greater or equal to a (compare [r], p. C) 

a™ a to the power m, a to the m-th 


να 
Va 


square root of a 
n-th root of a 


log,a logarithm of the number a to the base b, e.g., log; 32 — 5 
loga decimal logarithm (base 10) of the number a, e.g., lg 100 = 2 
lna natural logarithm (base e) of the number a, e.g., Ine = 1 
a |b a is a divisor of b, a devides b, the ratio of a to b 
a fb a is not a divisor of b 
a = bmod m, a = b(m) a is congruent to b modulo m, i.e., b — a is divisible by m 
g.C.d.(a1, a2, ...,à5) greatest common divisor of a1, a2, . . . , an 
κο μμ an) least common multiple of a1,a2,..., à; 
(2 binomial coefficient, n choose k 
a 
(7) Legendre symbol 
m!=1-2-3-...-n factorial, e.g., 6! = 1-2-3-4-5-6 — 720; specially: 0! =1!=1 
(2n)!! = 2-4-6-...-(2n) =2"-n!; in particular: 0! = 1!! = 1 


(2n 4-1)! 2 1-3.5- 


a7 (n4 T) 


A = (aij) matrix A with elements aj; AT transposed matrix 

ΑΓ inverse matrix AH adjoint matrix 

E= (6) unit matrix 0 zero matrix 

rank rank of a matrix trace trace of a matrix 

det A, D determinant of the square matrix A 

δὲς Kronecker symbol: 6;; = 0 for i # j and 6;; = 1 for i = j 
a column vector in R” 

a? unit vector in the direction of (parallel to) a 

lall norm of a 

à b,c vectors in IR? 

ij k &,,6,,€, basis vectors (orthonormed) of the Cartesian coordinate system 
Gy, Ay, Az coordinates (components) of the vector & 

ja| absolute value, length of the vector a 

aa multiplication of a vector by a scalar 


scalar product, dot product 
vector product, cross product 
parallelepipedal product, mixed product (triple scalar product) 


zero vector 


tensor 
graph with the set of vertices V and the set of edges E 
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Geometry 
E orthogonal (perpendicular) | parallel 
equal and parallel ~ similar, e.g., AABC ~ ADEF; pro- 
portional 
A triangle x angle, e.g., ABC 
τ arc segment, e.g., AB the arc between A and B rad radian 
5 degree 
1 minute » as measure of angle and circular arc, e.g., 32? 14' 11.5" 
á second 
gon measure in grades (360° = 400 gon, see p. 131 and table3.5, p. 146) 
AB the line segment between A and B 
— 
AB the directed line segment from A to B, the vector from A to B 
Complex Numbers 
i (sometimes j) imaginary unit (i? = —1) 1 imaginary unit in computer algebra 
Re(z) real part of the number z Im (z) imaginary part of the number z 
E absolute value of z argz argument of the number z 
Z or z* complex conjugate of z, e.g., z = 2+ 3i, Lnz logarithm (natural) of a complex num- 
z—2—3 ber z 
Trigonometric Functions, Hyperbolic Functions 
sin sine cos cosine 
tan tangent cot cotangent 
sec secant cosec cosecant 
arcsin principal value of arc sine (sine inverse) arccos principal value of arc cosine (cosine inverse) 
arctan principal value of arc tangent (tangent inverse) arccot principal value of arc cotangent (cotangent in- 
verse) 
arcsec principal value of arc secant (secant inverse) arccosec principal value of arc cosecant (cosecant in- 
verse) 
sinh hyperbolic sine cosh hyperbolic cosine 
tanh hyperbolic tangent coth hyperbolic cotangent 
sech hyperbolic secant cosech hyperbolic cosecant 
Arsinh area-hyperbolic sine Arcosh area-hyperbolic cosine 
Artanh  area-hyperbolic tangent Arcoth area-hyperbolic cotangent 
Arsech —area-hyperbolic secant Arcosech area-hyperbolic cosecant 
Analysis 
lim tp, =A A is the limit of the sequence (ή). We also write £n > A as n — oo; 
noo 
; ayn _ 
e.g. lim (1+ x) =e 
lim f(z) 2 B B is the limit of the function f(x) as x tends to a 
ra 
[x] = int(x) (entier(x)) greatest integer less or equal to x (compare p. B) 
f —o(g)forz —a Landau symbol “small o” means: f(x)/g(x) — 0 as  — a 
f — O(g) for z —^a Landau symbol “big O” means: f(x)/g(x) —^ C (C — const, C Z0) asz —a 
n 
ο M sum of n terms for i equals 1 to n 
i=1 
' n 
11: 115: product of n factors for i equals 1 to n 
i=1 
Fpl) notation for a function, e.g., y = f(x), u = (x,y,z) 
A difference or increment, e.g., A x (delta £) 
d differential, e.g., dx (differential of x) 
d d d^ er αὶ e i 
— determination of the first, second, ..., n-th derivative with respect to x 


da’ dx?’ dan 


D Mathematical Symbols 





(2), f^ (2), f" (x), 
fO (z),.... f? (v) 


or 
jd... 9 0 
0 0 2? 
Ox’ Oy’ Oa?” 
82 
Oxdy 


Fus fus Fus Fus Fus 


D 
grad 
div 
rot 
a τι ὃν ὃν 
πο UR ta 
9? o? o 
A-aa* JA t JA 
op 
98 
b 
ns 
f(x,y, 2). 
©) 


ΠΠ 


ΠΟΙ 


(5) (5) 


ο ο. ο 
(5) (5) 


f (eve - fff fey ana: 


(V) (V) 


pne 
fre. --- νο 

νο )xd$- divos 
(S) 


(5) 


A = max! 


A = max 


first, second,... , 


n-th derivative of the function f(x) or of the function y 


determination of the first, second, ..., n-th partial derivative 


determination of the second partial derivative first with respect to x, 
then with respect to y 


first, second, ... partial derivative of function f(x, y) 


differential operator, e.g., Dy — y', D?y — y" 
gradient of a scalar field (grad y = Vy) 
divergence of a vector field (div v — V - V) 


rotation or curl of a vector field (rot V — V x v) 


nabla operator, here in Cartesian coordinates (also called the 
Hamiltonian differential operator, not to be confused with the Hamil- 
ton operator in quantum mechanics) 


Laplace operator 


directional derivative, i.e., derivative of a 


σ΄ =a-grady 


scalar field y into the direction a: 


definite integral of the function f between the limits a and b 


line integral of the first kind with respect to the space curve C with 
arclength s 


integral along a closed curve (circulatory integral) 


double integral over a planar region S 


surface integral of the first kind over a spatial surface S (see (8.151b), 


p. 534) 


triple integral or volume integral over the volume V 


surface integrals over a closed surface in vector analysis 


expression A is to be maximized, similarly min!, extreme! 


expression A is maximal, similarly min, extreme. 


